提交 · cbe64cc1cfc6d7883ce1eadebf365273bd24e352 · PaddlePaddle / Paddle

26 8月, 2022 1 次提交
- W
  
  [Eager] delete final state pre-name (#45306) · 126940b3
  由 wanghuancoder 提交于 8月 26, 2022
  
  126940b3
03 8月, 2022 1 次提交
- S
  Add use_hierarchical_allreduce for DistributedFusedLAMB (#44821) · c770053c
  由 sneaxiy 提交于 8月 03, 2022
```
* add use_hierarchical_allreduce

* support hierarchical allreduce for more cases
```
  c770053c
27 7月, 2022 1 次提交
- W
  Phi average accumulates migration (#44554) · eafd4280
  由 Wang Bojun 提交于 7月 27, 2022
```
* move average_accumulates op to phi kernel
```
  eafd4280
09 6月, 2022 1 次提交

Add nproc_per_node for DistributedFusedLamb (#43295) · 6678def9

由 sneaxiy 提交于 6月 09, 2022

* add nproc_per_node for DistributedFusedLamb

* fix nproc_per_node communicator bug

* fix ring_id = 1 init bug

* fix ci

* fix test_parallel_executor_mnist.py

6678def9

07 6月, 2022 1 次提交
- S
  Add use_master_acc_grad for DistributedFusedLamb (#43266) · 601d7a35
  由 sneaxiy 提交于 6月 07, 2022
```
* add use_master_acc_grad

* add ut
```
  601d7a35
05 6月, 2022 1 次提交

【code format check upgrade】 step2：yapf (#42944) · a072fca8

由 Sing_chan 提交于 6月 05, 2022

* use yapf to format all python file

* yapf exclude two unittests file for they rely on writing and reading file, and format will break them

* disable diff_py_file because too many diff files cause command following failed

a072fca8

10 5月, 2022 1 次提交

improve introduction of bfgs args (#42191) · 000edfd2

由 Sing_chan 提交于 5月 10, 2022

* improve introduction of bfgs args; test=document_fix

* modify according to zhouwei's comment; test=document_fix

000edfd2

28 4月, 2022 1 次提交

Add gradient merge for DistributedFusedLamb optimizer (#40177) · 108aeb28

由 sneaxiy 提交于 4月 28, 2022

* add gradient merge for DistributedFusedLamb

* use master acc gradient

* fix CI ut

* polish

* remove math_function_impl.h change

* fix test_update_loss_scaling_op.py

* try to fix XPU/NPU CI

* add gm ut

108aeb28

14 4月, 2022 1 次提交

fix bfgs_doc (#41505) · 7f73ef2c

由 Sing_chan 提交于 4月 14, 2022

* fix bfgs_doc; test=document_fix

* add parameter name; test=document_fix

* modify according to chenlong's comments;test=document_fix

7f73ef2c

08 4月, 2022 1 次提交
- S
  Fix cv2 import error and some issues for lamb (#41500) · 1ed1a97b
  由 sneaxiy 提交于 4月 08, 2022
```
* fix image cv2 import

* fix lamb
```
  1ed1a97b
07 4月, 2022 1 次提交
- S
  Add Output(Step) to DistributedFusedLamb optimizer (#41249) · e4459a40
  由 sneaxiy 提交于 4月 07, 2022
```
* add Output(Step) to distributed fused lamb op

* add _set_step
```
  e4459a40
04 4月, 2022 1 次提交
- S
  cut off relation between xk and initial_position's graph (#41371) · afb56e8c
  由 Sing_chan 提交于 4月 04, 2022
```
* cut off relation between xk and initial_position's graph

* fix_bug

* add detach to cut off with original graph
```
  afb56e8c
01 4月, 2022 2 次提交

change vjp to paddle.grad (#41231) · 34241dd1

由 Sing_chan 提交于 4月 01, 2022

* change vjp to paddle.grad

* use grad and gradients api

* fix preprocess for x

* fix a bug, val_and_grad should return a Tensor

* detach value and grad to avoid assign error
Co-authored-by: Nlevi131 <limaolin01@baidu.com>

34241dd1

S

fix bug of bfgs example code;test=document_fix (#41195) · db948373
由 Sing_chan 提交于 4月 01, 2022

db948373

31 3月, 2022 1 次提交

[New API]: miminize_bfgs and miminize_lbfgs (#40710) · e7928a06

由 Sing_chan 提交于 3月 31, 2022

* [New API]: miminize_bfgs and miminize_lbfgs

* modify for python module call correctly

* add functional package, add error raise in static_graph, change assign to set_value

* unify static_graph and dygraph, fix bug when x or H0 is float64

* now only accept input is tensor, put check args in utils.py, put exception test together

* temp

* add more detailed algorithm illustration and comment, reduce test case to limit test time in 15s

* change in_dygraph_mode to in_dynamic_mode

* fix bug of sample code; reduce test case to reduce test time

* change dir to incubate

e7928a06

25 3月, 2022 1 次提交

Refactor Dygraph Flags (#40786) · 3085d5e4

由 Jiabin Yang 提交于 3月 25, 2022

* refactor eager flags

* fix flags error when we switch from eager to dygraph

* fix ci problem

* fix ci

* fix ci

* merge develop and fix code style

* merge develop and fix code style

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* merge develop

3085d5e4

01 3月, 2022 1 次提交
- S
  Optimize the CUDA kernel in DistributedFusedLamb optimizer (#39972) · d17961ed
  由 sneaxiy 提交于 3月 01, 2022
```
* vectorize lamb kernel

* remove flags, add ut

* remove useless codes

* refine code, add param order
```
  d17961ed
25 2月, 2022 1 次提交

Add MultiTensorApply to calculate L2-Norm in DistributedFusedLamb optimizer (#39900) · d32a0102

由 sneaxiy 提交于 2月 25, 2022

* add multi tensor apply l2 norm

* add multi_tensor_apply code

* make sizeof(TensorMeta) smalller

* move code to distributed_fused_lamb_op.cu

* remove useless FLAGS

d32a0102

19 2月, 2022 1 次提交

Add the DistributedFusedLamb optimizer (#39148) · 5df3cd61

由 sneaxiy 提交于 2月 19, 2022

* add DistributedFusedLamb op

* polish code

* fix compile error

* compatible with pten changement

* fix rocm compile error

* improve converage

* update upstream/develop

* fix cast_with_ptr.h

* add FLAGS_distributed_lamb_divide_nranks_when_allreduce=1

* fix clip before allreduce

* add use_master_param_norm

* code polish

* fix bug

* fix ROCM ci

5df3cd61

15 7月, 2021 1 次提交
- W
  cache core.ops (#34058) · f05098b5
  由 wanghuancoder 提交于 7月 15, 2021
```
* cache core.ops, test=develop

* refine, test=develop
```
  f05098b5
11 6月, 2021 1 次提交
- Z
  update 2.0 public api in all left files (#33313) · 022198c5
  由 zhiboniu 提交于 6月 11, 2021
```
* update 2.0 public api in all left files

* reverse device.py all list;
fix some flake8 errors
```
  022198c5
25 1月, 2021 1 次提交
- 1
  test=develop, fix test_lookahead (#30677) · 06a3e311
  由 123malin 提交于 1月 25, 2021
```
* test=develop, fix test_lookahead
```
  06a3e311
07 1月, 2021 1 次提交
- 1
  Add Lookahead and ModelAverage Optimizer (#30004) · 198fbdfb
  由 123malin 提交于 1月 07, 2021
```
* test=develop, add model_average and lookahead
```
  198fbdfb

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功