- 06 Jul 2023, 1 commit

  Committed by Zhang Ting
- 09 Jun 2023, 1 commit

  Committed by Nyakku Shigure
  * bump ruff to 0.0.271 and update config
  * exclude third_party
  * bump ruff to 0.0.272
  * refine config

- 22 May 2023, 1 commit

  Committed by niuliling123
- 18 May 2023, 1 commit

  Committed by shaojie_wang
  * add master gradients on static graph
  * add unit test for bf16 master grad on static graph
  * use float16 as the V100 test dtype
  * only skip GPUs which do not support bf16
  * use a linear layer to test master grad
  * 1. push master grad creation before all optimizer ops; 2. remove useless unittest; 3. use a function to create master grad states
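The bf16 master-gradient idea in this commit can be illustrated with a minimal, self-contained sketch (not Paddle's implementation; `to_bf16` is a hand-rolled bfloat16 emulation via mantissa truncation): when the accumulator itself is bf16, small per-step gradients eventually stop contributing, while an fp32 master buffer keeps accumulating them.

```python
import struct

def to_bf16(x: float) -> float:
    """Emulate bfloat16 by truncating the fp32 mantissa to 7 bits."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFF0000))[0]

def accumulate(grads, use_master_grad):
    """Sum per-step gradients, either in bf16 or in an fp32 master buffer."""
    total = 0.0
    for g in grads:
        g = to_bf16(g)                      # gradients are produced in bf16
        if use_master_grad:
            total += g                      # fp32 master gradient state
        else:
            total = to_bf16(total + g)      # accumulator also kept in bf16
    return total

grads = [1e-3] * 10000
naive = accumulate(grads, use_master_grad=False)   # stalls once the sum is large
master = accumulate(grads, use_master_grad=True)   # stays close to 10.0
```

The bf16-only accumulator stalls once its magnitude makes one ulp larger than the per-step gradient, which is exactly the failure mode a master-grad state avoids.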
- 16 May 2023, 1 commit

  Committed by niuliling123

- 11 May 2023, 1 commit

  Committed by 张春乔
- 24 Apr 2023, 2 commits

  Committed by Yiqun Liu
  [AMP] Allow enabling multi_precision through paddle.static.amp.decorate and add documents for some APIs. (#53012)
  * Add documents for some APIs. test=docs_preview
  * Allow setting master_weight in paddle.static.amp.decorate.
  * Polish code and add unittest.
  * Refine docs.
  * Remove the repetitive function.
  Committed by Zhang Ting
  * support promote dtype for static amp training
  * unify O1 and O2
  * update for unittest
  * fix op_role
  * add use_promote arg
  * fix doc
  * add promote unittest
  * polish unittests
  * fix control flow and test
- 18 Apr 2023, 1 commit

  Committed by Yiqun Liu
  * Implement a common AmpTestBase.
  * Support overload of decorate.
  * Change the ignore list of flake8 and fix an error.
- 14 Apr 2023, 1 commit

  Committed by Yiqun Liu
  * Unify the static amp codes of fp16 and bf16.
  * Polish APIs and add unittest.
  * Add operator stats collecting tools for program.
  * Add a check of the number of bfloat16 operators in unittest.
  * Add a warning for operators not supported by amp.
  * Add testing of BF16 O1 and O2.
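A minimal sketch of an operator-stats collector of the kind this commit describes; the `(op_type, dtype)` input shape and the `collect_op_stats` name are assumptions for illustration, not Paddle's actual tool:

```python
from collections import Counter

def collect_op_stats(ops):
    """Group ops by type and count how many instances run in each dtype."""
    stats = {}
    for op_type, dtype in ops:
        stats.setdefault(op_type, Counter())[dtype] += 1
    return stats

# A toy "program" as a flat list of (op_type, dtype) pairs.
program = [("matmul", "bfloat16"), ("matmul", "float32"), ("relu", "bfloat16")]
stats = collect_op_stats(program)
num_bf16 = sum(c["bfloat16"] for c in stats.values())
```

A unittest can then assert on `num_bf16`, which is the kind of check the commit message mentions.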
- 06 Apr 2023, 1 commit

  Committed by Kim Yann
  * remove is_compiled_with_npu
  * remove npu-related code
  * make lint happy
  * remove a test
  * remove some tests
  * Update grad_scaler.py
  * fix an error
- 14 Feb 2023, 1 commit

  Committed by mhy-666

- 17 Jan 2023, 1 commit

  Committed by zhangkaihuo

- 12 Jan 2023, 1 commit

  Committed by zhangkaihuo

- 09 Dec 2022, 1 commit

  Committed by cyber-pioneer

- 02 Dec 2022, 1 commit

  Committed by heyanru
- 08 Nov 2022, 1 commit

  Committed by Nyakku Shigure
  * [CodeStyle][py2][U004] unnecessary explicit `object` inheritance in class definition
  * fix an increment

- 23 Oct 2022, 1 commit

  Committed by Nyakku Shigure
  * update config
  * re-blacken python code
  * temporarily disable date and diff_py_file
  * skip a format
- 14 Sep 2022, 1 commit

  Committed by Nyakku Shigure
  * trim trailing whitespace
  * fix `.cmake-format.py`
  * revert npu ut changes to avoid npu ci errors
- 05 Jun 2022, 1 commit

  Committed by Sing_chan
  * use yapf to format all python files
  * exclude two unittest files from yapf because they rely on writing and reading files, and formatting would break them
  * disable diff_py_file because too many diff files caused the following command to fail
- 28 Apr 2022, 1 commit

  Committed by sneaxiy
  * add gradient merge for DistributedFusedLamb
  * use master acc gradient
  * fix CI ut
  * polish
  * remove math_function_impl.h change
  * fix test_update_loss_scaling_op.py
  * try to fix XPU/NPU CI
  * add gm ut
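Gradient merge can be sketched independently of DistributedFusedLamb: accumulate micro-batch gradients in a buffer and apply the optimizer update only every k steps. Whether the merged gradient is averaged or summed is a design choice not stated in the log; this sketch averages.

```python
class GradientMergeOptimizer:
    """Accumulate gradients for k micro-steps, then apply one update (sketch)."""

    def __init__(self, lr, k_steps):
        self.lr = lr
        self.k = k_steps
        self.acc = 0.0        # fp32 accumulation buffer
        self.step_id = 0

    def step(self, param, grad):
        self.acc += grad
        self.step_id += 1
        if self.step_id % self.k == 0:
            param -= self.lr * (self.acc / self.k)   # apply the averaged grad
            self.acc = 0.0                            # reset for the next window
        return param
```

With k_steps=4 the parameter is untouched for three calls and updated on the fourth, which is what lets a large effective batch fit in fixed memory.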
- 19 Feb 2022, 1 commit

  Committed by sneaxiy
  * add DistributedFusedLamb op
  * polish code
  * fix compile error
  * compatible with pten changes
  * fix rocm compile error
  * improve coverage
  * update upstream/develop
  * fix cast_with_ptr.h
  * add FLAGS_distributed_lamb_divide_nranks_when_allreduce=1
  * fix clip before allreduce
  * add use_master_param_norm
  * code polish
  * fix bug
  * fix ROCM ci

- 17 Dec 2021, 1 commit

  Committed by sneaxiy
  * support multi precision update for LAMB
  * hide some APIs
  * fix ci uts
  * fix lamb output of dygraph
  * remove some changes from some PRs
  * try to fix Py3 CI compile error
  * fix test_imperative_optimizer, add lars ut, add layer_norm ut
  * fix ut, fix format
  * fix ut
  * fix windows ci
- 17 Aug 2021, 1 commit

  Committed by Roc

- 22 Jul 2021, 1 commit

  Committed by Leo Chen
  * copy found_inf to cpu in advance to improve performance
  * add npu test
  * refine code
  * refine memcpy op
  * fix adam
- 19 Jul 2021, 1 commit

  Committed by Leo Chen
  * pass found_inf to adam
  * add unittest
  * fix bug
  * refine unittest
  * change the unit test's directory
  * disable the unittest on cpu

- 16 Jul 2021, 1 commit

  Committed by Leo Chen
  * add clear_float_status op
  * refine infershape
  * fix typo
  * refine check_finite_and_scale
  * refine code
- 10 Jun 2021, 1 commit

  Committed by Baibaifan

- 23 Apr 2021, 1 commit

  Committed by Leo Chen
  * refactor check_finite_and_scale npu kernel
  * fix compile
  * add alloc_float_status op
  * add FloatStatus for check_finite_and_unscale
  * refine code
  * remove unnecessary logic
  * refine for fleet
- 22 Apr 2021, 1 commit

  Committed by Yuang Liu

- 21 Apr 2021, 1 commit

  Committed by Yuang Liu

- 13 Jan 2021, 1 commit

  Committed by huangxu96
- 08 Jan 2021, 1 commit

  Committed by Zhen Wang
  * Add cast ops before and after unsupported fp16 ops.
  * Keep a partial net in the FP32 pattern.
  * Support check_finite_and_unscale and update_loss_scaling for the FP16 calculation mode.
  * Add fp16 support for the adam op.
  * Add a multi precision attr for adam.
  * Fix the bug of the test_multi_precision_fp16_train UT.
  * Code format for CI.
  * Fix the redefinition error about MPTypeTrait on windows.
  * Fix bugs of the _create_accumulators func in Momentum.
  * Fix a bug when inserting a post cast op.
  * Add the update_loss_scaling op to the allow set of UnusedVarCheck.
  * Update for ci coverage.
  * Add some docs for OptimizerWithMixedPrecision.
  * Fix the code style.
  * Improve the doc of `amp_init`.
  * Change fp16 testing for users who have the infer program defined separately.
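The multi-precision ("master weight") attribute added for adam here rests on a simple observation: a weight stored only in fp16 silently drops updates smaller than its local ulp, while an fp32 master copy accumulates them and re-derives the fp16 view each step. A minimal sketch using plain SGD-style updates and stdlib fp16 emulation, not Paddle's adam kernel:

```python
import struct

def to_fp16(x: float) -> float:
    """Round a Python float to IEEE half precision via struct's 'e' format."""
    return struct.unpack("<e", struct.pack("<e", x))[0]

lr_times_grad = 1e-4       # one update, smaller than fp16's ulp near 1.0
w_fp16_only = 1.0          # weight stored only in fp16: updates vanish
master = 1.0               # fp32 master weight: updates accumulate
for _ in range(100):
    w_fp16_only = to_fp16(w_fp16_only - lr_times_grad)
    master = master - lr_times_grad
w_for_forward = to_fp16(master)   # fp16 view used by the next forward pass
```

After 100 steps the fp16-only weight has not moved at all, while the master weight has absorbed the full 0.01 of updates.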
- 05 Jan 2021, 1 commit

  Committed by WangXi

- 09 Dec 2020, 1 commit

  Committed by Aurelius84

- 30 Nov 2020, 1 commit

  Committed by WangXi

- 12 Oct 2020, 1 commit

  Committed by WangXi
- 14 Sep 2020, 1 commit

  Committed by Zhen Wang
  Update amp_check_finite_and_scale_op and add an update_loss_scaling op for static graph amp training. (#26240)
  * update amp_check_finite_and_scale_op for static amp
  * use amp_check_finite_and_scale in static graph amp
  * update grads to zero when grads contain infinite values (as for the amp_check_finite_and_scale op)
  * add the update_loss_scaling op in cpp
  * add an update_loss_scaling_op unit test
  * update the doc of the check_finite_and_unscale op
  * update the process of skipping gradient updates when the gradients have infinite values
  * update the way to zero grads
  * update test_update_loss_scaling_op.py
  * add log info when finding infinite grads
  * add the unit test for the UpdateLossScaling layer
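The pair of ops this commit describes can be sketched in a few lines: one unscales gradients and detects inf/nan, the other adapts the loss scale from that signal. The threshold and ratio defaults below are illustrative rather than Paddle's actual defaults, and the floor of 1.0 on the scale is an assumption.

```python
import math

def check_finite_and_unscale(grads, loss_scaling):
    """Divide grads by the scale and report whether any value is non-finite."""
    unscaled = [g / loss_scaling for g in grads]
    found_inf = any(not math.isfinite(g) for g in unscaled)
    return unscaled, found_inf

def update_loss_scaling(scale, good_steps, found_inf,
                        incr_every_n=1000, incr_ratio=2.0, decr_ratio=0.5):
    """Grow the scale after N consecutive finite steps; shrink it on inf/nan."""
    if found_inf:
        return max(scale * decr_ratio, 1.0), 0    # shrink and reset the counter
    good_steps += 1
    if good_steps >= incr_every_n:
        return scale * incr_ratio, 0              # grow after a clean streak
    return scale, good_steps
```

When `found_inf` is set, the optimizer step is skipped (gradients are zeroed, as the commit message notes) and only the scale update runs.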
- 08 Jan 2020, 1 commit

  Committed by gongweibao

- 26 Nov 2019, 1 commit

  Committed by Zhen Wang
  * fix some typos in AMP. test=develop
  * delete useless code. test=develop