提交 · 3bf8a34c69a5e8a859d75699462e6e66ef27ddce · BaiXuePrincess / Paddle

12 3月, 2021 1 次提交

[NPU] Support npu kernel for amp_check_finite_and_unscale_npu op (#31457) · 3bf8a34c

由 xiayanming 提交于 3月 12, 2021

* Support npu kernel for amp_check_finite_and_unscale_npu op

* support EnforceNotMet exception

* fix exception bug

* modify python unittest

* precommit

* update c++ unittest

* fix review

* fix review

3bf8a34c

08 1月, 2021 1 次提交

Support pure fp16 training for AMP API. (#29544) · 7f7dfccf

由 Zhen Wang 提交于 1月 08, 2021

* add cast ops before and after unsupported fp16 ops.

* Keep partial net in FP32 pattern.

* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.

* Add fp16 support for adam op.

* add multi precision attr for adam.

* Fix the bug of test_multi_precision_fp16_train UT.

* Code format for CI.

* Fix the redefine error about MPTypeTrait on windows.

* fix bugs of the _create_accumulators func in Momentum.

* fix bug when inserting post cast op.

* Add the update_loss_scaling op in allow_set of UnusedVarCheck.

* Update for ci coverage.

* Add some doc for OptimizerWithMixedPrecision.

* Fix the code style.

* Imporve the doc of `amp_init`.

* Change for fp16 testing if users have the infer program defined in separate way.

7f7dfccf

10 12月, 2020 1 次提交
- Z
  Remove tensor copy in the update_loss_scaling op. (#29426) · 5ac71b36
  由 Zhen Wang 提交于 12月 10, 2020
```
* remove tensor copy in the update_loss_scaling op

* not use thrust.

* fix some cuda memory access error.
```
  5ac71b36
07 12月, 2020 1 次提交

Compiling operator libraries with Unity build (#29130) · 671555ed

由 LoveAn 提交于 12月 07, 2020

* Compiling operator libraries with Unity Build on Windows CPU.

* Compiling operator libraries with Unity Build on Windows GPU, no_test, test=windows_ci

* Add option in windows ci script, no_test, test=windows_ci

* Optimize parallel compiling, test=develop

* remove limit of parallel compile and skip some ops in UB, test=develop

* remove changes of header file, test=develop

* remove changes of header file, test=develop

* fix test_eye_op unittest failed, test=develop

* Compiling operator libraries with Unity Build on Linux, test=develop

* set default WITH_UNITY_BUILD=OFF, test=develop

* Move unity build rules into a single file and add comment, test=develop

* optimize parallel compilation, test=develop

* fix undefined reference error on coverage ci, test=develop

671555ed

30 11月, 2020 1 次提交
- W
  
  optimizer amp, all use fp16 communication, overlap last comm and compute (#28957) · 0c2a51d2
  由 WangXi 提交于 11月 30, 2020
  
  0c2a51d2
09 10月, 2020 1 次提交
- Z
  
  fix error message showing in UpdateLossScalingOp (#27596) · 365c2c9c
  由 Zhen Wang 提交于 10月 09, 2020
  
  365c2c9c
14 9月, 2020 1 次提交

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for... · d708b210

由 Zhen Wang 提交于 9月 14, 2020

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)

* update amp_check_finite_and_scale_op for static_amp.

* use amp_check_finite_and_scale in static graph amp.

* update grads to zero when grads own infinite values(as for amp_checkout_finite_and_scale op).

* add update_loss_scaling op in cpp.

* add update_loss_scaling_op unit test.

* update the doc of the check_finite_and_unscale op

* Update the process of gradients updating skipping if the gradients have infinite values.

* update the way to zero grads.

* update test_update_loss_scaling_op.py

* add log info when find infinite grads.

* add the unit test for UpdateLossScaling Layer.

d708b210

13 8月, 2020 1 次提交

Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903) · 2d95280e

由 Leo Chen 提交于 8月 13, 2020

* add auto_cast, test=develop

* add loss scaler, test=develop

* add comments, test=develop

* refine code, test=develop

* refine code, test=develop

* do not set flags automatically, test=develop

* fix custom op bug, test=develop

* add more test, test=develop

* refine enable logic, test=develop

* enable amp test with GPU, test=develop

* add unittest

* add test for found_inf

* follow comments

* follow comments

* remove global variable, use singleton

* add some notes

* update comments

* update comments

* update comments

* add use_dynamic_loss_scaling argument

* refine found_inf

* refine found_inf

2d95280e

05 6月, 2020 1 次提交

Fix/isfinite on windows (#24927) · a7cb97a1

由 Leo Chen 提交于 6月 05, 2020

* refine isfinite, test=develop

* use namespace std of isfinite, test=develop, test=win_gpu

a7cb97a1

04 6月, 2020 1 次提交

Feature/add amp_checkout_finite_and_scale op (#24875) · 1e818158

由 Leo Chen 提交于 6月 04, 2020

* add amp_check_finite_and_scale op, test=develop

* add cpu kernel, test=develop

* use bool, test=develop

* follow comments, test=develop

1e818158

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致