提交 · 0b4a7f1aa3ee54bc32c4bd82f4640f09dfa36b70 · PaddlePaddle / Paddle

10 6月, 2021 1 次提交
- B
  
  dp c_allreduce_sum_fusion op (#33169) · 003b4616
  由 Baibaifan 提交于 6月 10, 2021
  
  003b4616
26 5月, 2021 1 次提交
- J
  
  [Tensor Parallelism] split fix bug (#33015) · 20b9be65
  由 JZ-LIANG 提交于 5月 26, 2021
  
  20b9be65
07 5月, 2021 1 次提交
- J
  Mechanism that converts startup_program initializers to BF16 (#32720) · ce2bdb0a
  由 joanna.wozna.intel 提交于 5月 07, 2021
```
* Add casting initializers for bf16 training

* Changes after review

* Correct test and add comment
```
  ce2bdb0a
28 4月, 2021 1 次提交
- A
  
  Added pure_bf16 mode (#32281) · bc379ca3
  由 arlesniak 提交于 4月 28, 2021
  
  bc379ca3
23 4月, 2021 1 次提交

[NPU] refactor check_finite_and_scale npu kernel (#32407) · 39a59dcf

由 Leo Chen 提交于 4月 23, 2021

* refactor_check_finite_and_scale_npu_kernel

* fix compile

* add alloc_float_status op

* add alloc_float_status op

* add FloatStatus for check_finite_and_unscale

* refine code

* remove unneccessary logic

* refine for fleet

39a59dcf

22 4月, 2021 1 次提交
- Y
  
  Add fleet get_loss_scaling doc and update alert message (#32419) · d03b0b16
  由 Yuang Liu 提交于 4月 22, 2021
  
  d03b0b16
21 4月, 2021 2 次提交
- H
  
  fix bug in amp O2 (#32343) · 4be3b057
  由 huangxu96 提交于 4月 21, 2021
  
  4be3b057
- Y
  
  add get_loss_scaling to fleet (#32401) · 37bb3342
  由 Yuang Liu 提交于 4月 21, 2021
  
  37bb3342
15 4月, 2021 1 次提交
- F
  fix test sync_with_cpp (#32212) · 0c037d2d
  由 fangshuixun007 提交于 4月 15, 2021
```
fix test sync_with_cpp (#32212)
```
  0c037d2d
08 4月, 2021 1 次提交

The unsupported_fp16_list using in AMP will be created automatically during the runtime. (#32102) · 6e65fe02

由 Zhen Wang 提交于 4月 08, 2021

* Use the runtime to create the unsupported_fp16_list using in AMP.

* Add more infos about supported ops.

* Add some comments for the function of OpSupportedInfos.

* Fix the unit test of test_multi_precision_fp16_train.

6e65fe02

26 3月, 2021 1 次提交
- L
  [3D-parallel] Reformat pipeline parallel (#31786) · c3974d0e
  由 lilong12 提交于 3月 26, 2021
```
* update, test=develop
```
  c3974d0e
22 3月, 2021 1 次提交
- A
  
  [oneDNN] Initial bf16 amp integration (#31093) · 7ccf6b60
  由 arlesniak 提交于 3月 22, 2021
  
  7ccf6b60
20 1月, 2021 1 次提交
- H
  Add fleet amp_init() (#30572) · 13862008
  由 huangxu96 提交于 1月 20, 2021
```
* add fleet amp.init()

* add unittest for fleet_amp_init
```
  13862008
13 1月, 2021 1 次提交
- H
  
  add amp example document (#30314) · 342d62de
  由 huangxu96 提交于 1月 13, 2021
  
  342d62de
08 1月, 2021 1 次提交

Support pure fp16 training for AMP API. (#29544) · 7f7dfccf

由 Zhen Wang 提交于 1月 08, 2021

* add cast ops before and after unsupported fp16 ops.

* Keep partial net in FP32 pattern.

* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.

* Add fp16 support for adam op.

* add multi precision attr for adam.

* Fix the bug of test_multi_precision_fp16_train UT.

* Code format for CI.

* Fix the redefine error about MPTypeTrait on windows.

* fix bugs of the _create_accumulators func in Momentum.

* fix bug when inserting post cast op.

* Add the update_loss_scaling op in allow_set of UnusedVarCheck.

* Update for ci coverage.

* Add some doc for OptimizerWithMixedPrecision.

* Fix the code style.

* Imporve the doc of `amp_init`.

* Change for fp16 testing if users have the infer program defined in separate way.

7f7dfccf

05 1月, 2021 1 次提交
- W
  
  [fleet] combine amp and gradient merge, test=develop (#30086) · ab049978
  由 WangXi 提交于 1月 05, 2021
  
  ab049978
15 12月, 2020 1 次提交
- H
  add alias for fluid.contrib.mixed_precision (#29562) · c05170d3
  由 huangxu96 提交于 12月 15, 2020
```
* add alias for fluid.contrib.mixed_precision
```
  c05170d3
09 12月, 2020 1 次提交
- A
  
  fix amp support fleet (#29491) · 5d530c93
  由 Aurelius84 提交于 12月 09, 2020
  
  5d530c93
02 12月, 2020 2 次提交

Add pure fp16 training with master weights. (#27712) · be3777a5

由 Zhen Wang 提交于 12月 02, 2020

* add the weight decay func for the momentum op

* Add the multi_precision function in Momentum Optimizer.

* Make sure that the initial value of master weights are same with the fp16 weights.

* add static loss scaling.

* add the rescale_grad function in the pure fp16 training.

* use the original momentum updating method.

* Polish some codes, such as variable names.

* add docstring for apis.

* update the var creation details of _create_master_weight.

* not modify codes about imperative momentum updating.

* Fix the error of test_dist_sparse_tensor_load_momentum UT.

* add unit test for multi precision fp16 training.

* add more unit tests for CI.

* Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.

* For CI Coverage Checking.

be3777a5

Layer norm fp16 (#29169) · 7584bb50

由 furnace 提交于 12月 02, 2020

* add fp16 for layer_norm op

* revert layernorm api

* fix forward

* fix forward

* fix backward for layernorm with fp16

* fix unit test for layernorm with fp16

* fix with_mkldnn compile error for layernorm with fp16

* 1. revert to PADDLE_ENFORCE_NOT_NULL, 2. change static_cast<float> to static_cast<U>

* fix with_mkldnn compile error for layernorm with fp16

* fix with_mkldnn compile error for layernorm with fp16
Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>

7584bb50

30 11月, 2020 1 次提交
- W
  
  optimizer amp, all use fp16 communication, overlap last comm and compute (#28957) · 0c2a51d2
  由 WangXi 提交于 11月 30, 2020
  
  0c2a51d2
18 11月, 2020 1 次提交
- L
  Add matmtl_v2 to amp list (#28693) · 11e32baf
  由 Leo Chen 提交于 11月 18, 2020
```
* add matmtl_v2 to amp list

* support dygraph
```
  11e32baf
04 11月, 2020 1 次提交
- L
  Skip reader op in mixed_precision decorator (#28353) · 71d62207
  由 Leo Chen 提交于 11月 04, 2020
```
* skip reader op in mixed_precision decorator

* add ut
```
  71d62207
12 10月, 2020 1 次提交
- W
  
  fleet combine amp dgc recompute meta optimizer (#27643) · 0a1862d1
  由 WangXi 提交于 10月 12, 2020
  
  0a1862d1
23 9月, 2020 1 次提交
- Z
  add fuse_bn_act op (#27230) · 906e7f92
  由 Zhang Ting 提交于 9月 23, 2020
```
* add fused_bn_add_relu op
```
  906e7f92
14 9月, 2020 1 次提交

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for... · d708b210

由 Zhen Wang 提交于 9月 14, 2020

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)

* update amp_check_finite_and_scale_op for static_amp.

* use amp_check_finite_and_scale in static graph amp.

* update grads to zero when grads own infinite values(as for amp_checkout_finite_and_scale op).

* add update_loss_scaling op in cpp.

* add update_loss_scaling_op unit test.

* update the doc of the check_finite_and_unscale op

* Update the process of gradients updating skipping if the gradients have infinite values.

* update the way to zero grads.

* update test_update_loss_scaling_op.py

* add log info when find infinite grads.

* add the unit test for UpdateLossScaling Layer.

d708b210

03 9月, 2020 1 次提交
- Z
  
  fix some cast error. (#26884) · bcdbac17
  由 Zhen Wang 提交于 9月 03, 2020
  
  bcdbac17
15 4月, 2020 1 次提交
- M
  fix AMP and recompute (#23551) · f0e743f1
  由 mapingshuo 提交于 4月 15, 2020
```
* allow amp and recompute working together
```
  f0e743f1
08 1月, 2020 1 次提交
- G
  
  fix init scaling value test=develop (#22145) · 5e07db15
  由 gongweibao 提交于 1月 08, 2020
  
  5e07db15
26 11月, 2019 1 次提交
- Z
  Fix some typos in AMP. (#21354) · be2e3e67
  由 Zhen Wang 提交于 11月 26, 2019
```
* fix some typos in AMP. test=develop

* delete useless codes. test=develop
```
  be2e3e67
30 10月, 2019 1 次提交

Add custom black variable name set in amp interface. (#20875) · 3255fe69

由 gongweibao 提交于 10月 30, 2019

* add custom black varname test=develop

* fix dtype test=develop

* fix num test=develop

* fix ut test=develop

* fix coverage test=develop

* fix blackvar names test=develop

3255fe69

15 10月, 2019 1 次提交
- G
  
  Add interface so user can get scaled loss when they use customized loss. (#20571) · 1d82025e
  由 gongweibao 提交于 10月 15, 2019
  
  1d82025e
10 10月, 2019 1 次提交
- G
  
  delete backward return list test=develop (#20294) · 7b9e3397
  由 gongweibao 提交于 10月 10, 2019
  
  7b9e3397
19 9月, 2019 1 次提交
- J
  Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus. (#19714) · d9db94d7
  由 Jie Fang 提交于 9月 19, 2019
```
Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus
```
  d9db94d7
10 9月, 2019 1 次提交
- G
  Fix float16 optimizer. (#19682) · 6c2bc29c
  由 gongweibao 提交于 9月 10, 2019
```
Fix float16 optimizer
```
  6c2bc29c
06 9月, 2019 1 次提交
- J
  init new amp, optimize inserting cast op for batchnorm (#18596) · c6a598a2
  由 Jie Fang 提交于 9月 06, 2019
```
init new amp, optimize inserting cast op for batchnorm
```
  c6a598a2
03 9月, 2019 1 次提交
- G
  Change backward_guard to optimize_guard to maximize the allreduce overlap. (#19506) · abaf87be
  由 gongweibao 提交于 9月 03, 2019
```
Change backward_guard to optimize_guard to maximize the allreduce overlap
```
  abaf87be
31 8月, 2019 1 次提交
- Z
  
  remove reset recordio usage (#19519) · 5dce1da6
  由 Zeng Jinle 提交于 8月 31, 2019
  
  5dce1da6
28 6月, 2019 1 次提交
- J
  init custom black white list (#18377) · 2b4ef509
  由 Jie Fang 提交于 6月 28, 2019
```
test=develop
```
  2b4ef509
25 6月, 2019 1 次提交
- J
  init black/white lists (#17847) · 172c2fac
  由 Jie Fang 提交于 6月 25, 2019
```
test=develop
```
  172c2fac

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功