Commits · 0cf3e8f93d84598555e7e15bdc638c98b1413d5b · BaiXuePrincess / Paddle

13 Dec, 2021 · 1 commit

[cherry pick] Refine param conversion logic in layer.to (#38068) · 0cf3e8f9

Committed by zhangbo9674 on Dec 13, 2021

Optimize the layer.to implementation logic. Related PRs:
Remove additional warning in layer.to (#36700)
Refine param conversion logic in layer.to (#36862)
Fix Layer.to() of device bug (#37156)

0cf3e8f9

26 Oct, 2021 · 1 commit

[Amp] refine code of amp level (#36362) (#36726) · 1ee4fc32

Committed by Leo Chen on Oct 26, 2021

* refine amp level

* fix typo

* update tracer._amp_level

1ee4fc32

26 Sep, 2021 · 1 commit

[cherry pick]split minimize and add unscale_ for GradScaler (#35927) · e262125d

Committed by zhangbo9674 on Sep 26, 2021

1. Split GradScaler::minimize() into GradScaler::step() + GradScaler::update()
2. Add GradScaler::unscale_(optimizer)

e262125d
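
The split described in this commit can be illustrated with a toy, framework-free scaler. This is a minimal sketch, not Paddle's implementation: `ToySGD`, `ToyScaler`, their fields, and the "halve on overflow" policy are hypothetical, and only the method names `minimize`, `step`, `update`, and `unscale_` come from the commit message. It shows why a separate `unscale_` is useful: gradients can be clipped in their true range before the optimizer step.

```python
import math


class ToySGD:
    """Hypothetical optimizer: holds params and grads as plain float lists."""

    def __init__(self, params, lr=0.1):
        self.params = list(params)
        self.grads = [0.0] * len(self.params)
        self.lr = lr

    def step(self):
        self.params = [p - self.lr * g for p, g in zip(self.params, self.grads)]


class ToyScaler:
    """Hypothetical loss scaler mirroring the minimize -> step + update split."""

    def __init__(self, init_scale=1024.0):
        self.scale = init_scale
        self.found_inf = False
        self.unscaled = False

    def unscale_(self, optimizer):
        # Divide gradients by the scale in place and record non-finite values.
        if self.unscaled:
            return
        optimizer.grads = [g / self.scale for g in optimizer.grads]
        self.found_inf = any(not math.isfinite(g) for g in optimizer.grads)
        self.unscaled = True

    def step(self, optimizer):
        # Unscale if the caller has not done so, then skip the update on inf/nan.
        self.unscale_(optimizer)
        if not self.found_inf:
            optimizer.step()

    def update(self):
        # Assumed policy: halve the scale after an overflow, otherwise keep it.
        if self.found_inf:
            self.scale /= 2.0
        self.found_inf = False
        self.unscaled = False

    def minimize(self, optimizer):
        # The old single call behaves like step() followed by update().
        self.step(optimizer)
        self.update()


opt = ToySGD([1.0, 2.0])
scaler = ToyScaler(init_scale=1024.0)
opt.grads = [512.0, -2048.0]   # gradients of the scaled loss
scaler.unscale_(opt)           # grads are now in their true range
opt.grads = [max(-1.0, min(1.0, g)) for g in opt.grads]  # clip to [-1, 1]
scaler.step(opt)               # does not unscale a second time
scaler.update()
```

Calling `scaler.minimize(opt)` instead of the last three calls would skip the clipping step, which is the main reason to expose `unscale_`, `step`, and `update` separately.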

22 Sep, 2021 · 1 commit

[cherry-pick] fix bug of module 'paddle' has no attribute 'fluid' for python3.6 (#35862) (#35900) · c0535200

Committed by zhangbo9674 on Sep 22, 2021

Fix the "module 'paddle' has no attribute 'fluid'" error on Python 3.6.

c0535200

17 Sep, 2021 · 1 commit

[AMP] Support pure fp16 training mode for dygraph (#35521) · adaeee4d

Committed by zhangbo9674 on Sep 17, 2021

* add pure fp16 major function in auto_cast & tracer

* support master weight in dygraph for pure fp16

* check mixed fp16 & fp32 dtypes for check_finite_and_unscale op

* change pure fp16 function name

* fix some bugs in auto_cast

* refine auto_cast interface logic

* add param _casted_by_pure_fp16 for class Layer

* support state_dict hook for saving the model in a user-specified dtype in pure_fp16_decorator

* refine pure_fp16_decorator as decorator

* add unittest

* add comment

* add comment

* support recompute

* add comment for auto_cast and decorator

* support to_static_state_dict for paddle.jit.save

* remove the limit on the number of models and optimizers

* add lookup_table in black_list

* fix momentum and layer state_dict

* fix bug in layer state_dict

* fix bug in layer state_dict_helper

* refine unittest

* refine test_momentum_op

* refine interface and some code

* refine amp_decorator interface

* refine pure fp16 interface

* refine master weight interface

adaeee4d
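
The "master weight" item in this commit addresses a known weakness of pure fp16 training: updates smaller than fp16 resolution are rounded away. The sketch below is a framework-free illustration, not Paddle code. The helper names and the per-step update value are hypothetical, and half precision is emulated with the standard library's `struct` format `e`.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_fp32(x):
    """Round a Python float to the nearest IEEE 754 single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


# Hypothetical per-step update (lr * grad), below fp16 resolution near 1.0.
UPDATE = 1e-4
STEPS = 100

# Pure fp16 without a master weight: every update is rounded away.
w_fp16_only = to_fp16(1.0)
for _ in range(STEPS):
    w_fp16_only = to_fp16(w_fp16_only - UPDATE)

# With a master weight: accumulate in fp32, then cast to fp16 for compute.
master = to_fp32(1.0)
for _ in range(STEPS):
    master = to_fp32(master - UPDATE)
w_from_master = to_fp16(master)

print(w_fp16_only, master, w_from_master)
```

In this setup the fp16-only weight never leaves 1.0, while the fp32 master copy accumulates the updates and its fp16 cast ends up close to 0.99.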

10 Sep, 2021 · 1 commit

fix bug of recompute in hybridparallel (#35588) · d53e567a

Committed by ShenLiang on Sep 10, 2021

d53e567a

16 Aug, 2021 · 1 commit

[amp] dygraph amp support param_group (#34899) · e29c2d12

Committed by Leo Chen on Aug 16, 2021

* dygraph amp support param_group

* remove unused code

* fix doc

e29c2d12

11 Aug, 2021 · 1 commit

[AMP] add state_dict and load_state_dict and unittest for class GradScaler (#34300) · 99f8f5c8

Committed by zhangbo9674 on Aug 11, 2021

* add state_dict and load_state_dict and unittest for class GradScaler

* refine unittest for coverage of load_state_dict

* refine comments of code-block

* refine some comments

* refine state_dict code and unittest

* add #require gpu, xpu for GradScaler get/set example code

* add #require gpu, xpu for GradScaler get/set example code

* refine example code

* refine unittest for state_dict

* refine unittest for state_dict

* fix bug of DataLoader in TestGradScalerStateDict

* add flag FLAGS_cudnn_deterministic

99f8f5c8
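
A `state_dict` / `load_state_dict` pair on a loss scaler lets a resumed run continue with the same loss scale and step counters. The following is a minimal sketch under stated assumptions: `ToyGradScaler`, its field names, and its update policy are hypothetical and are not taken from Paddle's `GradScaler`.

```python
class ToyGradScaler:
    """Hypothetical scaler whose dynamic loss-scaling state can be saved and restored."""

    def __init__(self, init_loss_scaling=2.0 ** 15, incr_ratio=2.0,
                 decr_ratio=0.5, incr_every_n_steps=1000):
        self.scale = init_loss_scaling
        self.incr_ratio = incr_ratio
        self.decr_ratio = decr_ratio
        self.incr_every_n_steps = incr_every_n_steps
        self.good_steps = 0  # consecutive steps without inf/nan

    def update(self, found_inf):
        if found_inf:
            self.scale *= self.decr_ratio
            self.good_steps = 0
        else:
            self.good_steps += 1
            if self.good_steps == self.incr_every_n_steps:
                self.scale *= self.incr_ratio
                self.good_steps = 0

    def state_dict(self):
        # Return a plain dict copy so it can be stored next to model state.
        return dict(vars(self))

    def load_state_dict(self, state):
        missing = set(vars(self)) - set(state)
        if missing:
            raise KeyError(f"missing keys in scaler state: {sorted(missing)}")
        for key in list(vars(self)):
            setattr(self, key, state[key])


scaler = ToyGradScaler(init_loss_scaling=1024.0, incr_every_n_steps=2)
scaler.update(found_inf=True)    # overflow: scale is halved
scaler.update(found_inf=False)   # one good step recorded
checkpoint = scaler.state_dict()

resumed = ToyGradScaler()
resumed.load_state_dict(checkpoint)
resumed.update(found_inf=False)  # second good step: scale is doubled
```

Without the saved counters, the resumed scaler would restart from its default scale and lose the progress toward the next scale increase.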

05 Aug, 2021 · 1 commit

[Dy2Stat]Support Mixed Precision training in @to_static (#34562) · a842828a

Committed by Aurelius84 on Aug 05, 2021

* Support Mixed Precision training in @to_static

* fix block.vars logic

* fix GPU training loss diff

* remove unused code

a842828a

15 Jul, 2021 · 1 commit

cache core.ops (#34058) · f05098b5

Committed by wanghuancoder on Jul 15, 2021

* cache core.ops, test=develop

* refine, test=develop

f05098b5

05 Jul, 2021 · 1 commit

add `reduce_sum` op into amp black list (#33960) · aa9fdd0d

Committed by jiangcheng on Jul 05, 2021

* make the reduce_sum op default to fp32 by adding it to the amp black list

* running reduce_sum in fp32 avoids returning inf when the sum is larger than 65504

aa9fdd0d
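
The 65504 limit in this commit is the largest finite half-precision value. The sketch below reproduces the overflow without any framework by emulating fp16 and fp32 accumulation with the standard library's `struct` module. The helper names and the input values are illustrative assumptions, and the real op may accumulate differently.

```python
import math
import struct

FP16_MAX = 65504.0  # largest finite IEEE 754 half-precision value


def to_fp16(x):
    """Round to half precision; values that overflow become +/-inf."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        return math.copysign(math.inf, x)


def to_fp32(x):
    """Round to single precision (inputs here stay well within range)."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def reduce_sum(values, cast):
    """Sum with the accumulator rounded to the given precision after each add."""
    total = cast(0.0)
    for v in values:
        total = cast(total + cast(v))
    return total


values = [1000.0] * 100  # true sum is 100000, above FP16_MAX
sum_fp16 = reduce_sum(values, to_fp16)
sum_fp32 = reduce_sum(values, to_fp32)
print(sum_fp16, sum_fp32)
```

Every input is comfortably representable in fp16, yet the fp16 accumulator overflows to inf while the fp32 accumulator returns the correct sum.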

01 Jul, 2021 · 1 commit

[AMP] add get() and set() for Grad_scaler (#33835) · 85687348

Committed by zhangbo9674 on Jul 01, 2021

* add get and set for Grad_scaler

* refine some API name and comments

* refine API name and comments

* refine some comments

85687348

29 Jun, 2021 · 1 commit

xpu support amp (#33809) · 4d4fb660

Committed by taixiurong on Jun 29, 2021

4d4fb660

21 Jun, 2021 · 1 commit

Combine amp and qat (#33484) · f88af205

Committed by cc on Jun 21, 2021

* Combine amp and qat

* add unit test

f88af205

18 Nov, 2020 · 1 commit

Add matmul_v2 to amp list (#28693) · 11e32baf

Committed by Leo Chen on Nov 18, 2020

* add matmul_v2 to amp list

* support dygraph

11e32baf

14 Sep, 2020 · 1 commit

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for... · d708b210

Committed by Zhen Wang on Sep 14, 2020

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)

* update amp_check_finite_and_scale_op for static_amp.

* use amp_check_finite_and_scale in static graph amp.

* set grads to zero when they contain infinite values (as for the amp_check_finite_and_scale op).

* add update_loss_scaling op in cpp.

* add update_loss_scaling_op unit test.

* update the doc of the check_finite_and_unscale op

* Update the logic for skipping the gradient update when the gradients have infinite values.

* update the way to zero grads.

* update test_update_loss_scaling_op.py

* add log info when infinite grads are found.

* add the unit test for UpdateLossScaling Layer.

d708b210

13 Aug, 2020 · 1 commit

Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903) · 2d95280e

Committed by Leo Chen on Aug 13, 2020

* add auto_cast, test=develop

* add loss scaler, test=develop

* add comments, test=develop

* refine code, test=develop

* refine code, test=develop

* do not set flags automatically, test=develop

* fix custom op bug, test=develop

* add more test, test=develop

* refine enable logic, test=develop

* enable amp test with GPU, test=develop

* add unittest

* add test for found_inf

* follow comments

* follow comments

* remove global variable, use singleton

* add some notes

* update comments

* update comments

* update comments

* add use_dynamic_loss_scaling argument

* refine found_inf

* refine found_inf

2d95280e
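
Several commits in this history add ops to an AMP list (`matmul_v2`) or to the black list (`reduce_sum`, `lookup_table`). The sketch below shows how list-based auto-casting typically decides a compute dtype per op. It is an assumption-laden illustration, not Paddle's tracer logic: the function name, the `conv2d` entry, placing `matmul_v2` on the white list, and the promotion rule for unlisted ops are all hypothetical.

```python
# Ops assumed to be safe and fast in fp16 (conv2d is an illustrative guess).
WHITE_LIST = {"conv2d", "matmul_v2"}
# Ops kept in fp32 for numerical safety, as named in the commits above.
BLACK_LIST = {"reduce_sum", "lookup_table"}


def choose_compute_dtype(op_type, input_dtypes, enable=True):
    """Pick the dtype an op's float inputs would be cast to under auto-cast."""
    if not enable:
        return None  # auto-cast disabled: leave inputs untouched
    if op_type in WHITE_LIST:
        return "float16"
    if op_type in BLACK_LIST:
        return "float32"
    # Unlisted ops: assumed to promote to float32 if any input is float32.
    return "float32" if "float32" in input_dtypes else "float16"


print(choose_compute_dtype("matmul_v2", ["float32", "float32"]))
print(choose_compute_dtype("reduce_sum", ["float16"]))
print(choose_compute_dtype("relu", ["float16", "float32"]))
```

Keeping the decision in two small sets makes it cheap to move a single op between precisions, which is what the list-related commits in this log do.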
