Commits · acbb5dbee8ce170bcc3c12e6819206f063438af5 · BaiXuePrincess / Paddle

28 Apr, 2022 1 commit
- [CustomDevice] add amp support (#42035) · acbb5dbe
  Authored by ronnywang on Apr 28, 2022
  
  acbb5dbe
25 Mar, 2022 2 commits

fix sync_bn error in fp16 amp-o2 (#40943) · 9ab3c76b
Authored by zhangbo9674 on Mar 25, 2022

9ab3c76b

Refactor Dygraph Flags (#40786) · 3085d5e4

Authored by Jiabin Yang on Mar 25, 2022

* refactor eager flags

* fix flags error when we switch from eager to dygraph

* fix ci problem

* fix ci

* fix ci

* merge develop and fix code style

* merge develop and fix code style

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* merge develop

3085d5e4

16 Mar, 2022 1 commit
- [MLU] support amp O1 of mlu (#40461) · ad81f22c
  Authored by qipengh on Mar 16, 2022
  
  ad81f22c
15 Mar, 2022 1 commit
- [NPU] add AMP O1 support (#40362) · 69dd43d1
  Authored by furnace on Mar 15, 2022
```
* [NPU] add AMP O1 support

* [NPU] fix NOTE and warnings
```
  69dd43d1
07 Mar, 2022 1 commit
- [AMP] refine paddle.amp.decorate code example (#40159) · da3de72d
  Authored by zhangbo9674 on Mar 07, 2022
```
* refine amp.decorate code example

* refine code
```
  da3de72d
28 Feb, 2022 1 commit
- [bf16] Refine BF16 amp-o1 logic (#39815) · 18ee051e
  Authored by zhangbo9674 on Feb 28, 2022
```
* refine bf16 amp-o1 logic

* refine amp GLOG

* refine unittest

* refine unittest
```
  18ee051e
27 Feb, 2022 1 commit
- fix pylayer problem with amp (#39950) · 282e09dc
  Authored by Leo Chen on Feb 27, 2022
```
* fix pylayer problem with amp

* add ut

* refine code
```
  282e09dc
23 Feb, 2022 1 commit
- fix 'is with a literal' warning (#39798) · 22abb6b3
  Authored by Leo Chen on Feb 23, 2022
```
* fix 'is with a literal'

* fix typo
```
  22abb6b3
22 Feb, 2022 1 commit
- fix usage of paddle.version.cuda() (#39780) · 38f87238
  Authored by Leo Chen on Feb 22, 2022
  
  38f87238
18 Feb, 2022 1 commit

[AMP] support GPU BF16 amp for dygraph (#39029) · 7d6d3848

Authored by zhangbo9674 on Feb 18, 2022

* support dtype param for auto_cast

* add amp_dtype for tracer

* add unsupported bf16 list

* support bf16 amp for O2

* refine python interface for bfloat16

* refine code

* refine code

* refine unittest

* refine code

* refine code

* add bf16 o1

* refine code by comment

* add gradient accumulator

* add recompute

7d6d3848

11 Jan, 2022 1 commit
- [AMP] Check call order of paddle.amp.decorate and paddle.DataParallel (#38785) · fbb40281
  Authored by zhangbo9674 on Jan 11, 2022
```
* check amp.decorate and DataParallel

* refine coverage

* fix layer dtype

* refine code
```
  fbb40281
29 Dec, 2021 1 commit
- [AMP] Add BatchNorm_1D_2D_3D skip for paddle.amp.decorate (#38541) · 2ebc8f77
  Authored by zhangbo9674 on Dec 29, 2021
```
* add bn_1d_2d_3d for fp16 decorate

* add unittest
```
  2ebc8f77
28 Dec, 2021 1 commit

Fix scatter_op fp16 perf problem. (#38499) · 33ce249f

Authored by Li Min on Dec 28, 2021

* Fix scatter_op fp16 perf problem.

* Add scatter into black list.

* Add scatter into black list for dygraph.

33ce249f

27 Dec, 2021 1 commit
- [AMP] Fix amp.decorate bug: parameters for non leaf layers cannot be decorated (#38402) · 5d902954
  Authored by zhangbo9674 on Dec 27, 2021
```
* fix bug

* refine code

* refine code

* refine code
```
  5d902954
15 Dec, 2021 1 commit
- add use_warning of amp (#38086) · 3a2093a5
  Authored by zhangbo9674 on Dec 15, 2021
  
  3a2093a5
02 Dec, 2021 1 commit
- refine found_inf of loss_scaler (#37770) · cc2b4662
  Authored by zhangbo9674 on Dec 02, 2021
  
  cc2b4662
29 Nov, 2021 1 commit

[AMP] For `amp.decorate()` optimizers set to None is ok (#37541) · 2bb3f0b5

Authored by zhangbo9674 on Nov 29, 2021

* amp.decorate optimizers set to None is ok

* refine unittest

* add unittest and refine example code

* refine unittest

2bb3f0b5

24 Nov, 2021 1 commit

[Dy2stat]support pure fp16 for dy2stat (#36944) · 52edad6a

Authored by 0x45f on Nov 24, 2021

* run dy2stat pure fp16 in Linear model

* no use self._pure_fp16_inputs

* add test and fix Adam error in dy2stat pure fp16 training

* use paddle.optimizer.Adam

* run test in gpu

* change test time for CI

* enlarge atol for test_resnet_pure_fp16

* refine code and enlarge atol

* make custom_white_list and custom_black_list take effect for AMP and pure fp16

* check tracer is not None

* use default atol

* change filter_size

* change atol and add some NOTE

52edad6a

09 Nov, 2021 1 commit

Refine param conversion logic in layer.to (#36862) · 993ec76a

Authored by zhangbo9674 on Nov 09, 2021

* refine layer to

* delete comment

* refine logic

* refine code

* refine pure_fp16_init

* refine comment

993ec76a

22 Oct, 2021 1 commit

[hapi] support dygraph amp O2 (#36441) · 08248db0

Authored by Leo Chen on Oct 22, 2021

* [hapi] support dygraph amp O2

* fix problem of static pure fp16 in hapi

* fix bug

* fix format

* fix ut

* follow comments

* update ut

* update amp save/load

* fix ut

* refine code format

08248db0

13 Oct, 2021 2 commits
- [AMP] add attr is_distributed for layer.to (#36221) · 9a9953d9
  Authored by zhangbo9674 on Oct 13, 2021
```
* add attr is_distributed

* refine code

* refine black/white list for pure fp16
```
  9a9953d9
- [Amp] refine code of amp level (#36362) · 59e425cd
  Authored by Leo Chen on Oct 13, 2021
```
* refine amp level

* fix typo

* update tracer._amp_level
```
  59e425cd
22 Sep, 2021 2 commits
- [AMP]split minimize and add unscale_ for GradScaler (#35825) · bf6f0e54
  Authored by zhangbo9674 on Sep 22, 2021
```
* split minimize() to step() + update()

* add unscale and step for grad_scaler

* add unittest

* refine code in minimize

* delete step in loss_scaler

* fix example bug

* refine comment

* refine unittest

* add unittest
```
  bf6f0e54
- fix bug of module 'paddle' has no attribute 'fluid' for python3.6 (#35862) · 12ab017e
  Authored by zhangbo9674 on Sep 22, 2021
  
  12ab017e
17 Sep, 2021 1 commit

[AMP] Support pure fp16 training mode for dygraph (#35521) · adaeee4d

Authored by zhangbo9674 on Sep 17, 2021

* add pure fp16 major function in auto_cast & tracer

* support master weight in dygraph for pure fp16

* check mix dtype of fp16&fp32 for check_finite_and_unscale op

* change pure fp16 function name

* refine some bug in auto_cast

* refine auto_cast interface logic

* add param _casted_by_pure_fp16 for class Layer

* support state_dict hook for save model by user appointed dtype in pure_fp16_decorator

* refine pure_fp16_decorator as decorator

* add unittest

* add comment

* add comment

* support recompute

* add comment for auto_cast and decorator

* support to_static_state_dict for paddle.jit.save

* remove the limit on the number of models and optimizers

* add lookup_table in black_list

* fix momentum and layer state_dict

* fix bug in layer state_dict

* fix bug in layer state_dict_helper

* refine unittest

* refine test_momentun_op

* refine interface and some code

* refine amp_decorator interface

* refine pure fp16 interface

* refine master weight interface

adaeee4d

10 Sep, 2021 1 commit
- fix bug of recompute in hybridparallel (#35588) · d53e567a
  Authored by ShenLiang on Sep 10, 2021
  
  d53e567a
16 Aug, 2021 1 commit
- [amp] dygraph amp support param_group (#34899) · e29c2d12
  Authored by Leo Chen on Aug 16, 2021
```
* dygraph amp support param_group

* remove unused code

* fix doc
```
  e29c2d12
11 Aug, 2021 1 commit

[AMP] add state_dict and load_state_dict and unittest for class GradScaler (#34300) · 99f8f5c8

Authored by zhangbo9674 on Aug 11, 2021

* add state_dict and load_state_dict and unittest for class GradScaler

* refine unittest for coverage of load_state_dict

* refine comments of code-block

* refine some comments

* refine state_dict code and unittest

* add #require gpu, xpu for GradScaler get/set example code

* add #require gpu, xpu for GradScaler get/set example code

* refine example code

* refine unittest for state_dict

* refine unittest for state_dict

* fix bug of DataLoader in TestGradScalerStateDict

* add flag FLAGS_cudnn_deterministic

99f8f5c8

05 Aug, 2021 1 commit

[Dy2Stat]Support Mixed Precision training in @to_static (#34562) · a842828a

Authored by Aurelius84 on Aug 05, 2021

* Support Mixed Precision training in @to_static

* fix block.vars logic

* fix GPU training loss diff

* remove unused code

a842828a

15 Jul, 2021 1 commit
- cache core.ops (#34058) · f05098b5
  Authored by wanghuancoder on Jul 15, 2021
```
* cache core.ops, test=develop

* refine, test=develop
```
  f05098b5
05 Jul, 2021 1 commit

add `reduce_sum` op into amp black list (#33960) · aa9fdd0d

Authored by jiangcheng on Jul 05, 2021

* reduce sum op default fp32, add into amp black list

* making reduce_sum default to fp32 avoids returning inf when the sum is larger than 65504

aa9fdd0d

01 Jul, 2021 1 commit

[AMP] add get() and set() for Grad_scaler (#33835) · 85687348

Authored by zhangbo9674 on Jul 01, 2021

* add get and set for Grad_scaler

* refine some API name and comments

* refine API name and comments

* refine some comments

85687348

29 Jun, 2021 1 commit
- xpu support amp (#33809) · 4d4fb660
  Authored by taixiurong on Jun 29, 2021
  
  4d4fb660
21 Jun, 2021 1 commit
- Combine amp and qat (#33484) · f88af205
  Authored by cc on Jun 21, 2021
```
* Combine amp and qat
* add unit test
```
  f88af205
18 Nov, 2020 1 commit
- Add matmtl_v2 to amp list (#28693) · 11e32baf
  Authored by Leo Chen on Nov 18, 2020
```
* add matmtl_v2 to amp list

* support dygraph
```
  11e32baf
14 Sep, 2020 1 commit

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for... · d708b210

Authored by Zhen Wang on Sep 14, 2020

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)

* update amp_check_finite_and_scale_op for static_amp.

* use amp_check_finite_and_scale in static graph amp.

* set grads to zero when they contain infinite values (as for the amp_checkout_finite_and_scale op).

* add update_loss_scaling op in cpp.

* add update_loss_scaling_op unit test.

* update the doc of the check_finite_and_unscale op

* Update how the gradient update is skipped when the gradients contain infinite values.

* update the way to zero grads.

* update test_update_loss_scaling_op.py

* add log info when infinite grads are found.

* add the unit test for UpdateLossScaling Layer.

d708b210

13 Aug, 2020 1 commit

Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903) · 2d95280e

Authored by Leo Chen on Aug 13, 2020

* add auto_cast, test=develop

* add loss scaler, test=develop

* add comments, test=develop

* refine code, test=develop

* refine code, test=develop

* do not set flags automatically, test=develop

* fix custom op bug, test=develop

* add more test, test=develop

* refine enable logic, test=develop

* enable amp test with GPU, test=develop

* add unittest

* add test for found_inf

* follow comments

* follow comments

* remove global variable, use singleton

* add some notes

* update comments

* update comments

* update comments

* add use_dynamic_loss_scaling argument

* refine found_inf

* refine found_inf

2d95280e

BaiXuePrincess / Paddle is up to date with the fork source project