提交 · b7a050573803c5e5ce4480d0f20bcccfa414bbfd · PaddlePaddle / Paddle

11 7月, 2023 1 次提交

support sharding parallel (#54634) · b7a05057

由 pangengzheng 提交于 7月 11, 2023

* support sharding parallel

* fix name

* fix

* update

* test amp for sharding

---------

Co-authored-by: pangengzheng <pangengzheng.baidu.com>

b7a05057

16 5月, 2023 1 次提交
- Y
  [AMP] Allow to switch whether to use promote strategy to choose kernel for O2 training. (#53742) · db407bf0
  由 Yiqun Liu 提交于 5月 16, 2023
```
* Allow to switch whether to use promote strategy to choose kernel for O2 training.

* Fix comparing error and add unittest.
```
  db407bf0
08 5月, 2023 1 次提交
- 张
  
  rm npu (#53566) · 6d396ace
  由张春乔提交于 5月 08, 2023
  
  6d396ace
27 4月, 2023 1 次提交
- Z
  [AMP] support OD level and skip dynamic loss scaling for bf16 (#53289) · 18e9dcdc
  由 Zhang Ting 提交于 4月 27, 2023
```
* support OD level and skip dynamic loss scaling for bf16
```
  18e9dcdc
24 4月, 2023 2 次提交
- 张
  
  rm mlu (#53194) · 987fb2d8
  由张春乔提交于 4月 24, 2023
  
  987fb2d8
- Z
  
  [AMP]expand blacklists for amp training (#50940) · 41e90283
  由 Zhang Ting 提交于 4月 24, 2023
  
  41e90283
18 4月, 2023 1 次提交
- Z
  
  support excluded_layers for amp.decorate (#52871) · 534efcb6
  由 Zhang Ting 提交于 4月 18, 2023
  
  534efcb6
12 4月, 2023 1 次提交
- Q
  fix dtype cast in amp for instance_norm. (#52765) · f650e901
  由 qizhaoaoe 提交于 4月 12, 2023
```
* fix dtype cast in amp.

* add test case and update docs.

* remove set_prim.
```
  f650e901
10 4月, 2023 1 次提交

[AMP] support master_grad for amp training (#52235) · 4970dd65

由 Zhang Ting 提交于 4月 10, 2023

* support set master_grad

* move register_hook to auto_cast

* update unittest

* fix fp16 test

* update for review comments

4970dd65

06 4月, 2023 1 次提交

rem is_compiled_with_npu (#52385) · 7976e2a3

由 Kim Yann 提交于 4月 06, 2023

* rem is_compiled_with_npu

* rem nup related code

* make lint happy

* rem test

* remove some tests

* Update grad_scaler.py

* fix an error

7976e2a3

03 4月, 2023 1 次提交

rem is_compiled_with_mlu (#52378) · 4b28f4ff

由 Kim Yann 提交于 4月 03, 2023

* rem is_compiled_with_mlu

* fix some mlu_place and mlu_device_coount

* make lint happy

4b28f4ff

30 3月, 2023 2 次提交
- Y
  [AMP] Add python API for collecting operator stats. (#52215) · 73544322
  由 Yiqun Liu 提交于 3月 30, 2023
```
* [AMP] Add python API for collecting operator stats.

* Fix import and polish codes.

* Add more unittest.

* Add doc for the new APIs.
```
  73544322
- Z
  
  [AMP] use promote dtype when amp_level=O2 (#51063) · 6f8ab1fa
  由 Zhang Ting 提交于 3月 30, 2023
  
  6f8ab1fa
09 3月, 2023 1 次提交

Fix hybrid parallel training strategy using bf16 (#51103) · 8db15a42

由 Ghost Screaming 提交于 3月 09, 2023

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* Remove climits.

* Fix bug of hybrid parallel strategy with recompute using bf16.

* Fix bug of recompute_hybrid ctx.amp_dtype

* Fix bug of amp_dtype.

* Fix bug of auto_cast.

8db15a42

13 2月, 2023 1 次提交
- R
  Comment: float32 => float16; test=docoument_fix (#50156) · d5b55112
  由 Ryan 提交于 2月 13, 2023
```
test=docoument_fix
```
  d5b55112
19 1月, 2023 1 次提交

[KUNLUN] add op: maxpool_with_index (#49505) · f71f77e9

由 jameszhang 提交于 1月 19, 2023

* [KUNLUN] add op: maxpool_with_index

* use DeviceContext::Alloc() instead of DenseTensor::mutable_data()

* fix file format

* solve clip unittest failure

* minor fix

* Revert "solve clip unittest failure" since the issue is fixed
in #49535

This reverts commit 1127adc66e79afe35ac3c00bb34e6aaa7cd7d78b.

* align with xdnn on the definition of mask in max_pool_with_index

* minor

f71f77e9

12 1月, 2023 1 次提交
- Z
  
  move fuild.contrib.mixed_precision to paddle.static.amp (#49412) · 69d01eb9
  由 zhangkaihuo 提交于 1月 12, 2023
  
  69d01eb9
11 1月, 2023 1 次提交
- N
  
  Update the style of print for low precision op list (#49648) · 395520f1
  由 niuliling123 提交于 1月 11, 2023
  
  395520f1
06 1月, 2023 1 次提交
- N
  
  Fix inaccurate return of low precision op list (#49391) · a214e5dc
  由 niuliling123 提交于 1月 06, 2023
  
  a214e5dc
05 1月, 2023 1 次提交
- Z
  
  move fuild.dygraph.amp to paddle.amp (#49193) · da3e9d66
  由 zhangkaihuo 提交于 1月 05, 2023
  
  da3e9d66
15 12月, 2022 1 次提交

修复paddle.amp.decorate等API的文档 (#48983) · c5af51ca

由 mjxs 提交于 12月 15, 2022

* 涉及到的api有
paddle.amp.decorate
paddle.static.npu_places
paddle.signal.istft
paddle.signal.stft
paddle.linalg.eigvalsh
paddle.randint_like

* change signal.stft

* randint_like的low增加optional

* ; test=docs_preview

* 修改了注解格式; test=docs_preview

* 修改了公式格式

* 修改了decorate的models等

* test=document_fix
Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>

c5af51ca

29 11月, 2022 1 次提交
- N
  [CodeStyle][isort] introduce isort (part4) (#48402) · f85def97
  由 Nyakku Shigure 提交于 11月 29, 2022
```
* isort all files

* revert conflicting files

* revert conflicting files

* revert conflicting files
```
  f85def97
23 10月, 2022 1 次提交
- N
  [CodeStyle][black] use black instead of yapf (#46014) · 7097630f
  由 Nyakku Shigure 提交于 10月 23, 2022
```
* update config

* re-blacken python code

* temporarily disable date and diff_py_file

* skip a format
```
  7097630f
14 9月, 2022 2 次提交
- N
  [CodeStyle][W291] trim trailing whitespace in python file (#45937) · de8c0ba5
  由 Nyakku Shigure 提交于 9月 14, 2022
```
* trim trailing whitespace

* fix `.cmake-format.py`

* revert npu ut changes, avoid npu ci error
```
  de8c0ba5
- Z
  [AMP] Support AMP-O2 for bfloat16 (#45541) · e8809d99
  由 zhangbo9674 提交于 9月 14, 2022
```
* support bfloat16 for amp_decorate

* add check_finite for bf16

* fix bug

* add ut

* add ut

* refine code
```
  e8809d99
09 5月, 2022 1 次提交
- L
  fix docs of auto_cast, cuda_places, static.save (#42107) · c3b7bc61
  由 Liyulingyue 提交于 5月 09, 2022
```
* auto_cast; test=document_fix

* static.save; test=document_fix

* cuda_places; test=document_fix
```
  c3b7bc61
07 3月, 2022 1 次提交
- Z
  [AMP] refine paddle.amp.decorate code example (#40159) · da3de72d
  由 zhangbo9674 提交于 3月 07, 2022
```
* refine amp.decorate code example

* refine code
```
  da3de72d
18 2月, 2022 1 次提交

[AMP] support GPU BF16 amp for dygraph (#39029) · 7d6d3848

由 zhangbo9674 提交于 2月 18, 2022

* support dtype param for auto_cast

* add amp_dtype for tracer

* add unsupported bf16 list

* support bf16 amp for O2

* refine python interface for bfloat16

* refine code

* refine code

* refine unittest

* refine code

* refine code

* add bf16 o1

* refine code by comment

* add gradient accumulator

* add recompute

7d6d3848

29 11月, 2021 1 次提交

[AMP] For `amp.decorate()` optimizers set to None is ok (#37541) · 2bb3f0b5

由 zhangbo9674 提交于 11月 29, 2021

* amp.decorate optimizers set to None is ok

* refine unittest

* add unittest and refine example code

* refine unittest

2bb3f0b5

17 9月, 2021 1 次提交

[AMP] Support pure fp16 training mode for dygraph (#35521) · adaeee4d

由 zhangbo9674 提交于 9月 17, 2021

* add pure fp16 major function in auto_cast & tracer

* support master weight in dygraph for pure fp16

* check mix dtype of fp16&fp32 for check_finite_and_unscale op

* change pure fp16 funtion name

* refine some bug in auto_cast

* refine auto_cast interface logic

* add param _casted_by_pure_fp16 for class Layer

* support state_dict hook for save model by user appointed dtype in pure_fp16_decorator

* refine pure_fp16_decorator as decorator

* add unittest

* add comment

* add comment

* support recompute

* add comment for auto_cast and decorator

* support to_static_state_dict for paddle.jit.save

* unlimite models num and optimizers num

* add lookup_table in black_list

* fix momentum and layer state_dict

* fix bug in layer state_dict

* fix bug in layer state_dict_helper

* refine unittest

* refine test_momentun_op

* refine interface and some code

* refine amp_decorator interface

* refine pure fp16 interface

* refine master weight interface

adaeee4d

11 6月, 2021 1 次提交
- Z
  update 2.0 public api in all left files (#33313) · 022198c5
  由 zhiboniu 提交于 6月 11, 2021
```
* update 2.0 public api in all left files

* reverse device.py all list;
fix some flake8 errors
```
  022198c5
27 4月, 2021 1 次提交

[Docs] Modified the docs of some api for supporting list/tuple args. (#32360) · 15158927

由 xiemoyuan 提交于 4月 27, 2021

* fixed docs.

* Fixed docs. test=document_fix

code bak.

fixed docs. test=document_fix

* Revert to previous version of python/paddle/fluid/backward.py

* fixed bugs.

* test=document_fix. Fixed examples.

15158927

28 10月, 2020 1 次提交
- P
  fix AMP auto_cast and grad_scaler En doc (#28177) · 8f83d5d8
  由 pangyoki 提交于 10月 28, 2020
```
* fix AMP auto_cast and grad_scaler En doc

* fix indentation problem

* change Conv2d to Conv2D
```
  8f83d5d8
21 10月, 2020 1 次提交

2.0rc api rename (#28088) · 7c1aa0d6

由 cnn 提交于 10月 21, 2020

* rename manual_seed to seed

* rename xxx1d-->xxx1D, xxx2d-->xxx2D, xxx3d-->xxx3D

* rename manual_seed --> seed

* do not rename .cc, .cu and .h file

* rename manual_seed --> seed

* rename manual_seed --> seed

* rename manual_seed --> seed

* rename manual_seed --> seed

* disable_static on doc example code

* donot change manual_seed on generator

* add enable_static on sample code

* convert python/paddle/fluid/layers/nn.py to bak

* fix typo

* fix code style

* fix seed to manual_seed when call functions of Generator()

* fix bug

7c1aa0d6

30 9月, 2020 1 次提交
- L
  Move dygraph amp api to paddle-2.0 (#27681) · 69a3339a
  由 Leo Chen 提交于 9月 30, 2020
```
* move dygraph amp api to paddle

* refine code and add unit test
```
  69a3339a

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功