提交 · 083853cd4e4a9bdad22c70fa48eb9a036d2def27 · 机器未来 / Paddle

22 9月, 2022 1 次提交
- Z
  
  [Auto Parallel] fix lazyinit (#46355) (#46382) · 083853cd
  由 zhaoyingli 提交于 9月 22, 2022
  
  083853cd
19 9月, 2022 1 次提交

[Cherry-pick][Auto Parallel] Improve the APIs (#46164) · c5cc4278

由 Yulong Ao 提交于 9月 19, 2022

* [AutoParallel] adapt gradient merge pass (#45915)

* adapt gradient merge

* fix op_role

* fix strategy

* [Auto Parallel] Gradient Fuse Allreduce (#45643)

* bugfix (#45332)

* dist embedding support lookup table v1

* add unitest

* customize wait_comm

* group gradients

* bugfix

* update program

* [Auto Parallel] Improve the APIs (#45776)

* [Auto Parallel] Use c++ dist attr in the completion process

* [Auto Parallel] Add minor changes

* [Auto Parallel] Use c++ dist attr in the completion process

* [Auto Parallel] Add minor changes

* [Auto Parallel] Add the serialization process for dist attrs

* [Auto Parallel] Remove unnecessary comments

* [Auto Parallel] Fix some bugs

* [Auto Parallel] Fix the code style

* [Auto Parallel] Remove unnecessary impls

* [Auto Parallel] Fix the importing error

* [Auto Parallel] Fix the copy from bugs of op dist attr

* [Auto Parallel] Replace the use of constexpr if

* [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh

* [Auto Parallel] Change API of the completion unittest

* [Auto Parallel] Fix the bug when set_attr an int

* [Auto Parallel] Add the unittest for the serialization

* [Auto Parallel] Add some unit tests

* [Auto Paralle] Unify the strategy

* [Auto Parallel] Improve the engine api

* [Auto Parallel] Reset the changes made to the framework

* [Auto Parallel] Change the engine unittest

* [Auto Parallel] Update API of the completion and partitioner

* [Auto Parallel] Update unit tests using engine api

* update shard annotation

* [Auto Parallel] Remove the modifications of other modules

* [Auto Parallel] Add docs for APIs

* add new strategy

* [Auto Parallel] Replace the logger

* [Auto Parallel] Restore the test_program.py

* [Auto Parallel] Change the import rules

* [Auto Parallel] Add the examples for Engine

* [Auto Parallel] Do some minor changes

* [Auto Parallel] Remove yaml dependency

* [Auto Parallel] Fix the unittests

* add valid after train

* bug fix
Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com>
Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
Co-authored-by: Ncaozhou <48191911+Caozhou1995@users.noreply.github.com>

* [Auto Parallel] Bugfix allreduce fuse for MP (#46086)

* bugfix

* bugfix

* typos fixed

* update strategy (#46138)
Co-authored-by: Nzhaoyingli <86812880+zhaoyinglia@users.noreply.github.com>
Co-authored-by: NJZ-LIANG <jianzhongliang10@gmail.com>
Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com>
Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
Co-authored-by: Ncaozhou <48191911+Caozhou1995@users.noreply.github.com>

c5cc4278

09 9月, 2022 1 次提交

[AutoParallel] adapt lazyinit & fix pass (#45840) · bc2265f8

由 zhaoyingli 提交于 9月 09, 2022

* adapt lazy init and fix pass

* add unittest

* update comment

* fix amp and sharding

* remove clip_by_norm

bc2265f8

31 8月, 2022 1 次提交
- Z
  [AutoParallel] add grad_clip pass (#45513) · 11e62d68
  由 zhaoyingli 提交于 8月 31, 2022
```
* add grad_clip pass

* add unittest

* add notes

* update func

* add dist_attr for new op
```
  11e62d68
23 8月, 2022 1 次提交
- Z
  [AutoParallel] Add Quant Pass (#44877) · 61bc016c
  由 zhaoyingli 提交于 8月 23, 2022
```
* add quant pass
```
  61bc016c
18 8月, 2022 1 次提交
- Z
  [AutoParallel] support ClipGradByGlobalNorm (#45205) · bb6bd223
  由 zhaoyingli 提交于 8月 18, 2022
```
* add clip_grad

* fix comments

* add unittest

* update logger
```
  bb6bd223
15 8月, 2022 1 次提交
- Z
  [AutoParallel] add collate_fn for dist_loader (#45053) · 3649099f
  由 zhaoyingli 提交于 8月 15, 2022
```
* add collate_fn

* fix number of inputs
```
  3649099f
12 8月, 2022 1 次提交
- J
  [Auto Parallel] Data Parallel Optimization Pass 1 (#44882) · 7aeec4ed
  由 JZ-LIANG 提交于 8月 12, 2022
```
* bugfix

* remove scaling

* support rescale_grad opt
```
  7aeec4ed
13 7月, 2022 1 次提交
- J
  [Auto parallel] Accelerate procedure of partitioning and generating dist graphs (#44224) · 07f33da9
  由 JZ-LIANG 提交于 7月 13, 2022
```
* avoid sync with cpp in partition op

* delay eval & predict mode

* bugfix for gradient merge pass
```
  07f33da9
11 7月, 2022 1 次提交
- Z
  [AutoParallel] add 'to_static' in engine api (#44202) · 13a250a2
  由 zhaoyingli 提交于 7月 11, 2022
```
* add 'to_static' in engine api

* fix cmakelist
```
  13a250a2
29 6月, 2022 1 次提交
- J
  [Auto parallel] Bug fixed for GPT3 benchmark (#43793) · 74c9b57b
  由 JZ-LIANG 提交于 6月 29, 2022
```
* fixed bug for pass & engine

* fixed bug for benchmark GPT-3
```
  74c9b57b
06 6月, 2022 1 次提交
- Z
  [AutoParallel] fix gradient merge optimize parse (#43169) · c22e1123
  由 zhaoyingli 提交于 6月 06, 2022
```
* fix gradient merge

* bug fix

* update annotation
```
  c22e1123
05 6月, 2022 1 次提交

【code format check upgrade】 step2：yapf (#42944) · a072fca8

由 Sing_chan 提交于 6月 05, 2022

* use yapf to format all python file

* yapf exclude two unittests file for they rely on writing and reading file, and format will break them

* disable diff_py_file because too many diff files cause command following failed

a072fca8

01 6月, 2022 1 次提交

[Auto Parallel] Add miscellaneous improvements (#43108) · 010aba33

由 Yulong Ao 提交于 6月 01, 2022

* [Auto Parallel] Add the parallel tuner

* [Auto Parallel] Improve the parallel tuner and fix some bugs

* upodate cost model

* update import Resharder by dist op

* update cost model

* fix comp cost bug

* update cost model

* [Auto Parallel] Amend the dist attr for #processses=1

* update cost model and tuner

* update cost model and tuner

* update cost model and tuner

* update cluster

* update reshard

* [Auto Parallel] Add the estimation from the cost model

* [Auto Parallel] Reimplement the backup and restore functions

* [Auto Parallel] Fix the bugs of the parallel tuner

* [Auto Parallel] Update the engine api and dist context

* [Auto Parallel] Work around the high order grad problem

* [Auto Parallel] Add some miscellaneous improvements

* [Auto Parallel] Add a unittest for DistributedContext
Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>

010aba33

30 5月, 2022 1 次提交
- Z
  [AutoParallel] use original id in grad_op_id_to_op_id (#42992) · 17b8446d
  由 zhaoyingli 提交于 5月 30, 2022
```
* use original id in dist_op_context.grad_op_id_to_op_id

* del assert

* remove redundant map
```
  17b8446d
19 5月, 2022 1 次提交
- Z
  [AutoParallel] split data in dataloader (#42838) · df470954
  由 zhaoyingli 提交于 5月 19, 2022
```
* slice data in dist_loader & flag to scale grad

* bug fix

* update unittest

* enable static
```
  df470954
10 5月, 2022 1 次提交

[Auto Parallel] Refactor the engine api and parallelizer (#42576) · 83a4b26a

由 Yulong Ao 提交于 5月 10, 2022

* [Auto Parallel] Refactor the engine api and parallelizer

* [Auto Parallel] Fix the default dist op for the slice op

* [Auto Parallel] Fix the format of planer.py

* [Auto Parallel] Fix a bug

83a4b26a

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致