提交 · 5a2ab683e28359145a5f938fabb78b3f80c53a68 · PaddlePaddle / Paddle

01 11月, 2022 3 次提交

[CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True` (#47464) · 5a2ab683

由 Nyakku Shigure 提交于 11月 01, 2022

* [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True`

* revert changes in fluid

* revert unrelated file

* revert changes in norm

* revert changes in auto_parallel_amp

* fix norm and auto_parallel_amp

* revert a typo fix due to fixed at #47477

5a2ab683

[CodeStyle][py2] remove `six` package (part2) (#47334) · 3592ba8c

由 Nyakku Shigure 提交于 11月 01, 2022

* [CodeStyle][py2] remove `six` package (part2)

* six.ensure_str

* remove unused `import six`

* remove six from BUILTIN_LIKELY_MODULES

* remove six in example code

* remove some decode

* try to fix example code

* fix MockEtcdClient get/get_prefix returns data type

* fix MockEtcdClient get_prefix returns data

* fix MockEtcdClient get returns data

* remove `six` in pypi and conda requirements

* fix MockEtcdClient add_watch_callback/add_watch_prefix_callback returns data type

* refine MockEtcdClient

3592ba8c

S

add missing scale parameter (#47519) · ad251cb5
由 sneaxiy 提交于 11月 01, 2022

ad251cb5

31 10月, 2022 1 次提交

[Auto Parallel] Improve the c++ dist attr (#47358) · b03b4a3c

由 Yulong Ao 提交于 10月 31, 2022

* [Auto Parallel] Improve the c++ dist attr

* [Auto Parallel] Modify test_program.py

* [Auto Parallel] Add the missiong import

b03b4a3c

28 10月, 2022 3 次提交
- S
  Add fused_allreduce_gradients_with_group for PPFleetX (#47447) · c036c5c0
  由 sneaxiy 提交于 10月 28, 2022
```
* add fused_allreduce_gradients_with_group

* add scale

* fix ci
```
  c036c5c0
- Z
  [AutoParallel] fix engine _build and cost method (#47263) · 315ef265
  由 zhaoyingli 提交于 10月 28, 2022
```
* fix engine build method

* fix import

* update engine cost

* update raise error

* update cmakelist

* revert optimizer

* revert optimizer

* fix unittest

* fix unittest
Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
```
  315ef265
- L
  
  remove tcp store barrier (#47184) · 0f649b32
  由 LiYuRio 提交于 10月 28, 2022
  
  0f649b32
26 10月, 2022 1 次提交
- R
  
  fix a bug that print log twice (#47336) · c0525b82
  由 Roc 提交于 10月 26, 2022
  
  c0525b82
24 10月, 2022 3 次提交
- H
  
  fix warning infos of recompute hybrid in eager mode (#47288) · 5e97651e
  由 Haohongxiang 提交于 10月 24, 2022
  
  5e97651e
- N
  [CodeStyle][black] format dy2static unittests (#47268) · 512cb296
  由 Nyakku Shigure 提交于 10月 24, 2022
```
* [CodeStyle][black] format dy2static unittests

* format some missing files

* update lineno in test_origin_info

* update lineno in test_error

* update lineno
```
  512cb296
- T
  [CodeStyle][F522] Remove unused arguments (#46743) · cc753aa4
  由 Tony Cao 提交于 10月 24, 2022
```
* Fix F522: remove unused arguments

* add redirect_stderr argument in _run_cmd
```
  cc753aa4
23 10月, 2022 1 次提交
- N
  [CodeStyle][black] use black instead of yapf (#46014) · 7097630f
  由 Nyakku Shigure 提交于 10月 23, 2022
```
* update config

* re-blacken python code

* temporarily disable date and diff_py_file

* skip a format
```
  7097630f
21 10月, 2022 3 次提交
- K
  
  fix numpy issue in codeblock examples (#47042) · a6574658
  由 Kevin吴嘉文提交于 10月 21, 2022
  
  a6574658
- Y
  
  Fix virtualpp with mp/recompute bugs (#47242) · 9be2b721
  由 Yuang Liu 提交于 10月 21, 2022
  
  9be2b721
- C
  
  fix process group init bug (#47224) · f1b8f0ef
  由 caozhou 提交于 10月 21, 2022
  
  f1b8f0ef
20 10月, 2022 4 次提交

Z
[AutoParallel] fix fp16 for subblock (#47189) · 979af475
由 zhaoyingli 提交于 10月 20, 2022
```
* [AutoParallel] fix fp16 for subblock

* fix engine

* fix comment
```
979af475

[CodeStyle][W605] Add escape symbols to some strings (#46752) · e1c0461d

由 Tony Cao 提交于 10月 20, 2022

* Fix W605 in tools folder by adding escape symbols

* Fix W605 in incubate and some other folders

* Fix W605 in /fluid/test folders

* Update tools/analysisPyXml.py
Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>

* Add some changes to manual and auto escape symbols

* revert changes in transformer.py

* Fix new code with W605 error: add escape symbols

* revert changes in transformer.py

* revert changes in transformer.py
Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>

e1c0461d

H

support qat in sharding stage2 (#47169) · 0e552c08
由 Haohongxiang 提交于 10月 20, 2022

0e552c08
W
add test for stage2 + dp (#47114) · 8d2ce06e
由 wuhuachaocoding 提交于 10月 20, 2022
```
* add test for stage2 + dp

* update test for stage2 + dp.

* update.

* update.
```
8d2ce06e

19 10月, 2022 3 次提交
- N
  
  [CodeStyle][py2] remove `six` package (part 1) (#46965) · e6fb551c
  由 Nyakku Shigure 提交于 10月 19, 2022
  
  e6fb551c
- N
  
  [CodeStyle][F403] expand star import (#46946) · 499d2daf
  由 Nyakku Shigure 提交于 10月 19, 2022
  
  499d2daf
- R
  
  fix send for old dygraph mode by passing use_calc_stream to the send op (#47110) · d817d896
  由 Roc 提交于 10月 19, 2022
  
  d817d896
18 10月, 2022 6 次提交

[Auto Parallel]Add parallel tuner (#46189) · 3108ba11

由 caozhou 提交于 10月 18, 2022

* add parallel tuner

* add unittest

* fix unittest

* set timeout of unittest

* set unittest timeout

* fix auto_mode setting

* update unittest

* sync from develop and update unittest

* remove unused import

* update unittest

* update cmakelist

* add unittests

3108ba11

L

add strategy group (#47021) · 178d7e5e
由 LiYuRio 提交于 10月 18, 2022

178d7e5e

[Auto Parallel] Add cost interface (#47043) · da051350

由 caozhou 提交于 10月 18, 2022

* add cost interface

* update inferface and add unittest

* update unittest

* update inferface

da051350

N

remove __future__ import in docstring, test=document_fix (#46890) · 30dae6db
由 Nyakku Shigure 提交于 10月 18, 2022

30dae6db

[CodeStyle][py2] remove `compat` module (to_text) (#47036) · ad4c773b

由 Nyakku Shigure 提交于 10月 18, 2022

* [CodeStyle][py2] remove `compat` module (to_text)

* remove some unnecessary decode

* remove to_text definition and unittest

* Revert "remove to_text definition and unittest"

This reverts commit a6b69cb8dca8b9b031ce10ea32d1040e7e0dd267.

* remove an assertion

* empty commit

ad4c773b

[AutoParallel] add callbacks (#47014) · 7c92177c

由 zhaoyingli 提交于 10月 18, 2022

* [AutoParallel] add callbacks

* fix unittest

* fix dist_context

* fix engine

* fix cmakelist

* fix unittest's returns

* fix cmakelist

7c92177c

17 10月, 2022 3 次提交

Add enable_partial_send_recv switch in pipeline_configs (#46992) · b9a2f29c

由 Ghost Screaming 提交于 10月 17, 2022

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* Support allow_partial switch, which can be configure in
pipeline_configs. If sent tensor are not the same from
different hosts, they shouldn't been sent partially and
then concated as a whole tensor.

* Change name allow_partial to enable_partial_send_recv.

* Add global variable _enable_partial_send_recv

b9a2f29c

Support BF16 training for sharding (#46846) · 0b39b244

由 Ghost Screaming 提交于 10月 17, 2022

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* Support bfloat16 type for reducer and sharding.

* Fix some bug.

* Polish code.

* Polise code.

* Add bfloat16 datatype in fill_grad kernels.
Co-authored-by: Nsneaxiy <sneaxiy@126.com>

0b39b244

Y
[Auto Parallel] Fix the bug of completion (#47056) · f0af2708
由 Yulong Ao 提交于 10月 17, 2022
```
* [Auto Parallel] Fix the bug for None labels

* [Auto Parallel] Fix the completion bug
```
f0af2708

14 10月, 2022 3 次提交
- W
  
  Fix collective APIs cannot be recognized when building docs (#46962) · 2010bdc3
  由 Wen Sun 提交于 10月 14, 2022
  
  2010bdc3
- Z
  [AutoParallel] adapt for gpt-gen (#46771) · 31a437b1
  由 zhaoyingli 提交于 10月 14, 2022
```
* for gpt-gen

* fix reshard

* adapt assign and shape op

* add dist_assign & unittest

* add conditional block unittest

* rename unittest
```
  31a437b1
- Y
  
  [Auto Parallel] Fix the bug for None labels (#46987) · 974e98bc
  由 Yulong Ao 提交于 10月 14, 2022
  
  974e98bc
13 10月, 2022 3 次提交

W
combine dp and stage2 hybrid parallel. (#46795) · a95b6f33
由 wuhuachaocoding 提交于 10月 13, 2022
```
* combine dp and stage2 hybrid parallel.

* update condition.
```
a95b6f33

[WIP]飞桨PaddlePaddle 分布式强化学习功能研发 (#45998) · f0afcabc

由 Xinger 提交于 10月 13, 2022

* add rpc module in cpp side

* add rpc module in python side

* support win32 and mac for rpc

* 代码优化

* 优化代码

* update rpc

* update rpc launch

* rpc remove rank and world_size api

* fix logger import bug

* remove support for win and mac

* remove support for xpu, npu, cinn and rocm

* remove support for xpu, npu, cinn and rocm

* fix shutdown barrier timeout bug

* update:python_rpc_handler to shared ptr

* fix master shutodwn first bug

* tests support for cpu

* update log to vlog

* update get service info api

* add single process test case

* remove process group

* remove some useless dependencies

* update rpc api comments

* update rpc comments: Example to Examples

* update rpc api comments

* update rpc api comments

* update launch api comments

* update init_rpc comments

* update rpc sync and async comments

* fix bug: init_rpc cant be called repeatly in a process

* update rpc api comment: make master endpoint unique

* update rpc api:service to worker, timeout_ms to timeout

* rename ServiceInfo to WorkerInfo

* refactor: rename server to worker, log to vlog

* add launch test

* remove unused codes

* refine

f0afcabc

N

[CodeStyle][F401] fix incremental flake8 F401 and F541 issues (#46926) · f4a5fe95
由 Nyakku Shigure 提交于 10月 13, 2022

f4a5fe95

12 10月, 2022 3 次提交

J

bugfix (#46921) · acdaa4fb
由 JZ-LIANG 提交于 10月 12, 2022

acdaa4fb

[Auto Parallel] Improve the fine-grained APIs (#46552) · 686fa07a

由 Yulong Ao 提交于 10月 12, 2022

* [Auto Parallel] Suppport different dataloaders

* [Auto Parallel] Add num_shards config for dataset

* [Auto Parallel] Unify the logger and outputs of Engine API

* [Auto Parallel] Fix the bugs of to_static

* [Auto Parallel] Adjust the test_to_static.py

* [Auto Parallel] Add the prepare API and replace __call__ with run

* [Auto Parallel] Improve the private implementations of Engine

* [Auto Parallel] Set capacity of dataloader for opt tuning

* [Auto Parallel] [WIP] Change the fine-grained API

* [Auto Parallel] Improve APIs to support different user cases

* [Auto Parallel] Add removed config

* [Auto Parallel] Add imports

* [Auto Parallel] Fix bugs for to_static

* [Auto Parallel] Remove unnecessary imports

686fa07a

[Zero-Dim] support input 0D Tensor for some unary api (#45992) · 05c2b9ba
由 zhouweiwei2014 提交于 10月 12, 2022
```
* [Zero-Dim] support input 0D Tensor for unary api

* fix CI
```
05c2b9ba

PaddlePaddle / Paddle 接近 2 年 前同步成功

PaddlePaddle / Paddle
接近 2 年前同步成功