提交 · 016766cc89fabc10181453ce70b701dd8ed019f6 · PaddlePaddle / Paddle

21 10月, 2022 3 次提交
- K
  
  fix numpy issue in codeblock examples (#47042) · a6574658
  由 Kevin吴嘉文提交于 10月 21, 2022
  
  a6574658
- Y
  
  Fix virtualpp with mp/recompute bugs (#47242) · 9be2b721
  由 Yuang Liu 提交于 10月 21, 2022
  
  9be2b721
- C
  
  fix process group init bug (#47224) · f1b8f0ef
  由 caozhou 提交于 10月 21, 2022
  
  f1b8f0ef
20 10月, 2022 4 次提交

Z
[AutoParallel] fix fp16 for subblock (#47189) · 979af475
由 zhaoyingli 提交于 10月 20, 2022
```
* [AutoParallel] fix fp16 for subblock

* fix engine

* fix comment
```
979af475

[CodeStyle][W605] Add escape symbols to some strings (#46752) · e1c0461d

由 Tony Cao 提交于 10月 20, 2022

* Fix W605 in tools folder by adding escape symbols

* Fix W605 in incubate and some other folders

* Fix W605 in /fluid/test folders

* Update tools/analysisPyXml.py
Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>

* Add some changes to manual and auto escape symbols

* revert changes in transformer.py

* Fix new code with W605 error: add escape symbols

* revert changes in transformer.py

* revert changes in transformer.py
Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>

e1c0461d

H

support qat in sharding stage2 (#47169) · 0e552c08
由 Haohongxiang 提交于 10月 20, 2022

0e552c08
W
add test for stage2 + dp (#47114) · 8d2ce06e
由 wuhuachaocoding 提交于 10月 20, 2022
```
* add test for stage2 + dp

* update test for stage2 + dp.

* update.

* update.
```
8d2ce06e

19 10月, 2022 3 次提交
- N
  
  [CodeStyle][py2] remove `six` package (part 1) (#46965) · e6fb551c
  由 Nyakku Shigure 提交于 10月 19, 2022
  
  e6fb551c
- N
  
  [CodeStyle][F403] expand star import (#46946) · 499d2daf
  由 Nyakku Shigure 提交于 10月 19, 2022
  
  499d2daf
- R
  
  fix send for old dygraph mode by passing use_calc_stream to the send op (#47110) · d817d896
  由 Roc 提交于 10月 19, 2022
  
  d817d896
18 10月, 2022 6 次提交

[Auto Parallel]Add parallel tuner (#46189) · 3108ba11

由 caozhou 提交于 10月 18, 2022

* add parallel tuner

* add unittest

* fix unittest

* set timeout of unittest

* set unittest timeout

* fix auto_mode setting

* update unittest

* sync from develop and update unittest

* remove unused import

* update unittest

* update cmakelist

* add unittests

3108ba11

L

add strategy group (#47021) · 178d7e5e
由 LiYuRio 提交于 10月 18, 2022

178d7e5e

[Auto Parallel] Add cost interface (#47043) · da051350

由 caozhou 提交于 10月 18, 2022

* add cost interface

* update inferface and add unittest

* update unittest

* update inferface

da051350

N

remove __future__ import in docstring, test=document_fix (#46890) · 30dae6db
由 Nyakku Shigure 提交于 10月 18, 2022

30dae6db

[CodeStyle][py2] remove `compat` module (to_text) (#47036) · ad4c773b

由 Nyakku Shigure 提交于 10月 18, 2022

* [CodeStyle][py2] remove `compat` module (to_text)

* remove some unnecessary decode

* remove to_text definition and unittest

* Revert "remove to_text definition and unittest"

This reverts commit a6b69cb8dca8b9b031ce10ea32d1040e7e0dd267.

* remove an assertion

* empty commit

ad4c773b

[AutoParallel] add callbacks (#47014) · 7c92177c

由 zhaoyingli 提交于 10月 18, 2022

* [AutoParallel] add callbacks

* fix unittest

* fix dist_context

* fix engine

* fix cmakelist

* fix unittest's returns

* fix cmakelist

7c92177c

17 10月, 2022 3 次提交

Add enable_partial_send_recv switch in pipeline_configs (#46992) · b9a2f29c

由 Ghost Screaming 提交于 10月 17, 2022

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* Support allow_partial switch, which can be configure in
pipeline_configs. If sent tensor are not the same from
different hosts, they shouldn't been sent partially and
then concated as a whole tensor.

* Change name allow_partial to enable_partial_send_recv.

* Add global variable _enable_partial_send_recv

b9a2f29c

Support BF16 training for sharding (#46846) · 0b39b244

由 Ghost Screaming 提交于 10月 17, 2022

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* Support bfloat16 type for reducer and sharding.

* Fix some bug.

* Polish code.

* Polise code.

* Add bfloat16 datatype in fill_grad kernels.
Co-authored-by: Nsneaxiy <sneaxiy@126.com>

0b39b244

Y
[Auto Parallel] Fix the bug of completion (#47056) · f0af2708
由 Yulong Ao 提交于 10月 17, 2022
```
* [Auto Parallel] Fix the bug for None labels

* [Auto Parallel] Fix the completion bug
```
f0af2708

14 10月, 2022 3 次提交
- W
  
  Fix collective APIs cannot be recognized when building docs (#46962) · 2010bdc3
  由 Wen Sun 提交于 10月 14, 2022
  
  2010bdc3
- Z
  [AutoParallel] adapt for gpt-gen (#46771) · 31a437b1
  由 zhaoyingli 提交于 10月 14, 2022
```
* for gpt-gen

* fix reshard

* adapt assign and shape op

* add dist_assign & unittest

* add conditional block unittest

* rename unittest
```
  31a437b1
- Y
  
  [Auto Parallel] Fix the bug for None labels (#46987) · 974e98bc
  由 Yulong Ao 提交于 10月 14, 2022
  
  974e98bc
13 10月, 2022 3 次提交

W
combine dp and stage2 hybrid parallel. (#46795) · a95b6f33
由 wuhuachaocoding 提交于 10月 13, 2022
```
* combine dp and stage2 hybrid parallel.

* update condition.
```
a95b6f33

[WIP]飞桨PaddlePaddle 分布式强化学习功能研发 (#45998) · f0afcabc

由 Xinger 提交于 10月 13, 2022

* add rpc module in cpp side

* add rpc module in python side

* support win32 and mac for rpc

* 代码优化

* 优化代码

* update rpc

* update rpc launch

* rpc remove rank and world_size api

* fix logger import bug

* remove support for win and mac

* remove support for xpu, npu, cinn and rocm

* remove support for xpu, npu, cinn and rocm

* fix shutdown barrier timeout bug

* update:python_rpc_handler to shared ptr

* fix master shutodwn first bug

* tests support for cpu

* update log to vlog

* update get service info api

* add single process test case

* remove process group

* remove some useless dependencies

* update rpc api comments

* update rpc comments: Example to Examples

* update rpc api comments

* update rpc api comments

* update launch api comments

* update init_rpc comments

* update rpc sync and async comments

* fix bug: init_rpc cant be called repeatly in a process

* update rpc api comment: make master endpoint unique

* update rpc api:service to worker, timeout_ms to timeout

* rename ServiceInfo to WorkerInfo

* refactor: rename server to worker, log to vlog

* add launch test

* remove unused codes

* refine

f0afcabc

N

[CodeStyle][F401] fix incremental flake8 F401 and F541 issues (#46926) · f4a5fe95
由 Nyakku Shigure 提交于 10月 13, 2022

f4a5fe95

12 10月, 2022 5 次提交

J

bugfix (#46921) · acdaa4fb
由 JZ-LIANG 提交于 10月 12, 2022

acdaa4fb

[Auto Parallel] Improve the fine-grained APIs (#46552) · 686fa07a

由 Yulong Ao 提交于 10月 12, 2022

* [Auto Parallel] Suppport different dataloaders

* [Auto Parallel] Add num_shards config for dataset

* [Auto Parallel] Unify the logger and outputs of Engine API

* [Auto Parallel] Fix the bugs of to_static

* [Auto Parallel] Adjust the test_to_static.py

* [Auto Parallel] Add the prepare API and replace __call__ with run

* [Auto Parallel] Improve the private implementations of Engine

* [Auto Parallel] Set capacity of dataloader for opt tuning

* [Auto Parallel] [WIP] Change the fine-grained API

* [Auto Parallel] Improve APIs to support different user cases

* [Auto Parallel] Add removed config

* [Auto Parallel] Add imports

* [Auto Parallel] Fix bugs for to_static

* [Auto Parallel] Remove unnecessary imports

686fa07a

[Zero-Dim] support input 0D Tensor for some unary api (#45992) · 05c2b9ba
由 zhouweiwei2014 提交于 10月 12, 2022
```
* [Zero-Dim] support input 0D Tensor for unary api

* fix CI
```
05c2b9ba
Y

Multi groups for broadcast of sharding stage 2 (#46894) · 95768115
由 Yuang Liu 提交于 10月 12, 2022

95768115

[CodeStyle][F401] remove unused imports in python/paddle/distributed (#46758) · fe716a0b

由 Nyakku Shigure 提交于 10月 12, 2022

* [CodeStyle][F401] remove unused import in python/paddle/distributed

* remove pass

* empty commit

* Fix ValueError: list.remove(x): x not in list for meta_optimizer_names.

Fix ValueError: list.remove(x): x not in list for meta_optimizer_names.

* Fix split import.

Fix split import.

* add noqa after meta_optimizers in factory

* restort collective ops

* expand `import *`

* add noqa after required imports

* try to fix APIs without core.ops

* Revert "try to fix APIs without core.ops"

This reverts commit 6172beaf601e84bf61f2490c12c4739f0edaa5eb.

* fix an increment

* empty commit

* add noqa after required imports

* expand `import *`, fix ci error
Co-authored-by: NShuangchi He <34329208+Yulv-git@users.noreply.github.com>

fe716a0b

11 10月, 2022 5 次提交
- W
  
  Support both use_calc_stream and sync_op in collective communication API (#46761) · f94edc3b
  由 Wen Sun 提交于 10月 11, 2022
  
  f94edc3b
- W
  
  Completes bfloat16 dtype for collective api in eager mode (#45844) · e4eb8d36
  由 Wen Sun 提交于 10月 11, 2022
  
  e4eb8d36
- C
  
  update instantiate for auto parallel (#46883) · 3b5064d6
  由 caozhou 提交于 10月 11, 2022
  
  3b5064d6
- T
  
  Fix F524: add escape symbol to curly braces (#46745) · 2b5d325f
  由 Tony Cao 提交于 10月 11, 2022
  
  2b5d325f
- T
  [CodeStyle][E713] Convert 'not ... in ' into 'not in' (#46734) · 7ad6d9ea
  由 Tony Cao 提交于 10月 11, 2022
```
* Update README.md

* Update README.md

* Fix E713: convert 'not ... in' to 'not in'
```
  7ad6d9ea
10 10月, 2022 5 次提交
- T
  [CodeStyle][F632] Replace 'is' and 'is not' with == and != respectively (#46708) · 5194f565
  由 Tony Cao 提交于 10月 10, 2022
```
* Update README.md

* Update README.md

* Fix F632: replace 'is', 'is not' with ==, != respectively
```
  5194f565
- T
  [CodeStyle][F541] Convert f-strings without curly braces to normal strings (#46700) · c64e1dcf
  由 Tony Cao 提交于 10月 10, 2022
```
* Update README.md

* Update README.md

* Fix F541 by converting f-string to normal strings
```
  c64e1dcf
- Y
  [Auto Parallel] Fix bugs caused by the inconsistent outputs of Engine API (#46633) · 0ce5554c
  由 Yulong Ao 提交于 10月 10, 2022
```
* [Auto Parallel] Unify the logger and outputs of Engine API

* [Auto Parallel] Fix the bugs of to_static

* [Auto Parallel] Adjust the test_to_static.py
```
  0ce5554c
- W
  
  fix the combination bug of sharding stage1 + dp (#46631) · 6e4cba14
  由 wuhuachaocoding 提交于 10月 10, 2022
  
  6e4cba14
- L
  
  Move group and all reduce from collective to communication (#45848) · a0dffd39
  由 LiYuRio 提交于 10月 10, 2022
  
  a0dffd39

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功