提交 · 7097630f3d137c0088c0b60c4b96dd4ecac06409 · PaddlePaddle / Paddle

23 10月, 2022 1 次提交
- N
  [CodeStyle][black] use black instead of yapf (#46014) · 7097630f
  由 Nyakku Shigure 提交于 10月 23, 2022
```
* update config

* re-blacken python code

* temporarily disable date and diff_py_file

* skip a format
```
  7097630f
21 10月, 2022 1 次提交
- Y
  
  Fix virtualpp with mp/recompute bugs (#47242) · 9be2b721
  由 Yuang Liu 提交于 10月 21, 2022
  
  9be2b721
20 10月, 2022 3 次提交

[CodeStyle][W605] Add escape symbols to some strings (#46752) · e1c0461d

由 Tony Cao 提交于 10月 20, 2022

* Fix W605 in tools folder by adding escape symbols

* Fix W605 in incubate and some other folders

* Fix W605 in /fluid/test folders

* Update tools/analysisPyXml.py
Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>

* Add some changes to manual and auto escape symbols

* revert changes in transformer.py

* Fix new code with W605 error: add escape symbols

* revert changes in transformer.py

* revert changes in transformer.py
Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>

e1c0461d

H

support qat in sharding stage2 (#47169) · 0e552c08
由 Haohongxiang 提交于 10月 20, 2022

0e552c08
W
add test for stage2 + dp (#47114) · 8d2ce06e
由 wuhuachaocoding 提交于 10月 20, 2022
```
* add test for stage2 + dp

* update test for stage2 + dp.

* update.

* update.
```
8d2ce06e

19 10月, 2022 1 次提交
- R
  
  fix send for old dygraph mode by passing use_calc_stream to the send op (#47110) · d817d896
  由 Roc 提交于 10月 19, 2022
  
  d817d896
17 10月, 2022 2 次提交

Add enable_partial_send_recv switch in pipeline_configs (#46992) · b9a2f29c

由 Ghost Screaming 提交于 10月 17, 2022

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* Support allow_partial switch, which can be configure in
pipeline_configs. If sent tensor are not the same from
different hosts, they shouldn't been sent partially and
then concated as a whole tensor.

* Change name allow_partial to enable_partial_send_recv.

* Add global variable _enable_partial_send_recv

b9a2f29c

Support BF16 training for sharding (#46846) · 0b39b244

由 Ghost Screaming 提交于 10月 17, 2022

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* Support bfloat16 type for reducer and sharding.

* Fix some bug.

* Polish code.

* Polise code.

* Add bfloat16 datatype in fill_grad kernels.
Co-authored-by: Nsneaxiy <sneaxiy@126.com>

0b39b244

13 10月, 2022 1 次提交
- W
  combine dp and stage2 hybrid parallel. (#46795) · a95b6f33
  由 wuhuachaocoding 提交于 10月 13, 2022
```
* combine dp and stage2 hybrid parallel.

* update condition.
```
  a95b6f33
12 10月, 2022 2 次提交

Y

Multi groups for broadcast of sharding stage 2 (#46894) · 95768115
由 Yuang Liu 提交于 10月 12, 2022

95768115

[CodeStyle][F401] remove unused imports in python/paddle/distributed (#46758) · fe716a0b

由 Nyakku Shigure 提交于 10月 12, 2022

* [CodeStyle][F401] remove unused import in python/paddle/distributed

* remove pass

* empty commit

* Fix ValueError: list.remove(x): x not in list for meta_optimizer_names.

Fix ValueError: list.remove(x): x not in list for meta_optimizer_names.

* Fix split import.

Fix split import.

* add noqa after meta_optimizers in factory

* restort collective ops

* expand `import *`

* add noqa after required imports

* try to fix APIs without core.ops

* Revert "try to fix APIs without core.ops"

This reverts commit 6172beaf601e84bf61f2490c12c4739f0edaa5eb.

* fix an increment

* empty commit

* add noqa after required imports

* expand `import *`, fix ci error
Co-authored-by: NShuangchi He <34329208+Yulv-git@users.noreply.github.com>

fe716a0b

09 10月, 2022 1 次提交
- Y
  
  [dygraph sharding stage 2] sharding broadcast overlap (#46656) · d8b4ca92
  由 Yuang Liu 提交于 10月 09, 2022
  
  d8b4ca92
08 10月, 2022 1 次提交
- H
  
  [Dygraph] Fix performance of pp+mp by using send/recv_calc_stream instead of send/recv (#46116) · 8c0529fd
  由 Haohongxiang 提交于 10月 08, 2022
  
  8c0529fd
28 9月, 2022 2 次提交
- Y
  
  [dygraph sharding] Overlap the reduce and the caculation for sharding stage 2. (#46495) · 9c01eaed
  由 Yuang Liu 提交于 9月 28, 2022
  
  9c01eaed
- Y
  
  [dygraph pp] all sync for allgather partial (#46483) · 3cbf0e93
  由 Yuang Liu 提交于 9月 28, 2022
  
  3cbf0e93
22 9月, 2022 1 次提交
- Y
  
  [interleave pp] sync recv for 1f1b (#46399) · f7784700
  由 Yuang Liu 提交于 9月 22, 2022
  
  f7784700
21 9月, 2022 1 次提交
- L
  
  change use_calc_stream to sync_op (#46182) · 2aacc034
  由 LiYuRio 提交于 9月 21, 2022
  
  2aacc034
20 9月, 2022 2 次提交
- R
  logger manager (#45909) · 264ad205
  由 Roc 提交于 9月 20, 2022
```
uniform logger manager in FleetAPI.
hidde API under distributed/utils which users don't need.
```
  264ad205
- Y
  
  dont wait for send op under dygraph pp (#46209) · 8ff7df8f
  由 Yuang Liu 提交于 9月 20, 2022
  
  8ff7df8f
19 9月, 2022 1 次提交
- W
  
  Recompute unify incubate (#46073) · 491e4df3
  由 wuhuachaocoding 提交于 9月 19, 2022
  
  491e4df3
16 9月, 2022 1 次提交

refactor mp. (#45803) · fa97e5ba

由 wuhuachaocoding 提交于 9月 16, 2022

* refactor mp.

* update setup.py.

* update mp_layers.py for compatibility.

* add documents for mp_layers.py

* update init.py

* update collective.py.

* update.

* update mp_ops.py

* update.

* update code style.

* update code style.

fa97e5ba

14 9月, 2022 1 次提交
- N
  [CodeStyle][W291] trim trailing whitespace in python file (#45937) · de8c0ba5
  由 Nyakku Shigure 提交于 9月 14, 2022
```
* trim trailing whitespace

* fix `.cmake-format.py`

* revert npu ut changes, avoid npu ci error
```
  de8c0ba5
09 9月, 2022 2 次提交
- Y
  
  bug fix for virtual pipeline parallel (#45922) · b51c3ff8
  由 Yuang Liu 提交于 9月 09, 2022
  
  b51c3ff8
- Y
  
  fix dygraph pp + mp nan after async send/recv (#45869) · 5d7e1c91
  由 Yuang Liu 提交于 9月 09, 2022
  
  5d7e1c91
07 9月, 2022 1 次提交
- Y
  
  [dygraph hybrid pp for interleave] Save/Load for interleaved pipeline. (#45797) · a9cc0274
  由 Yuang Liu 提交于 9月 07, 2022
  
  a9cc0274
06 9月, 2022 1 次提交
- Y
  
  [dygraph hybrid pp for interleave] The interleave scheduler for pipeline parallel (#45497) · 72b5b5bf
  由 Yuang Liu 提交于 9月 06, 2022
  
  72b5b5bf
02 9月, 2022 1 次提交
- W
  
  update some input for pp and moe about recompute. (#45628) · 4c780311
  由 wuhuachaocoding 提交于 9月 02, 2022
  
  4c780311
26 8月, 2022 3 次提交
- Y
  
  [dygraph hybrid pp for interleave] Virtual pipeline layer forward function (#45444) · 81eaa97d
  由 Yuang Liu 提交于 8月 26, 2022
  
  81eaa97d
- W
  
  [Eager] delete final state pre-name (#45306) · 126940b3
  由 wanghuancoder 提交于 8月 26, 2022
  
  126940b3
- Y
  
  [dygraph hybrid pp for interleave] Virtual pp stage layer split (#45402) · 04c15e79
  由 Yuang Liu 提交于 8月 26, 2022
  
  04c15e79
16 8月, 2022 1 次提交
- H
  [Fleet] Reconstruct of Fleet API in Dygraph Mode (#44922) · c17e6af8
  由 Haohongxiang 提交于 8月 16, 2022
```
* reconstruct_of_fleet_api

* update
```
  c17e6af8
12 8月, 2022 1 次提交
- H
  
  change default log level (#45093) · 34234282
  由 hong 提交于 8月 12, 2022
  
  34234282
10 8月, 2022 1 次提交
- A
  [OpAttr]Support VarDesc* and vector<VarDesc*> in Attribute (#44737) · 81d6fa6c
  由 Aurelius84 提交于 8月 10, 2022
```
* [OpAttr]Support VarDesc* and vector<VarDesc*> in Attribute

* add unittest for inference predictor
```
  81d6fa6c
09 8月, 2022 1 次提交
- Y
  
  [model parallel] enable mp to use fused linear (#44968) · e84250e8
  由 Yuang Liu 提交于 8月 09, 2022
  
  e84250e8
22 7月, 2022 1 次提交
- H
  
  support send_partial, recv_partial and allgather_partial in ProcessGroupNCCL (#44444) · 18c77325
  由 Haohongxiang 提交于 7月 22, 2022
  
  18c77325
13 7月, 2022 2 次提交
- S
  
  fix bug of pp (#44276) · 77c010a0
  由 ShenLiang 提交于 7月 13, 2022
  
  77c010a0
- J
  [Eager] Fix sharding in eager (#44271) · 07c729aa
  由 Jiabin Yang 提交于 7月 13, 2022
```
* fix sharding in eager

* support eager sharding
```
  07c729aa
27 6月, 2022 1 次提交
- W
  [Eager] Rename EagerPyLayer to PyLayer (#43696) · a5dc0a79
  由 wanghuancoder 提交于 6月 27, 2022
```
* rename eagerpylayer
```
  a5dc0a79
14 6月, 2022 1 次提交

Fix numpy 1.20+ deprecation warnings (#42929) · 90cf2299

由 zlsh80826 提交于 6月 14, 2022

* Replace np.bool/np.bool8 with np.bool_

* Replace np.object with np.object_

* Replace np.complex with np.complex128

* Replace np.float with np.float64

* Replace np.int with np.int_

* Rerun pre-commit for newer pre-commit configuration

* Use builtin bool instead of np.bool_ based on the context

90cf2299

05 6月, 2022 1 次提交

【code format check upgrade】 step2：yapf (#42944) · a072fca8

由 Sing_chan 提交于 6月 05, 2022

* use yapf to format all python file

* yapf exclude two unittests file for they rely on writing and reading file, and format will break them

* disable diff_py_file because too many diff files cause command following failed

a072fca8

PaddlePaddle / Paddle 大约 2 年 前同步成功

PaddlePaddle / Paddle
大约 2 年前同步成功