1. 07 September 2023 (1 commit)
    • [NewIR] Update send recv infermeta and add unittest (#56794) · 2857fdbb
      Committed by zhaoyingli
      * [NewIR]Update send recv infermeta and add unittest
      
      * rm new ir flag
      
      * rm fluid api
      
      * skip running startup prog
      
      * update flag name
      
      * update recv_v2 yaml
      
      * fix conflict
      
      * unittest only for pp
      
      * fix cmakelist
      
      * unittest check precision
      
      * control random
      
      * fix cmakelist
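The "unittest check precision" and "control random" bullets rely on a standard testing idea: fix all random seeds so two runs produce bit-wise comparable results. A minimal stdlib sketch of that idea (not Paddle's actual test code):

```python
import random


def run_trial(seed):
    # Fix the seed so repeated runs produce identical "random" values,
    # which makes precision comparisons between runs meaningful.
    random.seed(seed)
    return [random.random() for _ in range(3)]


# Two trials with the same seed yield identical sequences.
a = run_trial(1234)
b = run_trial(1234)
assert a == b
```

In a real distributed test, the same treatment would also cover the framework's own seeds, not just Python's `random` module.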
  2. 06 September 2023 (2 commits)
  3. 05 September 2023 (4 commits)
    • Add attributes to support to analyse the stream across interpreters (#56814) · f5497fd0
      Committed by lzydev
      * fix static_build for pp
      
      * add manual_event to support streams across progs
      
      * revert static_build.sh
      
      * fix coverage-ci
      
      * modify the method to name events
      
      * change code according to review
    • fix some bugs for amp and test case test_tuning_recompute_with_amp.py (#56864) · e9e07a19
      Committed by Wennie396
      * replace amp.use_pure_fp16 with amp.dtype and amp.level
      
      * old api still use use_pure_fp16
      
      * test_fuse_adamw_pass still use use_pure_fp16
      
      * add test case for tuning recompute with amp (float16, O2)
      
      * reset new test case properties TIMEOUT 60
      
      * set smaller value of batch_size and batch_num
      
      * deepcopy dist_context fix _rename_input problem
      
      * fix loss name after cast
      
      * set tuning.enable=True and use engine._tune()
      
      * restore some changes in _rename_input()/_rename_output()
      
      * add self.amp_dtype for _cast_loss() in auto_parallel_amp.py
      
      * fix insert op index in _cast_loss()
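The first bullet replaces a single boolean switch with two explicit fields. The mapping can be sketched as below; the function and field names are illustrative, not Paddle's actual API:

```python
def translate_amp_config(use_pure_fp16: bool) -> dict:
    """Map the legacy use_pure_fp16 flag onto separate dtype/level fields."""
    # Historically, use_pure_fp16=True meant pure-fp16 (O2) training,
    # while False meant mixed-precision (O1).
    level = "o2" if use_pure_fp16 else "o1"
    return {"dtype": "float16", "level": level}


print(translate_amp_config(True))   # {'dtype': 'float16', 'level': 'o2'}
```

Splitting the flag this way lets dtype (float16 vs bfloat16) vary independently of the optimization level, which a single boolean cannot express.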
    • [xdoctest][task 184-185] reformat example code with google style in `distributed/auto_parallel/static/*` (#56666) · 1a15a351
      Committed by 小飞猪
      
      * [Doctest]fix No.184,185, test=docs_preview
      
      * add env skip
      
      * fix @staticmethod
      
      * fix
      
      * add xdoctest for v2
      
      * fix
    • [xdoctest][task 224-225] reformat example code with google style in `python/paddle/distributed/fleet` (#56815) · 53d0869f
      Committed by iSerendipity
      
      * [Doctest]fix No.224-225, test=docs_preview
      
      * fix the AttributeError
  4. 04 September 2023 (1 commit)
  5. 01 September 2023 (2 commits)
  6. 31 August 2023 (5 commits)
  7. 30 August 2023 (2 commits)
    • [Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5
      Committed by Ghost Screaming
      * for verify: fluid operators support new comm library
      
      * u
      
      * u
      
      * u
      
      * compatible new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.
      
      * Remove useless comments in process_group.py
      
      * Polish code style.
      
      * Fix some problems.
      
      * Remove use fluid api in phi comm_context_manager.
      
      * Add PADDLE_WITH_CUDA and PADDLE_WITH_NCCL macro judgements.
      
      * Fix bug of HIP architecture.
      
      * Fix some problems.
      1. Remove useless loggings.
      2. Fix conditional compilation for HIP.
      3. Fix problems of test_pass_generation_pipeline.py. It calls paddle.distributed.init_parallel_env() first,
      then auto.Engine calls _init_comm(), which calls process_group.instantiate(). However, init_parallel_env()
      calls paddle.distributed.barrier(), which calls CreateNCCLEnvCache and creates the corresponding
      NCCLCommContext. But dev_id is not set; as a result, the NCCLCommContext's dev_ctx is not initialized.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code.
      
      * Revert compatible upgrade for communication operators. Their upgrades
      will be submitted in another PR.
      
      * Remove StaticTCPStore.
      
      * Remove useless modification.
      
      * Remove useless set_cuda_device_id.
      
      * Polish code.
      
      * Remove fluid header files in phi files.
      
      * Remove useless comments.
      
      * Fix problems of hip arch.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code style.
      
      ---------
      Co-authored-by: hitywt <yuwentao126@126.com>
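The ordering bug described in the third fix above can be modeled in a few lines. All class and method names below are hypothetical stand-ins, not Paddle's real ones; the point is only that creating a comm context before the device id is set leaves its device context uninitialized:

```python
class FakeCommContextManager:
    """Toy model of the init-order bug: dev_id must be set before contexts exist."""

    def __init__(self):
        self.dev_id = None

    def set_device_id(self, dev_id):
        self.dev_id = dev_id

    def create_comm_context(self):
        if self.dev_id is None:
            # Mirrors the reported symptom: the context's dev_ctx
            # would be created against an unknown device.
            raise RuntimeError("dev_id not set before creating comm context")
        return {"dev_ctx": f"gpu:{self.dev_id}"}


mgr = FakeCommContextManager()
mgr.set_device_id(0)             # the fix: establish the device id first
ctx = mgr.create_comm_context()
print(ctx["dev_ctx"])            # gpu:0
```

Calling `create_comm_context()` before `set_device_id()` raises, which is the toy analogue of `barrier()` creating an NCCLCommContext before `dev_id` is known.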
    • [xdoctest] reformat example code with google style in No.307 (#56595) · 34eecb0e
      Committed by 张春乔
      * weight_norm_hook
      
      * Update weight_norm_hook.py
      
      * Update weight_norm_hook.py
      
      * Update python/paddle/nn/utils/weight_norm_hook.py
      
      * Update python/paddle/nn/utils/weight_norm_hook.py
      
      * Update python/paddle/nn/utils/weight_norm_hook.py
      Co-authored-by: Nyakku Shigure <sigure.qaq@gmail.com>
      
      * xdoc
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      
      ---------
      Co-authored-by: Nyakku Shigure <sigure.qaq@gmail.com>
  8. 29 August 2023 (2 commits)
  9. 28 August 2023 (2 commits)
  10. 25 August 2023 (4 commits)
  11. 24 August 2023 (1 commit)
  12. 23 August 2023 (1 commit)
  13. 22 August 2023 (5 commits)
  14. 21 August 2023 (2 commits)
  15. 19 August 2023 (1 commit)
  16. 18 August 2023 (1 commit)
  17. 17 August 2023 (1 commit)
  18. 16 August 2023 (2 commits)
    • Add mp_all_reduce asynchronize overlap. (#55662) · 6b1dfb5f
      Committed by Ghost Screaming
      * [WIP] Add mp_all_reduce asynchronize overlap.
      
      * Fix some problems.
      
      * Fix dw compute bug, and use a temporary solution to achieve overlap.
      
      * Use fused_linear_param_grad_add to compute dw.
      
      * Reformat ColumnParallel _overlap_linear. Use environment flags to
      control the following behaviors:
      1. export Flags_mp_aysnc_allreduce=True to turn on mp async all_reduce
      2. export Flags_skip_mp_c_identity=True to skip two c_identity operators
         in dygraph mode.
      3. export Flags_fused_linear_param_grad_add to enable fused_linear_param_grad_add
         in ColumnParallel backward with mp async all_reduce.
      
      * Polish code.
      
      * Remove useless communication API.
      
      * Fix some problems in mp_async_all_reduce and skip_c_identity.
      
      * Add test cases.
      
      * Remove environment variable Flags_fused_linear_param_grad_add in test case.
      
      * Reset error threshold.
      
      * Reset threshold in test case.
      
      * Add useful log. Remove useless test cases.
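The three flags in the "Reformat ColumnParallel _overlap_linear" bullet would be set before launching training, roughly as below. The flag spellings are copied verbatim from the commit message; whether the third flag expects `True` (rather than, say, `1`) is an assumption:

```shell
# Hypothetical launch snippet; flag names are verbatim from the commit text.
export Flags_mp_aysnc_allreduce=True           # overlap mp all_reduce with backward compute
export Flags_skip_mp_c_identity=True           # skip the two c_identity ops in dygraph mode
export Flags_fused_linear_param_grad_add=True  # compute dw via fused_linear_param_grad_add
echo "overlap flags: $Flags_mp_aysnc_allreduce $Flags_skip_mp_c_identity $Flags_fused_linear_param_grad_add"
```

Per the commit, the third flag only takes effect together with mp async all_reduce, since the fused dw computation is part of that overlapped backward path.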
    • make params_grads order same between dynamic and auto_parallel (#56126) · 496422e9
      Committed by zhaoyingli
      * make params_grads order same between dynamic and static mode
      
      * revert inplace clip
      
      * use sorted attribute to control
      
      * tiny fix
      
      * fix find loss_grad_op
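The "use sorted attribute to control" bullet suggests imposing one deterministic ordering on the (param, grad) list so that dynamic and static modes agree. A minimal stdlib sketch of that idea, with made-up parameter names:

```python
# Hypothetical (name, grad) pairs; only the ordering idea matters here.
params_grads = [
    ("linear_1.w_0", "g1"),
    ("embedding_0.w_0", "g2"),
    ("linear_0.w_0", "g0"),
]

# Sorting by parameter name yields the same order regardless of which
# execution mode produced the list, so optimizer updates line up.
ordered = sorted(params_grads, key=lambda pg: pg[0])
print([name for name, _ in ordered])
# ['embedding_0.w_0', 'linear_0.w_0', 'linear_1.w_0']
```

A stable, mode-independent order matters for bitwise reproducibility: gradient clipping and optimizer updates applied in a different order can accumulate different floating-point rounding.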
  19. 15 August 2023 (1 commit)