Commits · ade51aa57d3d4357535cdad972307d8854a70d74 · PaddlePaddle / Paddle

30 Aug, 2023 2 commits

[Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5

Committed by Ghost Screaming on Aug 30, 2023

* for verify

fluid operator support new comm library

* u

* u

* u

* Compatible new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.

* Remove useless comments in process_group.py

* Polish code style.

* Fix some problems.

* Remove use of fluid API in phi comm_context_manager.

* Add PADDLE_WITH_CUDA and PADDLE_WITH_NCCL macro checks.

* Fix bug of HIP architecture.

* Fix some problems.
1. Remove useless logging.
2. Fix conditional compilation for HIP.
3. Fix problems in test_pass_generation_pipeline.py. The test first calls paddle.distributed.init_parallel_env();
auto.Engine then calls _init_comm(), which calls process_group.instantiate(). However, init_parallel_env() calls
paddle.distributed.barrier(), which calls CreateNCCLEnvCache and creates the corresponding NCCLCommContext. Because dev_id is not
set at that point, the NCCLCommContext's dev_ctx is not initialized.

* Fix some problems.

* Polish code.

* Polish code.

* Revert compatible upgrade for communication operators. Their upgrades
will be submitted in another PR.

* Remove StaticTCPStore.

* Remove useless modification.

* Remove useless set_cuda_device_id.

* Polish code.

* Remove fluid header files in phi files.

* Remove useless comments.

* Fix problems of hip arch.

* Fix some problems.

* Polish code.

* Polish code style.

---------
Co-authored-by: hitywt <yuwentao126@126.com>

ade51aa5

[xdoctest] reformat example code with google style in No.307 (#56595) · 34eecb0e

Committed by 张春乔 on Aug 30, 2023

* weight_norm_hook

* Update weight_norm_hook.py

* Update weight_norm_hook.py

* Update python/paddle/nn/utils/weight_norm_hook.py

* Update python/paddle/nn/utils/weight_norm_hook.py

* Update python/paddle/nn/utils/weight_norm_hook.py
Co-authored-by: Nyakku Shigure <sigure.qaq@gmail.com>

* xdoc

* Apply suggestions from code review

* Apply suggestions from code review

---------
Co-authored-by: Nyakku Shigure <sigure.qaq@gmail.com>

34eecb0e

29 Aug, 2023 2 commits
- [Doctest]fix No.218, test=docs_preview (#56730) · 41e72a41
  Committed by iSerendipity on Aug 29, 2023
  
  41e72a41
- [xdoctest][task 181-183] reformat example code with google style in... · 51c3c66b
  Committed by 小飞猪 on Aug 29, 2023
```
[xdoctest][task 181-183] reformat example code with google style in `sparse/multiary.py`,`distributed/auto_parallel/*` (#56665)

* [Doctest]fix No.181-183, test=docs_preview

* add env skip
```
  51c3c66b
28 Aug, 2023 2 commits

[xdoctest][task 213,215-217] reformat example code with google style in... · f9c51e8c

Committed by iLeGend on Aug 28, 2023

[xdoctest][task 213,215-217] reformat example code with google style in `python/paddle/distributed/fleet/base` (#56651)

* [xdoctest][task 213,215-217] reformat example code with google style in python/paddle/distributed/fleet/base

* fix output as comments

f9c51e8c

fix fetch problem in pass_utils.py and eval_loss in parallelizer_v2.py (#56539) · c7727885
Committed by Wennie396 on Aug 28, 2023
```
* fix eval_loss bug in parallelizer_v2.py

* fix fetch problem in pass_utils.py
```
c7727885

25 Aug, 2023 4 commits

[CustomDevice] add comm context support (#56301) · 62397cd2
Committed by ronnywang on Aug 25, 2023

62397cd2
fix pylayer py39 mem leak (#56623) · ede8fd55
Committed by wanghuancoder on Aug 25, 2023
```
* fix pylayer py39 mem leak
```
ede8fd55

[xdoctest] reformat example code with google style in 192-197 (#55926) · bde10965

Committed by 张春乔 on Aug 25, 2023

* Update input.py

* Update input.py

* Update gather.py

* Update broadcast.py

* Update batch_isend_irecv.py

* Update all_to_all.py

* Update all_reduce.py

* Update all_gather.py

* rollback

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

bde10965

[AutoParallel] remove pyreader, use feed op in pipeline schedule (#56511) · 0012c8d5

Committed by zhaoyingli on Aug 25, 2023

* modify feed_data for dataloader in pipeline parallel mode

* add pre-commit

* remove read op, use feed op

* fix validate batch_size

* tiny fix

* support catch EOFException

* fix conflict

* fix conflict

* fix executor if cond

---------
Co-authored-by: Frida-a <2624653516@qq.com>

0012c8d5

24 Aug, 2023 1 commit
- [SemiAuto] add static branch for shard_tensor (#56561) · dadfb099
  Committed by Leo Chen on Aug 24, 2023
```
* shard_tensor support static graph

* add comments

* add dy2static ut

* use property in c++ side
```
  dadfb099
23 Aug, 2023 1 commit
- [xdoctest] reformat example code with google style in No. 203 - 211 (#56473) · 8fe86ebb
  Committed by 张春乔 on Aug 23, 2023
```
* 203

* 204

* 205

* 206

* 207

* 208

* 209

* 210

* 211

* Update all_to_all.py

* Apply suggestions from code review
```
  8fe86ebb
22 Aug, 2023 5 commits

fix supplement_explicit_dependencies when amp-o2 (#56445) · c498ff33
Committed by zhaoyingli on Aug 22, 2023

c498ff33

[xdoctest] reformat example code with google style No.186-190 (#56166) · 17d6da6b

Committed by PommesPeter on Aug 22, 2023

* fix: updated code examples.

* fix: added paddle.seed

* fix: updated code style

* Apply suggestions from code review

* refactor: refine detail of code examples

* Update python/paddle/distributed/auto_parallel/static/process_mesh_v2.py

* fix: refine detail

* fix: refine detail

* Update python/paddle/distributed/auto_parallel/static/process_mesh_v2.py
Co-authored-by: Nyakku Shigure <sigure.qaq@gmail.com>

* refactor: refine detail

* refactor: refine detail

* fix: refine doc

---------
Co-authored-by: Nyakku Shigure <sigure.qaq@gmail.com>

17d6da6b

[xdoctest] reformat example code with google style in No. 270 275-280 (#56476) · f02261b0
Committed by 张春乔 on Aug 22, 2023

f02261b0

Optimize the memory in the case of using `pipeline` strategy and new executor (#56397) · 9b5f6140

Committed by lzydev on Aug 22, 2023

* optimize the memory

* fix bug in static_build.cc

* fix bug when using logging

* change the static_build

* fix bug in windows

* fix code according to review

9b5f6140

update auto tuner in the multi nodes scene (#56374) · 2f69edc5
Committed by caozhou on Aug 22, 2023

2f69edc5

21 Aug, 2023 2 commits
- Change flags for mp async all reduce (#56456) · 88a975a0
  Committed by Ghost Screaming on Aug 21, 2023
  
  88a975a0
- fix dynamic to static when export LLM inference model (#56390) · 95c4bb41
  Committed by RichardWooSJTU on Aug 21, 2023
  
  95c4bb41
19 Aug, 2023 1 commit
- do not use fuse for sync param in dp (#56437) · 719d96b9
  Committed by Yuang Liu on Aug 19, 2023
  
  719d96b9
18 Aug, 2023 1 commit
- remove empty block program (#56355) · ee7877e4
  Committed by Leo Chen on Aug 18, 2023
```
* remove empty block program

* update implementation
```
  ee7877e4
17 Aug, 2023 1 commit

[Custom Device]add run_check support for custom device (#56318) · 0ba4a234

Committed by Kai Song on Aug 17, 2023

* [Custom Device]add run_check support for custom device

* fix error msg

* fix typo

* update for all custom device

* fix

* add warning msg

0ba4a234

16 Aug, 2023 2 commits

Add mp_all_reduce asynchronize overlap. (#55662) · 6b1dfb5f

Committed by Ghost Screaming on Aug 16, 2023

* [WIP] Add mp_all_reduce asynchronize overlap.

* Fix some problems.

* Fix dw compute bug, and use a temporary solution to achieve overlap.

* Use fused_linear_param_grad_add to compute dw.

* Reformat ColumnParallel _overlap_linear. Use environment flags to
control the following behaviors:
1. export Flags_mp_aysnc_allreduce=True to turn on mp async all_reduce.
2. export Flags_skip_mp_c_identity=True to skip two c_identity operators
   in dygraph mode.
3. export Flags_fused_linear_param_grad_add to enable fused_linear_param_grad_add
   in ColumnParallel backward with mp async all_reduce.

* Polish code.

* Remove useless communication API.

* Fix some problems in mp_async_all_reduce and skip_c_identity.

* Add test cases.

* Remove environment variable Flags_fused_linear_param_grad_add in test case.

* Reset error threshold.

* Reset threshold in test case.

* Add useful log. Remove useless test cases.
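
The three environment flags named in the commit message above could be read from Python roughly as follows. This is a minimal sketch: the flag names are copied verbatim from the commit message (including the `aysnc` spelling), but the helpers `env_flag` and `read_mp_flags` and the rule for which strings count as "on" are assumptions for illustration, not Paddle's actual parsing logic.

```python
import os

# Flag names copied verbatim from the commit message above.
MP_FLAGS = (
    "Flags_mp_aysnc_allreduce",
    "Flags_skip_mp_c_identity",
    "Flags_fused_linear_param_grad_add",
)


def env_flag(name, environ=None):
    """Hypothetical helper: treat '1' or 'true' (any case) as enabled."""
    env = os.environ if environ is None else environ
    return env.get(name, "False").strip().lower() in ("1", "true")


def read_mp_flags(environ=None):
    """Return a dict mapping each flag name to its boolean state."""
    return {name: env_flag(name, environ) for name in MP_FLAGS}


if __name__ == "__main__":
    # Shell equivalent of the first flag: export Flags_mp_aysnc_allreduce=True
    os.environ["Flags_mp_aysnc_allreduce"] = "True"
    for name, enabled in read_mp_flags().items():
        print(f"{name}: {'on' if enabled else 'off'}")
```

Passing a plain dict as `environ` keeps the helper easy to test without touching the real process environment.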

6b1dfb5f

make params_grads order same between dynamic and auto_parallel (#56126) · 496422e9

Committed by zhaoyingli on Aug 16, 2023

* make params_grads order same between dynamic and static mode

* revert inplace clip

* use sorted attribute to control

* tiny fix

* fix find loss_grad_op

496422e9

15 Aug, 2023 1 commit

Fix `sharding_pass` and "nop" op to improve GC strategy (#56283) · ac44d798

Committed by lzydev on Aug 15, 2023

* Improve GC for pipeline parallel

* Delete print

* fix bug of nop_op and sharding

---------
Co-authored-by: chenruibiao <chenruibiao@baidu.com>

ac44d798

14 Aug, 2023 5 commits

[xdoctest] reformat example code with google style in 202 (#56171) · 08d726f5

Committed by 张春乔 on Aug 14, 2023

* input.py

* Update python/paddle/nn/functional/input.py

* Update input.py

* Update all_gather.py

* Update all_gather.py

08d726f5

[xdoctest] reformat example code with google style in No. 231 (#56213) · 61b2bb57
Committed by 张春乔 on Aug 14, 2023

61b2bb57

[AutoTuner] Add GBS search, gpu memory usage (#55466) · 4c0c458a

Committed by Azure on Aug 14, 2023

* temp commit

* distribute best cfg

* update metric extracting

* fix bugs of prune and reading log

* fix adding cfg bug

* reset status

* remove alarm and set logdir

* deepcopy ctx

* change alarm

* fix restart bug

* best no need alarm

* add gbs search, add gpu memory to history csv, add memory detect

* fix bug

* fix memory read bug; fix etcd connection bug

* fix memory read bug, add oom detection for all ranks

* fix read log and oom detection, add error code for read log

* add unit test

* Update master.py

---------
Co-authored-by: caozhou <caozhou@radi.ac.cn>

4c0c458a

[xdoctest] reformat example code with google style in No. 214 (#56212) · 0b6b2d35
Committed by 张春乔 on Aug 14, 2023

0b6b2d35
[xdoctest] reformat example code with google style in No. 212 (#56211) · 77da237b
Committed by 张春乔 on Aug 14, 2023

77da237b

11 Aug, 2023 4 commits

remove the optimizer base and learning rate base (#56099) · 6eaed2da
Committed by LoneRanger on Aug 11, 2023
```
* remove the optimizer base and learning rate base

* fix bug

* fix bug
```
6eaed2da
enable FLAGS_apply_pass_to_program_in_default (#55911) · 28f74a0e
Committed by kangguangli on Aug 11, 2023

28f74a0e

replace fluid.io.load_inference_model, fluid.io.save_inference_model in fluid... · bfc64801

Committed by Difer on Aug 11, 2023

replace fluid.io.load_inference_model, fluid.io.save_inference_model in fluid with 2.0 version (#55345)

* replace fluid.io.load_inference_model

* replace fluid.io.save_inference_model

* fix some bug

* fix some bugs of load & save model

* fix some bug

* fix test_inference_model_io bug

* fix word2vec_inference_model bug

* fix some bug

* fix valueError bug

* fix some bug

* fix a warning error

* for debug

* for debug

* fix io error

* fix test_wordvec_book error

* remove debug print

* fix load_var bug

* for debug cinn test

* revert cinn & fix inference_pass_test in windows

* fix some bugs

* revert cinn & fix inference_pass_test in windows

* for debug vars

* for debug

* fix quant_dequant_test

* fix some path errors

* remove fluid save/load

* fix incubate-fleet save

* move some from fluid.io to static.io

bfc64801

move some fluid apis (#55986) · eafc9889

Committed by Difer on Aug 11, 2023

* move fluid apis

* fix type error

* remove static exponential_decay

* fix some import error

* remove nn.py

* fix some error

* fix type error

eafc9889

09 Aug, 2023 2 commits

remove the... · 723c6f77

Committed by LoneRanger on Aug 09, 2023

remove the AdamOptimizer, SGDOptimizer, MomentumOptimizer, ModelAverage, LookaheadOptimizer, FtrlOptimizer, DecayedAdagradOptimizer, DpsgdOptimizer in fluid and relocate the ExponentialMovingAverage, PipelineOptimizer, GradientMergeOptimizer and change optimizer base for LarsMomentumOptimizer and RecomputeOptimizer (#55970)

* change the optimizer base for SGDOptimizer

* change the optimizer base for SGDOptimizer

* replace the SGDOptimizer with SGD

* fix bug of sgd

* change the optimizer base for MomentumOptimizer

* fix the remaining tests

* remove the Momentum in fluid/optimizer.py

* fix bug

* fix bug

* fix bug

* fix bug

* Update test_resnet_cinn.py

* Update test_resnet_prim_cinn.py

* fix bug

* fix bug

* fix bug

* remove the ModelAverage in fluid

* remove the LookaheadOptimizer in fluid

* fix bug

* remove AdamOptimizer in fluid

* Update test_image_classification_fp16.py

* fix bug

* relocate the ExponentialMovingAverage in fluid

* restore the static api

* remove the FtrlOptimizer in fluid

* remove the DecayedAdagradOptimizer in fluid

* remove the DpsgdOptimizer in fluid

* fix bug

* fix codestyle

* fix bug

* fix bug

* relocate the PipelineOptimizer

* relocate the GradientMergeOptimizer

* fix bug

* fix bug

* fix bug

* fix doc

* Update __init__.py

* Update test_fleet_qat_meta_optimizer.py

* change optimizer base for LarsMomentumOptimizer

* fix bug

* fix conflict

* fix code-style

* fix sample codes

* fix bug

* fix bug

* fix cinn bug

* fix bug

* fix bug

* Update qat_optimizer.py

* Update __init__.py

* fix bug

* change optimizer base for RecomputeOptimizer

* fix bug

* fix bug

* Update test_imperative_optimizer_v2.py

723c6f77

cherry pick #55651 and #55890 (#56063) · fa878846
Committed by Yuang Liu on Aug 09, 2023

fa878846

08 Aug, 2023 3 commits
- Improve GC for pipeline parallel (#56022) · 28b8adb1
  Committed by Ruibiao Chen on Aug 08, 2023
```
* Improve GC for pipeline parallel

* Delete print
```
  28b8adb1
- Open FLAGS_new_executor_static_build in auto_parallel (#56016) · d87d8b02
  Committed by Sonder on Aug 08, 2023
```
* open

* update
```
  d87d8b02
- fix bug for fused_linear_grad_add and main_grad (#56030) · edd5e9a8
  Committed by Yuang Liu on Aug 08, 2023
  
  edd5e9a8
07 Aug, 2023 1 commit
- Make tcp store as a global instance (#55956) · 0434b828
  Committed by LiYuRio on Aug 07, 2023
```
* make tcp store a global instance

* fix windows compile error
```
  0434b828

PaddlePaddle / Paddle synced successfully about 1 year ago