提交 · e9e07a19b6bc0fcfc16ef3e892a558ebf78e05c5 · PaddlePaddle / Paddle

05 9月, 2023 2 次提交

fix some bugs for amp and test case test_tuning_recompute_with_amp.py (#56864) · e9e07a19

由 Wennie396 提交于 9月 05, 2023

* replace amp.use_pure_fp16 with amp.dtype and amp.level

* old api still use use_pure_fp16

* test_fuse_adamw_pass still use use_pure_fp16

* add test case tuning recompute with amp(float16,o2)

* reset new test case properties TIMEOUT 60

* set smaller value of batch_size and batch_num

* deepcopy dist_context fix _rename_input problem

* fix loss name after cast

* set tuning.enable=True and use engine._tune()

* restore some changes in _rename_input()/_rename_output()

* add self.amp_dtype for _cast_loss() in auto_parallel_amp.py

* fix insert op index in _cast_loss()

e9e07a19

小

[xdoctest][task 184-185] reformat example code with google style in... · 1a15a351

由小飞猪提交于 9月 05, 2023

[xdoctest][task 184-185] reformat example code with google style in `distributed/auto_parallel/static/*` (#56666)

* [Doctest]fix No.184,185, test=docs_preview

* add env skip

* fix @staticmethod

* fix

* add xdoctest for v2

* fix

1a15a351

31 8月, 2023 3 次提交
- C
  
  add op cost interface (#56803) · 51ba2a0f
  由 caozhou 提交于 8月 31, 2023
  
  51ba2a0f
- Z
  
  [AutoParallel]organize dataloder in engine (#56788) · 2ea7a6a3
  由 zhaoyingli 提交于 8月 31, 2023
  
  2ea7a6a3
- R
  
  Throw error for NVCC lazy in 1F1B pipeline (#56725) · 25820216
  由 Ruibiao Chen 提交于 8月 31, 2023
  
  25820216
30 8月, 2023 1 次提交

[Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5

由 Ghost Screaming 提交于 8月 30, 2023

* for verify

fluid operator support new comm library

* u

* u

* u

* compatiable new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.

* Remove useless comments in process_group.py

* Polish code style.

* Fix some problems.

* Remove use fluid api in phi comm_context_manager.

* Add PPADDLE_WITH_CUDA and PADDLE_WITH_NCCL micro judgement.

* Fix bug of HIP architecture.

* Fix some problems.
1. remove useless loggings.
2. Fix conditional compilation for HIP.
3. Fix problems of test_pass_generation_pipeline.py. It calls paddle.distributed.init_parallel_env() at first,
then auto.Engine calls _init_comm(), which will calls process_group.instantiate(). However, init_parallel_env() will call
paddle.distributed.barrier(), it will call CreateNCCLEnvCache and create corresponding NCCLCommContext. But dev_id is not
set, as a result, NCCLCommContext's dev_ctx is not initialized.

* Fix some problems.

* Polish code.

* Polish code.

* Revert compatiable upgrade for communication operators. Their upgrades
will be submitted in another PR.

* Remove StaticTCPStore.

* Remove useless modification.

* Remove useless set_cuda_device_id.

* Polish code.

* Remove fluid header files in phi files.

* Remove useless comments.

* Fix problems of hip arch.

* Fix some problems.

* Polish code.

* Polish code style.

---------
Co-authored-by: hitywt <yuwentao126@126.com>

ade51aa5

29 8月, 2023 1 次提交

小

[xdoctest][task 181-183] reformat example code with google style in... · 51c3c66b

由小飞猪提交于 8月 29, 2023

[xdoctest][task 181-183] reformat example code with google style in `sparse/multiary.py`,`distributed/auto_parallel/*` (#56665)

* [Doctest]fix No.181-183, test=docs_preview

* add env skip

51c3c66b

28 8月, 2023 1 次提交
- W
  fix fetch problem in pass_utils.py and eval_loss in parallelizer_v2.py (#56539) · c7727885
  由 Wennie396 提交于 8月 28, 2023
```
* fix eval_loss bug in parallelizer_v2.py

* fix fetch problem in pass_utils.py
```
  c7727885
25 8月, 2023 1 次提交

[AutoParallel] remove pyreader, use feed op in pipeline schedule (#56511) · 0012c8d5

由 zhaoyingli 提交于 8月 25, 2023

* modify feed_data for dataloader in pipline parallel mode

* add pre-commit

* remove read op, use feed op

* fix validate batch_size

* tiny fix

* support catch EOFException

* fix conflict

* fix conflict

* fix executor if cond

---------
Co-authored-by: Frida-a <2624653516@qq.com>

0012c8d5

24 8月, 2023 1 次提交
- L
  [SemiAuto] add static branch for shard_tensor (#56561) · dadfb099
  由 Leo Chen 提交于 8月 24, 2023
```
* shard_tensor support static graph

* add comments

* add dy2static ut

* use property in c++ side
```
  dadfb099
22 8月, 2023 2 次提交

Z

fix supplement_explicit_dependencies when amp-o2 (#56445) · c498ff33
由 zhaoyingli 提交于 8月 22, 2023

c498ff33

[xdoctest] reformat example code with google style No.186-190 (#56166) · 17d6da6b

由 PommesPeter 提交于 8月 22, 2023

* fix: updated code examples.

* fix: added paddle.seed

* fix: updated code style

* Apply suggestions from code review

* refactor: refine detail of code examples

* Update python/paddle/distributed/auto_parallel/static/process_mesh_v2.py

* fix: refine detail

* fix: refine detail

* Update python/paddle/distributed/auto_parallel/static/process_mesh_v2.py
Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>

* refactor: refine detail

* refactor: refine detail

* fix: refine doc

---------
Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>

17d6da6b

16 8月, 2023 1 次提交

make params_grads order same bewteen dynamic and auto_parallel (#56126) · 496422e9

由 zhaoyingli 提交于 8月 16, 2023

* make params_grads order same bewteen dynamic and static mode

* revert inplace clip

* use sorted attribute to control

* tiny fix

* fix find loss_grad_op

496422e9

11 8月, 2023 1 次提交
- L
  remove the optimizer base and learning rate base (#56099) · 6eaed2da
  由 LoneRanger 提交于 8月 11, 2023
```
* remove the optimizer base and learning rate base

* fix bug

* fix bug
```
  6eaed2da
08 8月, 2023 1 次提交
- S
  Open FLAGS_new_executor_static_build in auto_parallel (#56016) · d87d8b02
  由 Sonder 提交于 8月 08, 2023
```
* open

* update
```
  d87d8b02
02 8月, 2023 1 次提交
- Z
  make places configurable for DistributedDataLoader (#55873) · f3b7092c
  由 zhaoyingli 提交于 8月 02, 2023
```
* Update autoparallel DistributedDataLoader

* add places for engine.dataloder()
```
  f3b7092c
31 7月, 2023 1 次提交
- D
  reaplce fill_constant_batch_size_like (#55522) · 6f53d3b2
  由 Difer 提交于 7月 31, 2023
```
* simple reaplce

* for debug

* fix bugs

* fix some bugs

* del fill_constant_batch_size_like
```
  6f53d3b2
24 7月, 2023 1 次提交

[AutoParallel] Add shard tensor and DistAttr api (#55494) · bd60757d

由 Chen Weihang 提交于 7月 24, 2023

* add shard tensor api

* add DistAttr api

* add unittest for coverage

* fix process mesh sample code

* fix checking error

bd60757d

20 7月, 2023 1 次提交
- L
  
  polish some code (#55583) · f172b02f
  由 Leo Chen 提交于 7月 20, 2023
  
  f172b02f
19 7月, 2023 1 次提交
- Z
  
  [AutoParallel] keep lr_sheduler same bewteen executor and engine (#55516) · 36bc5511
  由 zhaoyingli 提交于 7月 19, 2023
  
  36bc5511
13 7月, 2023 1 次提交
- R
  Support nvprof for auto parallel (#55347) · 9210b1af
  由 Ruibiao Chen 提交于 7月 13, 2023
```
* Support nvprof for auto parallel

* Fix CI errors

* Fix CI errors
```
  9210b1af
06 7月, 2023 1 次提交

remove allreduce before c_allgather (#55143) · c234f1f2

由 zhaoyingli 提交于 7月 06, 2023

* remove allreduce before c_allgather

* update reshard insert_fill_constant_op func

* insert_fill_constant_op add shape arg

c234f1f2

29 6月, 2023 1 次提交
- Z
  add skip_gc_vars for 1f1b schedule mode (#54938) · 26980b7b
  由 zhaoyingli 提交于 6月 29, 2023
```
* add skip_gc_vars for 1f1b schedule mode

* add pp_degree and pp_stage
```
  26980b7b
27 6月, 2023 1 次提交

[Semi-Auto] SPMD Parallel Rule Base (#53863) · 6863e2ae

由 JZ-LIANG 提交于 6月 27, 2023

* base rule

* add sharidng merge

* add sharidng axis merge

* define unified data class for inferencing dist_attr

* test wrap DistTensorSpec in dygraph mode

* matmul main logic done

* define unified data class for inferencing dist_attr

---------
Co-authored-by: NYichen Zhang <zhangyichen03@baidu.com>

6863e2ae

25 6月, 2023 1 次提交

auto parallel support pipeline scheduler with standalone executor (#54727) · a702e170

由 zhaoyingli 提交于 6月 25, 2023

* auto parallel support pipeline scheduler with standalone executor

* rm check_fetch

* update cmakelist and flags env

* rm set micro batch id

* rm import

* update utils func

* raise error when merge tensor for return_numpy is False

* fix _pipeline_opt

* fix unittest

a702e170

12 6月, 2023 1 次提交
- N
  
  bump black to 2023 style (#54523) · 44e0393c
  由 Nyakku Shigure 提交于 6月 12, 2023
  
  44e0393c
09 6月, 2023 1 次提交
- N
  bump ruff to 0.0.272 and update config (#54449) · 8f65f72e
  由 Nyakku Shigure 提交于 6月 09, 2023
```
* bump ruff to 0.0.271 and update config

* exclude third_party

* bump ruff to 0.0.272

* refine config
```
  8f65f72e
08 6月, 2023 1 次提交
- L
  eager call all2all to avoid p2p hang in lazy init (#54431) · 56fd25b8
  由 Leo Chen 提交于 6月 08, 2023
```
* eager call all2all to avoid p2p hang in lazy init

* update
```
  56fd25b8
02 6月, 2023 1 次提交
- Z
  [AutoParallel] Add 1F1B Pass (#54260) · 988c58e5
  由 zhaoyingli 提交于 6月 02, 2023
```
* [AutoParallel] add 1F1B

* rm amp
```
  988c58e5
01 6月, 2023 1 次提交

[AutoParallel] update pipeline pass for while control_flow (#54224) · 81c13b86

由 zhaoyingli 提交于 6月 01, 2023

* [AutoParallel] update while control_flow with pipeline

* update process group instantiate

* fix micro_bsz for reshard

* update api for micro batch size

* add strategy for dp optimization

81c13b86

30 5月, 2023 1 次提交
- Y
  [Auto Parallel] Reorganize the fold structure (#54059) · 7f696804
  由 Yulong Ao 提交于 5月 30, 2023
```
* [Auto Parallel] Reorganize the fold structure

* [Auto Parallel] Fix some import errors
```
  7f696804
26 5月, 2023 1 次提交
- Z
  [AutoParallel] update every rank has global view process_groups (#54067) · 1c465824
  由 zhaoyingli 提交于 5月 26, 2023
```
* global view process_group

* fix import

* fix attr

* fix tunner init comm
```
  1c465824
23 5月, 2023 2 次提交
- C
  
  Fix typos (#53960) · d89e0367
  由 co63oc 提交于 5月 23, 2023
  
  d89e0367
- R
  [CustomDevice] fix auto_paralell (#53842) · 3aa5d64e
  由 ronnywang 提交于 5月 23, 2023
```
* [CustomDevice] fix auto_paralell

* update

* update

* update
```
  3aa5d64e
22 5月, 2023 1 次提交

[dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() (#53856) · 3794d171

由 Meteor Liu 提交于 5月 22, 2023

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* fixed cyclic reference that caused patial import

* fixed bad change

* fix bad import

* fix bad import

* fix bad import

* fix ut failed caused by change in_dynamic_mode

* fix ut failed caused by change in_dynamic_mode

* fixed usage of in_dynamic_mode() or in_dygraph_mode()

* revert python3 to python in .pre-commit-config.yaml

* fix merge conflicts

3794d171

18 5月, 2023 1 次提交
- 张
  rm cmake npu (#53869) · 79ce3fac
  由张春乔提交于 5月 18, 2023
```
* rm cmake npu

* Update generic.cmake

* Update generic.cmake
```
  79ce3fac
16 5月, 2023 2 次提交

[Zero-Dim] update 0d tensor api en doc, test=document_fix (#53823) · 50f0acc0
由 zhouweiwei2014 提交于 5月 16, 2023

50f0acc0

张

由张春乔提交于 5月 16, 2023

* rm npu

* rm use_npu

* rm npuid

* rm use_npu

* rm npuid

* delete npupinned

* roll back sth.

* roll back sth.

* delete npupinned

* roll back sth.

* roll back sth.

* rm npu

* rollback something

* rollback npu identity

* rollback npu identity

5b054d2f

15 5月, 2023 1 次提交
- R
  
  [CustomDevice] add inference MP support, PART3 (#53703) · 56fded1b
  由 ronnywang 提交于 5月 15, 2023
  
  56fded1b
11 5月, 2023 1 次提交
- K
  move DataLoader code to paddle.io (#48699) · 793f3b93
  由 Kaipeng Deng 提交于 5月 11, 2023
```
* move DataLoader to paddle.io. test=develop
```
  793f3b93

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功