提交 · 54fcd9a97285e8e7e6600c58e198467041c9afb3 · PaddlePaddle / Paddle

31 8月, 2023 2 次提交

[AutoParallel] Adapt static spmd rules for dynamic graph (#56367) · 54fcd9a9

由 Chen Weihang 提交于 8月 31, 2023

* move matmul spmd rules into phi

* add basic infer spmd utils

* addspmd factory

* fix compile error

* add unittest

* refine infer spmd test and utils

* debug infer spmd test

* adapt python test

* poish details

* change to vector attr arg

* revert needless change

* update matmul spmd rule test

* remove original rule

* polish details

* fix marco error

* add comment

* pass backward test

* fix compile error

* add cmake rule for spmd_rules_test

* add dist meta tensor

* update pybind impl

* add marco for rules

54fcd9a9

Add approval for using cc_test_old (#56715) · dbc9e5a8

由 risemeup1 提交于 8月 31, 2023

* add cc_test_old approve,test=document_fix

* add cc_test_old approve,test=document_fix

* add cc_test_old approve,test=document_fix

dbc9e5a8

30 8月, 2023 20 次提交

K
[NewIR] fix logical op infermeta (#56711) · 987cb97e
由 kangguangli 提交于 8月 30, 2023
```
* fix logical op infermeta

* add test

* adpat inplace api
```
987cb97e

Add paddle custom flags support (#56256) · 2ef4ec71

由 huangjiyi 提交于 8月 30, 2023

* update

* repalce gflags header

* replace DEFINE_<type> with PD_DEFINE_<type>

* fix bug

* fix bug

* fix bug

* update cmake

* add :: before some paddle namespace

* fix link error

* fix CI-Py3

* allow commandline parse

* fix SetFlagsFromEnv

* fix bug

* fix bug

* fix CI-CINN

* fix CI-Coverage-build

* fix CI-Windows-build

* fix CI-Inference

* fix bug

* fix bug

* fix CI-CINN

* fix inference api test

* fix infer_ut test

* revert infer_ut gflags usage

* update

* fix inference

* remove flags export macro

* revert inference demo_ci gflags usage

* update

* update

* update

* update

* update

* update

* update

* update

* fix bug when turn on WITH_GFLAGS

* turn on WITH_GFLAGS

* fix bug when turn on WITH_GFLAGS

* fix bug when turn on WITH_GFLAGS

* update

* update and add unittest

* add unittest

* fix conflict

* rerun ci

* update

* resolve conflict

2ef4ec71

小

[xdoctest][task 239] reformat example code with google style in... · 1c858591

由小飞猪提交于 8月 30, 2023

[xdoctest][task 239] reformat example code with google style in `python/paddle/incubate/asp/asp.py` (#56731)

* [Doctest]fix No.239, test=docs_preview

* fix style

1c858591

张

[xdoctest] reformat example code with google style in No. 201 (#56472) · 5f3c7ba4

由张春乔提交于 8月 30, 2023

* xdoc

* Update python/paddle/tensor/einsum.py

* Update einsum.py

* Apply suggestions from code review

* Update einsum.py

* Apply suggestions from code review

5f3c7ba4

N

[clang-tidy][task 5] enable `modernize-make-shared` and fix existing linter errors (#55807) · ac80251a
由 Nyakku Shigure 提交于 8月 30, 2023

ac80251a
X
Fix bugs of third party (#56670) · e285234c
由 xuxinyi389 提交于 8月 30, 2023
```
* fix bugs of tp

* fix bugs of tp

* fix bugs

* fix bugs

* fix bugs of md5
```
e285234c
R

[ROCM] Remove the constraint with a maximum number of threads per block of 256, P4 (#56702) · 8c154880
由 ronnywang 提交于 8月 30, 2023

8c154880
W

[NewIR]Gen ops_api.cc for static mode (#56653) · 59b2ad39
由 WangZhen 提交于 8月 30, 2023

59b2ad39
K
[NewIR] add_arg_mapping_for_fetch (#56752) · 1692af99
由 kangguangli 提交于 8月 30, 2023
```
* add_arg_mapping_for_fetch

* fix

* fix
```
1692af99

[Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5

由 Ghost Screaming 提交于 8月 30, 2023

* for verify

fluid operator support new comm library

* u

* u

* u

* compatiable new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.

* Remove useless comments in process_group.py

* Polish code style.

* Fix some problems.

* Remove use fluid api in phi comm_context_manager.

* Add PPADDLE_WITH_CUDA and PADDLE_WITH_NCCL micro judgement.

* Fix bug of HIP architecture.

* Fix some problems.
1. remove useless loggings.
2. Fix conditional compilation for HIP.
3. Fix problems of test_pass_generation_pipeline.py. It calls paddle.distributed.init_parallel_env() at first,
then auto.Engine calls _init_comm(), which will calls process_group.instantiate(). However, init_parallel_env() will call
paddle.distributed.barrier(), it will call CreateNCCLEnvCache and create corresponding NCCLCommContext. But dev_id is not
set, as a result, NCCLCommContext's dev_ctx is not initialized.

* Fix some problems.

* Polish code.

* Polish code.

* Revert compatiable upgrade for communication operators. Their upgrades
will be submitted in another PR.

* Remove StaticTCPStore.

* Remove useless modification.

* Remove useless set_cuda_device_id.

* Polish code.

* Remove fluid header files in phi files.

* Remove useless comments.

* Fix problems of hip arch.

* Fix some problems.

* Polish code.

* Polish code style.

---------
Co-authored-by: hitywt <yuwentao126@126.com>

ade51aa5

[IR] Rigister LegacyKernelOp into KernelDialect (#56680) · ded10442

由 chen2016013 提交于 8月 30, 2023

* Register LegacyKernelDialect & Rigister LegacyKernelOp

* fix code style

* delete LegacyKernelDialect ,register LegacyKernelOp into PaddleKernelDialect

* fix bug

* change as reviewed comments

* bug fix

* bug fix

* try to restart coverage CI

* pass legacy op to kernel pass

* fix code style

* fix code style

* fix code style

ded10442

[Prim][NewIR] Support prim all in new IR (#56614) · e457c298

由 cyber-pioneer 提交于 8月 30, 2023

* support prim all in new ir

* process makefile

* fix rule bug

* polish case

* fix flag

* fix rules bug

e457c298

N

[docs] fix api labels in math.py (#56682) · 5d164968
由 Nyakku Shigure 提交于 8月 30, 2023

5d164968
R

[CustomDevice] Fix error that query a destroyed event (#56745) · c5786be1
由 ronnywang 提交于 8月 30, 2023

c5786be1
N
[clang-tidy][task 61] enable `hicpp-exception-baseclass` and fix existing errors (#55847) · 31a96888
由 Nyakku Shigure 提交于 8月 30, 2023
```
* [clang-tidy] enable `hicpp-exception-baseclass` and fix existing errors

* config

* update error format to pass the ci check (at least 20 chars)
```
31a96888
Y

[Doctest]fix No.303, test=docs_preview (#56777) · 609c0321
由 yoyoIcy 提交于 8月 29, 2023

609c0321
G

[clang-tidy] enable clang-analyzer-optin.cplusplus.UninitializedObject check (#56648) · 6d19073a
由 gouzil 提交于 8月 30, 2023

6d19073a

【complex op】No.6 add complex support for logical_and/or/xor/not (#56323) · 5cbf5bd4

由 iSerendipity 提交于 8月 30, 2023

* 【complex op】No.6 add complex support for logical_and/or/xor/not

* fix dtype check

* modify the docs

* add special condition for not raise when x.dtype is complex

* add random generate for complex dtype

* fix generate for complex

* fix

* fix

* add corner case for complex type

* fix ut

* fix ut

5cbf5bd4

L
[xdoctest] reformat example code with google style in No.6-No.10 (#56146) · fc1e505e
由 LoneRanger 提交于 8月 30, 2023
```
* fix sample code

* fix bug

* fix bug

* Update regularizer.py

* Update __init__.py

* Update decorator.py

* fix code-style
```
fc1e505e

张

[xdoctest] reformat example code with google style in No.307 (#56595) · 34eecb0e

由张春乔提交于 8月 30, 2023

* weight_norm_hook

* Update weight_norm_hook.py

* Update weight_norm_hook.py

* Update python/paddle/nn/utils/weight_norm_hook.py

* Update python/paddle/nn/utils/weight_norm_hook.py

* Update python/paddle/nn/utils/weight_norm_hook.py
Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>

* xdoc

* Apply suggestions from code review

* Apply suggestions from code review

---------
Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>

34eecb0e

29 8月, 2023 18 次提交
- R
  
  [CustomDevice] Not reset pass_builder (#56755) · 220f13bd
  由 ronnywang 提交于 8月 29, 2023
  
  220f13bd
- Z
  [NewIR] support c_sync_calc_stream/c_sync_comm_stream/send_v2/recv_v2 (#56557) · 0ce66c1c
  由 zhaoyingli 提交于 8月 29, 2023
```
* [AutoParallel][NewIR] support calc_sync/comm_sync/send_v2/recv_v2

* pre-commit

* rm unittest

* tiny fix

* api_gen support send_v2's output is empty

* fix format

* python_c_gen support send_v2
```
  0ce66c1c
- F
  Fix instant variable oom in paddle2cinn (#56662) · df9d9c59
  由 Fisher 提交于 8月 29, 2023
```
When using paddle2cinn, CompilationContext.with_instantiate_variables should be set to false, otherwise CINN will instant and manage variables memory, this leads to double the memory usage, which eventually leads to out of memory error.
This PR will set CompilationContext.with_instantiate_variables to false before context pass to constructing the graph compiler.
```
  df9d9c59
- C
  Vjp autogen for grad list op(split) (#56720) · 128f95a1
  由 Chen Zhiyang 提交于 8月 29, 2023
```
* add vjp code gen for SplitOp

* change vjp manual file name
```
  128f95a1
- 张
  [xdoctest] reformat example code with google style in No. 299 (#56597) · b0b827c7
  由张春乔提交于 8月 29, 2023
```
* Update dlpack.py

* Apply suggestions from code review

* Apply suggestions from code review

* xdoc

* Apply suggestions from code review

* Apply suggestions from code review
```
  b0b827c7
- L
  [New-IR] add pass registry (#56729) · 9999e849
  由 Leo Chen 提交于 8月 29, 2023
```
* add pass registry

* add pass registry macro
```
  9999e849
- 张
  [xdoctest] reformat example code with google style in No. 240 (#56474) · fc1e1b77
  由张春乔提交于 8月 29, 2023
```
* 240

* fix bugs

* fix bugs
```
  fc1e1b77
- S
  Remove need_move_to_phi (#56371) · daac3829
  由 Sonder 提交于 8月 29, 2023
```
* remove flag

* open static build flag

* add searchsorted to list

* add register info for fused layernorm

* fix fused_layernorm_kernel output registe info

* fix stft registe info

* add include

* fix registe info

* add skip fake init for fused_layernorm:residual_out

* fix error

* add distributed_fused_lamb_init to StaticBuildBlackList

* set static_build flag to false
```
  daac3829
- D
  [DCU] support cum & multinomial for dcu (#56612) · 0c3e4cf6
  由 duanyanhui 提交于 8月 29, 2023
```
* support cum & multinomial for dcu

* rm commt
```
  0c3e4cf6
- R
  
  [ROCM] Remove the constraint with a maximum number of threads per block of 256, P2 (#56700) · 76b328bc
  由 ronnywang 提交于 8月 29, 2023
  
  76b328bc
- R
  
  [ROCM] Remove the constraint with a maximum number of threads per block of 256, P3 (#56701) · 593a4428
  由 ronnywang 提交于 8月 29, 2023
  
  593a4428
- [Doctest]fix No.218, test=docs_preview (#56730) · 41e72a41
  由 iSerendipity 提交于 8月 29, 2023
  
  41e72a41
- 小
  [xdoctest][task 200] reformat example code with google style in... · d64deaae
  由小飞猪提交于 8月 29, 2023
```
[xdoctest][task 200] reformat example code with google style in `python/paddle/tensor/creation.py` (#56685)

* [Doctest]fix No.200, test=docs_preview

* fix output

* add retain_grads

* fix style
```
  d64deaae
- 张
  
  Modify the docs of UpsamplingNearest2D and UpsamplingBilinear2D (#56728) · 5583f277
  由张春乔提交于 8月 29, 2023
  
  5583f277
- X
  【new ir】modify test comp divide_grad (#56697) · 7a633e64
  由 xiaoguoguo626807 提交于 8月 29, 2023
```
* modify test comp grad

* modify test comp grad
```
  7a633e64
- X
  
  Modified the document for the paddle.diff() function. (#56736) · ad93dc0c
  由 Xavier ZXY 提交于 8月 29, 2023
  
  ad93dc0c
- G
  [Fluid] move lars_momentum to phi (#55798) · b0c2ee26
  由 gouzil 提交于 8月 29, 2023
```
* [Fluid] move lars_momentum to phi

* add sig

* fix optional Output

* off check_dygraph

* fix input

* fix operator[]

* fix

* try fix AllocateTmpTensor

* fix

* fix type

* Update paddle/phi/kernels/gpu/lars_momentum_kernel.cu

* fix type

* rollback

* Add Registration

* try fix win

* try fix win

* try use double

* try use operator *(float,const Derived &)

* try auto

* fix

* fix

* fix

* fix dtype

* fix type

* fix index
```
  b0c2ee26
- L
  
  make variable_length_memory_efficient_attention supports mask_broadcast_heads (#56673) · 6839a7b9
  由 lzy 提交于 8月 29, 2023
  
  6839a7b9

PaddlePaddle / Paddle 接近 2 年 前同步成功

PaddlePaddle / Paddle
接近 2 年前同步成功