提交 · 7d8402a87fa88a689005aef2b23ff85fdb8d496b · PaddlePaddle / Paddle

04 9月, 2023 6 次提交
- H
  fix paddle namespace conflict when using paddle_flags (#56913) · 7d8402a8
  由 huangjiyi 提交于 9月 04, 2023
```
* update

* update

* update
```
  7d8402a8
- D
  
  optimize softmax_mask_fuse (#56877) · 25a0b46d
  由 duanyanhui 提交于 9月 04, 2023
  
  25a0b46d
- L
  
  reshard r to p (#56833) · a28e6f63
  由 LiYuRio 提交于 9月 04, 2023
  
  a28e6f63
- fix symbol redefined (#56766) · 413ca989
  由 engineer1109 提交于 9月 04, 2023
  
  413ca989
- W
  fix contiguous (#56863) · d7fc3781
  由 wanghuancoder 提交于 9月 04, 2023
```
* fix contiguous
```
  d7fc3781
- W
  
  fix set value inplace strided bug (#56892) · 83b942f3
  由 wanghuancoder 提交于 9月 04, 2023
  
  83b942f3
02 9月, 2023 4 次提交
- L
  polish code of pass and executor (#56886) · d74bfefe
  由 Leo Chen 提交于 9月 02, 2023
```
* polish code of pass and executor

* update ut
```
  d74bfefe
- A
  [NewIR]Refine and Split CINN Dilact directory (#56805) · 061bb9d5
  由 Aurelius84 提交于 9月 02, 2023
```
* [NewIR]Refine CINN Dilact directory

* fix conflict

* fix deps

* fix unittest deps
```
  061bb9d5
- R
  link C++ tests to libpaddle.so (#56829) · fa75ebeb
  由 risemeup1 提交于 9月 02, 2023
```
* link C++ tests to libpaddle.so except windows

* fix compile kill-9 bug

* fix compile kill-9 bug

* fix compile kill-9 bug

* fix compile kill-9 bug
```
  fa75ebeb
- C
  
  delete pd_op.yaml (#56862) · e110cbb4
  由 chen2016013 提交于 9月 02, 2023
  
  e110cbb4
01 9月, 2023 7 次提交

[PRIM][IR]Complete IR vjp code gen for more vjp code gen (#56798) · 4abea956

由 Charles-hit 提交于 9月 01, 2023

* Fix attr type error like concat axis

* Fix None input error

* Fix intermediate output

* support vjp code gen

---------
Co-authored-by: N0x45f <wangzhen45@baidu.com>

4abea956

【Complex op】add complex support for index_select and index_sample (#56457) · 0b608393

由 Scotty 提交于 9月 01, 2023

* support index_select op

* index_sample in cpu

* support index_sample in gpu

* change data_transform

* fix api gen and use skip_transform in yaml

0b608393

[NewIR]Part-2.1 Refactor NewIRCompiler to support Group Ops (#56762) · 7adb4703

由 Aurelius84 提交于 9月 01, 2023

* [NewIR]Part-2.1 Refactor NewIRCompiler to support Group Ops

* fix gflags link error

* fix include ir_printer.h

* fix unittest

* fix conflict

* fix flags

* fix comment

7adb4703

G

[clang-tidy] enable bugprone-incorrect-roundings check (#56747) · e8a96347
由 gouzil 提交于 9月 01, 2023

e8a96347

[clang-tidy] No.34,36 enable... · 17e4be21

由 cyberslack_lee 提交于 9月 01, 2023

[clang-tidy] No.34,36 enable performance-noexcept-move-constructor,modernize-use-transparent-functors (#56261)

* fix

* fix

* CI

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* CI

* fix

* CI

17e4be21

[IR] Generate pd_op.parsed.yaml from pd_op.yaml (#56674) · 962f67d2

由 chen2016013 提交于 9月 01, 2023

* Generate pd_op.parsed.yaml from pd_op.yaml

* Generate pd_op.parsed.yaml from pd_op.yaml

* fix bug

* bug fix

* bug fix

* bug fix

* 向pd_ops.yaml中新增算子 & 修改pd_ops.parsed.yaml存放路径

* 修复路径依赖bug & 添加 .gitignore文件

* fix bug - compat input args in save_combine op

* fix compat file

* fix set_value_with_tensor yaml

* split backward op in original yaml file

* add send_v2 & recv_v2

962f67d2

Z
[IR] Refine Int64 attribute translator logic (#56842) · 84ec8092
由 zhangbo9674 提交于 9月 01, 2023
```
* fix bug

* fix bug
```
84ec8092

31 8月, 2023 8 次提交
- H
  [NewIR]Fix install check bug (#56768) · 323566d5
  由 hong 提交于 8月 31, 2023
```
* fix install check bug

* fix bug
```
  323566d5
- L
  
  skip data_transfer for save op (#56775) · a31cf3d2
  由 Leo Chen 提交于 8月 31, 2023
  
  a31cf3d2
- L
  Add elementwise_add into Paddle-TRT NHWC support (#56795) · 97b09e81
  由 Leo Chen 提交于 8月 31, 2023
```
* Add elementwise_add support into NHWC IR
```
  97b09e81
- H
  [NewIR]New ir using kernel registrer type (#56789) · a34bdb64
  由 hong 提交于 8月 31, 2023
```
* update

* fix batch norm grad args def

* fix bug

* fix combine slice bug

* fix slice bug

* update builtin split

* disable using kernel resigter dtype

* polish code

* disable some test
```
  a34bdb64
- L
  
  use macro instead of functor (#56726) · 5425ad7f
  由 LiYuRio 提交于 8月 31, 2023
  
  5425ad7f
- Z
  
  [Fluid] Move distributed_fused_lamb_init to phi (#55993) · 0bc369ef
  由 Zero Rains 提交于 8月 31, 2023
  
  0bc369ef
- R
  
  [ROCM] Remove the constraint with a maximum number of threads per block of 256, P1 (#56699) · d7679426
  由 ronnywang 提交于 8月 31, 2023
  
  d7679426
- C
  [AutoParallel] Adapt static spmd rules for dynamic graph (#56367) · 54fcd9a9
  由 Chen Weihang 提交于 8月 31, 2023
```
* move matmul spmd rules into phi

* add basic infer spmd utils

* addspmd factory

* fix compile error

* add unittest

* refine infer spmd test and utils

* debug infer spmd test

* adapt python test

* poish details

* change to vector attr arg

* revert needless change

* update matmul spmd rule test

* remove original rule

* polish details

* fix marco error

* add comment

* pass backward test

* fix compile error

* add cmake rule for spmd_rules_test

* add dist meta tensor

* update pybind impl

* add marco for rules
```
  54fcd9a9
30 8月, 2023 8 次提交

Add paddle custom flags support (#56256) · 2ef4ec71

由 huangjiyi 提交于 8月 30, 2023

* update

* repalce gflags header

* replace DEFINE_<type> with PD_DEFINE_<type>

* fix bug

* fix bug

* fix bug

* update cmake

* add :: before some paddle namespace

* fix link error

* fix CI-Py3

* allow commandline parse

* fix SetFlagsFromEnv

* fix bug

* fix bug

* fix CI-CINN

* fix CI-Coverage-build

* fix CI-Windows-build

* fix CI-Inference

* fix bug

* fix bug

* fix CI-CINN

* fix inference api test

* fix infer_ut test

* revert infer_ut gflags usage

* update

* fix inference

* remove flags export macro

* revert inference demo_ci gflags usage

* update

* update

* update

* update

* update

* update

* update

* update

* fix bug when turn on WITH_GFLAGS

* turn on WITH_GFLAGS

* fix bug when turn on WITH_GFLAGS

* fix bug when turn on WITH_GFLAGS

* update

* update and add unittest

* add unittest

* fix conflict

* rerun ci

* update

* resolve conflict

2ef4ec71

N

[clang-tidy][task 5] enable `modernize-make-shared` and fix existing linter errors (#55807) · ac80251a
由 Nyakku Shigure 提交于 8月 30, 2023

ac80251a
W

[NewIR]Gen ops_api.cc for static mode (#56653) · 59b2ad39
由 WangZhen 提交于 8月 30, 2023

59b2ad39
K
[NewIR] add_arg_mapping_for_fetch (#56752) · 1692af99
由 kangguangli 提交于 8月 30, 2023
```
* add_arg_mapping_for_fetch

* fix

* fix
```
1692af99

[Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5

由 Ghost Screaming 提交于 8月 30, 2023

* for verify

fluid operator support new comm library

* u

* u

* u

* compatiable new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.

* Remove useless comments in process_group.py

* Polish code style.

* Fix some problems.

* Remove use fluid api in phi comm_context_manager.

* Add PPADDLE_WITH_CUDA and PADDLE_WITH_NCCL micro judgement.

* Fix bug of HIP architecture.

* Fix some problems.
1. remove useless loggings.
2. Fix conditional compilation for HIP.
3. Fix problems of test_pass_generation_pipeline.py. It calls paddle.distributed.init_parallel_env() at first,
then auto.Engine calls _init_comm(), which will calls process_group.instantiate(). However, init_parallel_env() will call
paddle.distributed.barrier(), it will call CreateNCCLEnvCache and create corresponding NCCLCommContext. But dev_id is not
set, as a result, NCCLCommContext's dev_ctx is not initialized.

* Fix some problems.

* Polish code.

* Polish code.

* Revert compatiable upgrade for communication operators. Their upgrades
will be submitted in another PR.

* Remove StaticTCPStore.

* Remove useless modification.

* Remove useless set_cuda_device_id.

* Polish code.

* Remove fluid header files in phi files.

* Remove useless comments.

* Fix problems of hip arch.

* Fix some problems.

* Polish code.

* Polish code style.

---------
Co-authored-by: hitywt <yuwentao126@126.com>

ade51aa5

[IR] Rigister LegacyKernelOp into KernelDialect (#56680) · ded10442

由 chen2016013 提交于 8月 30, 2023

* Register LegacyKernelDialect & Rigister LegacyKernelOp

* fix code style

* delete LegacyKernelDialect ,register LegacyKernelOp into PaddleKernelDialect

* fix bug

* change as reviewed comments

* bug fix

* bug fix

* try to restart coverage CI

* pass legacy op to kernel pass

* fix code style

* fix code style

* fix code style

ded10442

R

[CustomDevice] Fix error that query a destroyed event (#56745) · c5786be1
由 ronnywang 提交于 8月 30, 2023

c5786be1
N
[clang-tidy][task 61] enable `hicpp-exception-baseclass` and fix existing errors (#55847) · 31a96888
由 Nyakku Shigure 提交于 8月 30, 2023
```
* [clang-tidy] enable `hicpp-exception-baseclass` and fix existing errors

* config

* update error format to pass the ci check (at least 20 chars)
```
31a96888

29 8月, 2023 7 次提交

R

[CustomDevice] Not reset pass_builder (#56755) · 220f13bd
由 ronnywang 提交于 8月 29, 2023

220f13bd

[NewIR] support c_sync_calc_stream/c_sync_comm_stream/send_v2/recv_v2 (#56557) · 0ce66c1c

由 zhaoyingli 提交于 8月 29, 2023

* [AutoParallel][NewIR] support calc_sync/comm_sync/send_v2/recv_v2

* pre-commit

* rm unittest

* tiny fix

* api_gen support send_v2's output is empty

* fix format

* python_c_gen support send_v2

0ce66c1c

Fix instant variable oom in paddle2cinn (#56662) · df9d9c59

由 Fisher 提交于 8月 29, 2023

When using paddle2cinn, CompilationContext.with_instantiate_variables should be set to false, otherwise CINN will instant and manage variables memory, this leads to double the memory usage, which eventually leads to out of memory error.
This PR will set CompilationContext.with_instantiate_variables to false before context pass to constructing the graph compiler.

df9d9c59

C
Vjp autogen for grad list op(split) (#56720) · 128f95a1
由 Chen Zhiyang 提交于 8月 29, 2023
```
* add vjp code gen for SplitOp

* change vjp manual file name
```
128f95a1
L
[New-IR] add pass registry (#56729) · 9999e849
由 Leo Chen 提交于 8月 29, 2023
```
* add pass registry

* add pass registry macro
```
9999e849

Remove need_move_to_phi (#56371) · daac3829

由 Sonder 提交于 8月 29, 2023

* remove flag

* open static build flag

* add searchsorted to list

* add register info for fused layernorm

* fix fused_layernorm_kernel output registe info

* fix stft registe info

* add include

* fix registe info

* add skip fake init for fused_layernorm:residual_out

* fix error

* add distributed_fused_lamb_init to StaticBuildBlackList

* set static_build flag to false

daac3829

[Fluid] move lars_momentum to phi (#55798) · b0c2ee26

由 gouzil 提交于 8月 29, 2023

* [Fluid] move lars_momentum to phi

* add sig

* fix optional Output

* off check_dygraph

* fix input

* fix operator[]

* fix

* try fix AllocateTmpTensor

* fix

* fix type

* Update paddle/phi/kernels/gpu/lars_momentum_kernel.cu

* fix type

* rollback

* Add Registration

* try fix win

* try fix win

* try use double

* try use operator *(float,const Derived &)

* try auto

* fix

* fix

* fix

* fix dtype

* fix type

* fix index

b0c2ee26

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功