提交 · 2ef4ec71117f77bfe2021fbbf624e617098f3abd · PaddlePaddle / Paddle

30 8月, 2023 8 次提交

Add paddle custom flags support (#56256) · 2ef4ec71

由 huangjiyi 提交于 8月 30, 2023

* update

* repalce gflags header

* replace DEFINE_<type> with PD_DEFINE_<type>

* fix bug

* fix bug

* fix bug

* update cmake

* add :: before some paddle namespace

* fix link error

* fix CI-Py3

* allow commandline parse

* fix SetFlagsFromEnv

* fix bug

* fix bug

* fix CI-CINN

* fix CI-Coverage-build

* fix CI-Windows-build

* fix CI-Inference

* fix bug

* fix bug

* fix CI-CINN

* fix inference api test

* fix infer_ut test

* revert infer_ut gflags usage

* update

* fix inference

* remove flags export macro

* revert inference demo_ci gflags usage

* update

* update

* update

* update

* update

* update

* update

* update

* fix bug when turn on WITH_GFLAGS

* turn on WITH_GFLAGS

* fix bug when turn on WITH_GFLAGS

* fix bug when turn on WITH_GFLAGS

* update

* update and add unittest

* add unittest

* fix conflict

* rerun ci

* update

* resolve conflict

2ef4ec71

N

[clang-tidy][task 5] enable `modernize-make-shared` and fix existing linter errors (#55807) · ac80251a
由 Nyakku Shigure 提交于 8月 30, 2023

ac80251a
W

[NewIR]Gen ops_api.cc for static mode (#56653) · 59b2ad39
由 WangZhen 提交于 8月 30, 2023

59b2ad39
K
[NewIR] add_arg_mapping_for_fetch (#56752) · 1692af99
由 kangguangli 提交于 8月 30, 2023
```
* add_arg_mapping_for_fetch

* fix

* fix
```
1692af99

[Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5

由 Ghost Screaming 提交于 8月 30, 2023

* for verify

fluid operator support new comm library

* u

* u

* u

* compatiable new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.

* Remove useless comments in process_group.py

* Polish code style.

* Fix some problems.

* Remove use fluid api in phi comm_context_manager.

* Add PPADDLE_WITH_CUDA and PADDLE_WITH_NCCL micro judgement.

* Fix bug of HIP architecture.

* Fix some problems.
1. remove useless loggings.
2. Fix conditional compilation for HIP.
3. Fix problems of test_pass_generation_pipeline.py. It calls paddle.distributed.init_parallel_env() at first,
then auto.Engine calls _init_comm(), which will calls process_group.instantiate(). However, init_parallel_env() will call
paddle.distributed.barrier(), it will call CreateNCCLEnvCache and create corresponding NCCLCommContext. But dev_id is not
set, as a result, NCCLCommContext's dev_ctx is not initialized.

* Fix some problems.

* Polish code.

* Polish code.

* Revert compatiable upgrade for communication operators. Their upgrades
will be submitted in another PR.

* Remove StaticTCPStore.

* Remove useless modification.

* Remove useless set_cuda_device_id.

* Polish code.

* Remove fluid header files in phi files.

* Remove useless comments.

* Fix problems of hip arch.

* Fix some problems.

* Polish code.

* Polish code style.

---------
Co-authored-by: hitywt <yuwentao126@126.com>

ade51aa5

[IR] Rigister LegacyKernelOp into KernelDialect (#56680) · ded10442

由 chen2016013 提交于 8月 30, 2023

* Register LegacyKernelDialect & Rigister LegacyKernelOp

* fix code style

* delete LegacyKernelDialect ,register LegacyKernelOp into PaddleKernelDialect

* fix bug

* change as reviewed comments

* bug fix

* bug fix

* try to restart coverage CI

* pass legacy op to kernel pass

* fix code style

* fix code style

* fix code style

ded10442

R

[CustomDevice] Fix error that query a destroyed event (#56745) · c5786be1
由 ronnywang 提交于 8月 30, 2023

c5786be1
N
[clang-tidy][task 61] enable `hicpp-exception-baseclass` and fix existing errors (#55847) · 31a96888
由 Nyakku Shigure 提交于 8月 30, 2023
```
* [clang-tidy] enable `hicpp-exception-baseclass` and fix existing errors

* config

* update error format to pass the ci check (at least 20 chars)
```
31a96888

29 8月, 2023 13 次提交

R

[CustomDevice] Not reset pass_builder (#56755) · 220f13bd
由 ronnywang 提交于 8月 29, 2023

220f13bd

[NewIR] support c_sync_calc_stream/c_sync_comm_stream/send_v2/recv_v2 (#56557) · 0ce66c1c

由 zhaoyingli 提交于 8月 29, 2023

* [AutoParallel][NewIR] support calc_sync/comm_sync/send_v2/recv_v2

* pre-commit

* rm unittest

* tiny fix

* api_gen support send_v2's output is empty

* fix format

* python_c_gen support send_v2

0ce66c1c

Fix instant variable oom in paddle2cinn (#56662) · df9d9c59

由 Fisher 提交于 8月 29, 2023

When using paddle2cinn, CompilationContext.with_instantiate_variables should be set to false, otherwise CINN will instant and manage variables memory, this leads to double the memory usage, which eventually leads to out of memory error.
This PR will set CompilationContext.with_instantiate_variables to false before context pass to constructing the graph compiler.

df9d9c59

C
Vjp autogen for grad list op(split) (#56720) · 128f95a1
由 Chen Zhiyang 提交于 8月 29, 2023
```
* add vjp code gen for SplitOp

* change vjp manual file name
```
128f95a1
L
[New-IR] add pass registry (#56729) · 9999e849
由 Leo Chen 提交于 8月 29, 2023
```
* add pass registry

* add pass registry macro
```
9999e849

Remove need_move_to_phi (#56371) · daac3829

由 Sonder 提交于 8月 29, 2023

* remove flag

* open static build flag

* add searchsorted to list

* add register info for fused layernorm

* fix fused_layernorm_kernel output registe info

* fix stft registe info

* add include

* fix registe info

* add skip fake init for fused_layernorm:residual_out

* fix error

* add distributed_fused_lamb_init to StaticBuildBlackList

* set static_build flag to false

daac3829

[Fluid] move lars_momentum to phi (#55798) · b0c2ee26

由 gouzil 提交于 8月 29, 2023

* [Fluid] move lars_momentum to phi

* add sig

* fix optional Output

* off check_dygraph

* fix input

* fix operator[]

* fix

* try fix AllocateTmpTensor

* fix

* fix type

* Update paddle/phi/kernels/gpu/lars_momentum_kernel.cu

* fix type

* rollback

* Add Registration

* try fix win

* try fix win

* try use double

* try use operator *(float,const Derived &)

* try auto

* fix

* fix

* fix

* fix dtype

* fix type

* fix index

b0c2ee26

Z
Revert "[NewIR]Fix new ir output dtype bug (#56620)" (#56739) · f5d9981e
由 zhangbo9674 提交于 8月 29, 2023
```
This reverts commit 1409e4ec.
```
f5d9981e
C
[clang-tidy] No.26,27 enable misc-unused-using-decls,misc-unused-alias-decls (#56485) · 138bdf40
由 cyberslack_lee 提交于 8月 29, 2023
```
* fix

* fix
```
138bdf40
X
[clang-tidy] No. 53,54 enable cppcoreguidelines-c-copy-assignment-signature... · cc9e8699
由 xiaoye 提交于 8月 29, 2023
```
[clang-tidy] No. 53,54 enable cppcoreguidelines-c-copy-assignment-signature and bugprone-use-after-move (#56601)
```
cc9e8699
N

[clang-tidy] enable `modernize-raw-string-literal` and fix existing errors (#55675) · 241f97d5
由 Nyakku Shigure 提交于 8月 29, 2023

241f97d5
G

[clang-tidy] enable bugprone-unhandled-self-assignment check (#56640) · b185adf8
由 gouzil 提交于 8月 29, 2023

b185adf8
G

[clang-tidy] enable performance-inefficient-string-concatenation check (#56647) · 0236771e
由 gouzil 提交于 8月 29, 2023

0236771e

28 8月, 2023 9 次提交

[NewIR]Fix new ir output dtype bug (#56620) · 1409e4ec

由 hong 提交于 8月 28, 2023

* update

* fix batch norm grad args def

* fix bug

* fix combine slice bug

* fix slice bug

* update builtin split

1409e4ec

G

[clang-tidy] enable bugprone-exception-escape check (#56692) · dcaca0f4
由 gouzil 提交于 8月 28, 2023

dcaca0f4

【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_,... · c5fc413a

由 GGBond8488 提交于 8月 28, 2023

【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_, logical_and_, logical_or_, logical_xor_, logical_not_, divide_, floor_divide_, bitwise_and_ , bitwise_or_, bitwise_xor_, bitwise_not_ (#55509)

* tmp commit

* add atan2

* add inplace api

* fix error

* add inpalce divide

* add inplace api

* add more inplace

* add more inpalce

* fix logical_not error

* support sinh and cosh in cpu

* support asin, acos, atan, asinh, acosh, atanh in cpu

* fix typro

* fix typro

* mv out atan2 ldexp

* mv out atan2 ldexp

* support sinh and cosh in gpu

* support asin, acos, atan, asinh, acosh, atanh in gpu

* fix ge error

* fix dygraph commpare error

* fix dygraph commpare error

* check complex in python

* fix cast inpalce error

* open inplace test

* fix ops.yaml error

* mv cast inpalce to python

* fix coverage ci

* add last inplace

* fix inplace error

* fix cast error

* fix error

* add nan_to_num_

* fix typro

* fix sparse cast error

* remove gpu 4

* fix static cast error

* tmp commit

* add atan2

* add inplace api

* fix error

* add inpalce divide

* add inplace api

* add more inplace

* add more inpalce

* fix logical_not error

* fix typro

* fix typro

* mv out atan2 ldexp

* mv out atan2 ldexp

* fix ge error

* fix dygraph commpare error

* fix dygraph commpare error

* fix cast inpalce error

* open inplace test

* fix ops.yaml error

* mv cast inpalce to python

* fix coverage ci

* add last inplace

* fix inplace error

* fix cast error

* fix error

* add nan_to_num_

* fix typro

* fix sparse cast error

* remove gpu 4

* fix static cast error

* fix cast error

* fix

* Revert "check complex in python"

This reverts commit c822064261d774dd58ad46a4f90ba8b467700a05.

* add renorm , fix error

* add coverage

* fix cumsum inpalce version error

* add cast inpalce impl

* rm test.log

* fix multiply_dyfunction and add multiply_backward test

* add and use is_same_tensor

* fix typro

* fix sone error

* fix typro

---------
Co-authored-by: NScotty <jmhgchn@gmail.com>
Co-authored-by: NScotty <527407973@qq.com>

c5fc413a

Add dimOp, tieProductEqualOp. access constraint_func in SymbolTable. Lowing... · 589588f3

由 liuruyan 提交于 8月 28, 2023

Add dimOp, tieProductEqualOp. access constraint_func in SymbolTable. Lowing DenseTensorType. (#56615)

* add symbolicDimProduct & symbolicDimMgr without method shape_constraint related.

* add pd_type.cc to ir_shape CMakeLists.

* add dimOp, tieProductEqualOp. access constraint_func in SymbolTable.

* put DenseTensorType into builtin_type.

589588f3

Y

fix bug (#56664) · 75ee1a88
由 Yuanle Liu 提交于 8月 28, 2023

75ee1a88

[NewIR] register set_value in new ir (#56436) · deee91d8

由 kangguangli 提交于 8月 28, 2023

* register set_value in new ir

* fix

* register set_value_grad

* fix

* fix

* remove debug info

* add unittest

* fix

* fix

* fix

* fix

* fix

* resolve comments

deee91d8

[AutoParallel] Simplify PADDLE_WITH_DISTRIBUTE marco using (#56361) · 62c78e26

由 Chen Weihang 提交于 8月 28, 2023

* simplify with dist marco

* polish error message format

* fix vtable error

* fix cmake error

* fix winsock redefined error

* fix windows compile error

* fix windows conpile failed

* fix merge error

* fix vec compile error

* add port.h into test_cpu_vec

* fix merge error

* try to fix winsock error

62c78e26

[Phi] move shuffle_batch to phi (#56547) · 30708028

由 Sonder 提交于 8月 28, 2023

* move shuffle_batch to phi

* remove useless codes

* add test_shuffle_batch_op to STATIC_BUILD_TESTS

* move shuffle_batch_kernel.cc to cpu folder

* move shuffle_batch_grad to phi

* rm shuffle_batch_op.h

* change year at file head

30708028

[NewIR]Split python api and vjp (#56518) · 7995a389

由 xiaoguoguo626807 提交于 8月 28, 2023

* support ir api form prim

* convert vector of int to intarray

* add reference of lbfgs

* add reference of lbfgs

* support ir api for prim

* Add more gen api

* concat python api to concat_grad

* fix gen conflict

* support vjp prim mode in new ir

* remove useless code

* add vjp autogen v1.0

* add test for prim

* resolve type conflict

* modify utils

* remove useless code

* add split op and modify some bug of vectorType

* fix conflict

* add concat python test

* add split python api to vjp

* modify build bug

* modify run bug

* fix conflict bug

* build bug fix

* modify python api bug

* modify test

* fix conflict

* fluid backward recover

* recover conflict

* reply review comments

* modify opruntimeinfo num

---------
Co-authored-by: Ncyber-pioneer <chenzhuo@tju.edu.cn>
Co-authored-by: NCharles-hit <wanghao107@baidu.com>
Co-authored-by: N0x45f <wangzhen45@baidu.com>
Co-authored-by: Nchenzhiyang <1792266893@qq.com>
Co-authored-by: NChen Zhiyang <chenzhiyang99@126.com>

7995a389

27 8月, 2023 1 次提交
- C
  【NewIR】Vjp autogen for multi-input op(concat) (#56657) · 971945ab
  由 Chen Zhiyang 提交于 8月 27, 2023
```
* gen-temp-save

* add concat vjp

* remove useless print

* code style

* remove manual concat vjp
```
  971945ab
25 8月, 2023 9 次提交
- J
  [Semi Auto] Matmul & Embedding InferBackward Rule (#56257) · 3483398c
  由 JZ-LIANG 提交于 8月 25, 2023
```
* add embedding backward rule

* update backward api

* revert api

* matmul inferbackward

* update unitest
```
  3483398c
- L
  [Reshard] Support create shard tensor and non-zero dim reshard (#56553) · 99795a13
  由 LiYuRio 提交于 8月 25, 2023
```
* support create shard dist tesnor

* support non-zero shard to replicated

* change reshard signature
```
  99795a13
- H
  New ir support fuse bn add act (#56247) · d3f4596a
  由 hong 提交于 8月 25, 2023
```
* support new ir load combine

* update

* polish code

* remove print

* update

* update

* update

* polish code

* fix bug

* polish code

* fix compile bug

* fix bug

* revert code

* remove useless code

* polish code
```
  d3f4596a
- R
  
  [CustomDevice] add comm context support (#56301) · 62397cd2
  由 ronnywang 提交于 8月 25, 2023
  
  62397cd2
- Y
  [BugFix]Fix test_build_model error (#56633) · ca5585e9
  由 YuanRisheng 提交于 8月 25, 2023
```
* fix test bugs

* delete code
```
  ca5585e9
- Y
  [Inference] auto mixed precision inference support white list (#56535) · ecff21e7
  由 Yuanle Liu 提交于 8月 25, 2023
```
* auto mixed precision inference support white list

* update

* update

* update

* move down identity_op_clean_pass

* fix code style
```
  ecff21e7
- H
  
  add cache id flags (#56616) · 5f9d6d68
  由 hong 提交于 8月 25, 2023
  
  5f9d6d68
- R
  
  [CustomDevice] Fix device id out of range in custom device resource pool (#56580) · e99b3cb2
  由 ronnywang 提交于 8月 25, 2023
  
  e99b3cb2
- L
  
  fix static_build for pp (#56643) · f2ebbce7
  由 lzydev 提交于 8月 25, 2023
  
  f2ebbce7

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功