Commits · b7fbd33991893bad7846615fa53159e6241052bd · PaddlePaddle / Paddle

28 Jun, 2023 4 commits
- Support 0-D Tensor for check_numerics_kernel. (#54868) · b7fbd339
  Committed by Yiqun Liu on Jun 28, 2023
  
  b7fbd339
- [inference][trt]add Einsum op (#54860) · 69bf5ee8
  Committed by bukejiyu on Jun 28, 2023
```
* add einsum layer
```
  69bf5ee8
- update xpu api date (#54900) · 6da0a24d
  Committed by QingshuChen on Jun 28, 2023
  
  6da0a24d
- [IR] add op_operand api for ir::Operation. (#54875) · c3077ec1
  Committed by winter-wang on Jun 28, 2023
  
  c3077ec1
27 Jun, 2023 19 commits

[IR] rectify the verify api (#54895) · 96652265
Committed by winter-wang on Jun 27, 2023

96652265

[Paddle Inference]Enhance the shape check of trt_embedding_eltwise_layernorm_fuse_pass,… (#54861) · e49c17d2
Committed by Wangzheee on Jun 27, 2023
```
* Enhance the shape check of trt_embedding_eltwise_layernorm_fuse_pass, embedding_eltwise_layernorm_fuse_pass
```
e49c17d2

support zero_dim for some prim ops (#54892) · f8d02146
Committed by Charles-hit on Jun 27, 2023

f8d02146

[BugFix] fix bugs in DCU unit tests (#54874) · abc1c3d4

Committed by lishicheng1996 on Jun 27, 2023

* block bf16 tests on ROCM

* block more bf16 tests on ROCM

* some unittest cases doesn't have kernels on ROCm

* some unittest cases doesn't have kernels on ROCm

* fix code style

abc1c3d4

delete swish_raw (#54536) · 0cdaafea
Committed by zhangyuqin1998 on Jun 27, 2023
```
* delete swish_raw

* fix

* Update activation_kernel.cc

* fix
```
0cdaafea

【prim】modify eular_beam (#54736) · 7c2c965d
Committed by xiaoguoguo626807 on Jun 27, 2023
```
* modify eular_beam

* modify matmul infermeta

* add test

* modify timeout
```
7c2c965d

fix compiler error (#54883) · 8dc97857
Committed by risemeup1 on Jun 27, 2023

8dc97857

[Semi-Auto] SPMD Parallel Rule Base (#53863) · 6863e2ae

Committed by JZ-LIANG on Jun 27, 2023

* base rule

* add sharidng merge

* add sharidng axis merge

* define unified data class for inferencing dist_attr

* test wrap DistTensorSpec in dygraph mode

* matmul main logic done

* define unified data class for inferencing dist_attr

---------
Co-authored-by: Yichen Zhang <zhangyichen03@baidu.com>

6863e2ae

fix bug when place 'use_cudnn' in extra (#54766) · 689e27af
Committed by lzydev on Jun 27, 2023

689e27af

add xpu_optimize_cachekv_initialization_pass (#54809) · 610a47dd
Committed by xinxinZi on Jun 27, 2023

610a47dd

replace the CosineDecay in fluid with 2.0 version (#54829) · 5a804830
Committed by LoneRanger on Jun 27, 2023
```
* remove the CosineDecay in fluid

* Update test_basic_api_transformation.py
```
5a804830

replace NaturalExpDecay, ExponentialDecay, InverseTimeDecay with 2.0 version (#54424) · de60c1d1

Committed by LoneRanger on Jun 27, 2023

* remove the NaturalExpDecay in fluid

* fix bug

* remove the ExponentialDecay in fluid

* remove the InverseTimeDecay in fluid

* remove the InverseTimeDecay class

* fix bug

de60c1d1

fix xpu retry allocator bug (#54847) · 5bbbf5dd
Committed by ykkk2333 on Jun 27, 2023

5bbbf5dd

add all_to_all phi operator (#54797) · 158b7ae5
Committed by TaoTao Li on Jun 27, 2023
```
* add all_to_all phi operator, kernel, api

* add all_to_all ut

* tinyfix
```
158b7ae5

commit (#54894) · 70288456
Committed by 周周周 on Jun 27, 2023

70288456

Code merge | Merge CINN into Paddle (#54749) · 67c69dca

Committed by 6clc on Jun 27, 2023

* feat(cmake): add cmake of cinn

* feat(cmake): add cmake of cinn python test

* feat(cmake): add jit

* feat(cmake): test/CMakeList.txt

* feat(cmake): rebase to develop

* feat(cmake): remove some flags

* fix(cmake): fix cinn's gflags depends

* feat(cmake): add ci scripts of cinn

* feat(cmake): copy code of cinn

* fix(cmake): fix cinn third_party model path

* gflags dynamic dependce

* fix ci build_demo

* tmp update to c++17 of cinn-only test

* fix cinn only with c++17

67c69dca

[IR&PASS] part 3-3: Add PatternRewrite Driver code. (#54738) · 72b8c7c2
Committed by Wilber on Jun 27, 2023

72b8c7c2

delete_assign_op_pass (#54887) · 813266a2
Committed by zhupengyang on Jun 27, 2023

813266a2

New ir support data transfer (#54763) · b58869fa

Committed by hong on Jun 27, 2023

* add kernel dialect

* change DenseTensorTypeStorage to DenseTensorType

* add test case`

* add first pd_op to kernel dialect

* lower pd op to kernel dialect

* update

* update

* remove useless code

* add attrite print test

* fix bug

* update

* update

* update

* update

* polish code

* fix bug

* polish  code  and add python test

* add test

* fix test error

* add env flag

* fix bug

* revert test env

* change cc_test_old to cc_test

* fix build_static bug

* fix type test error

* udpate cmake

* disable test in windows

* update

* update

* fix bug

* split file

* fix conflict

* polish code and fix conflict

* support place transformer

* finish bug

* add gpu flags

* fix with cuda macro

* update

* add scope guard

* polish code

b58869fa

26 Jun, 2023 17 commits

add ut for pass_infra (AnalysisManager, PreservedAnalyses) (#54849) · 9307d357
Committed by Yuanle Liu on Jun 26, 2023

9307d357

test case of batch_norm support cinn when training mode and nchw (#54862) · fa44ea5c
Committed by cyber-pioneer on Jun 26, 2023

fa44ea5c

exclude xpu (#54848) · 6962d3e2
Committed by pangengzheng on Jun 26, 2023

6962d3e2

[inference][trt] optimize set_value and top_k op (#54372) · e25e86f4

Committed by Zhang Jun on Jun 26, 2023

* set_value update

* support ValueTensor's rank != Input'rank & update topk

* update range to avoid coredump

* fix addShape error

* Dims definition differ between 7.2 and 8.0+

* Update test_trt_convert_top_k_v2.py

* update top_k

* Update test_trt_convert_top_k_v2.py

e25e86f4

[cmake] add dgc jemalloc third_party cache (#54392) · 34cfbe79

Committed by Sanbu on Jun 26, 2023

* add dgc jemalloc third_party cache

* fix

* fix ci

* fix ci

* fix

* fix

* fix

* Update jemalloc.cmake

* fix

* fix

* Update jemalloc.cmake

* Update jemalloc.cmake

* Update jemalloc.cmake

* Update jemalloc.cmake

* fix

* Update dgc.cmake

* fix style

* fix cmake equal to strequal

* fix

* fix

---------
Co-authored-by: risemeup1 <62429225+risemeup1@users.noreply.github.com>

34cfbe79

Support feed op new ir (#54840) · 1e323137

Committed by hong on Jun 26, 2023

* add fetch kernel

* support fetch var in new ir

* fix bug

* polish code

* change array equal to np.testing

* support feed in new ir

* fix bug

* try to hack combine op

* add scope guard

* revert atan2 op

* polish code

1e323137

DOC: typo corrected in docs of logical_xor (#54858) · 5d9af9db
Committed by Muhammad Ishaque Nizamani on Jun 26, 2023

5d9af9db

[XPU] support xpu runtime profiler: follow up (#54690) · 9c3f4b13
Committed by XiaociZhang on Jun 26, 2023
```
* [XPU] support xpu runtime profiler: follow up

* fix compile issue
```
9c3f4b13

Fix eigen multiple definition (#54812) · ba09621a
Committed by chalsliu on Jun 26, 2023

ba09621a

[Inference] Fix IR shared lib not found · 3ec55faa
Committed by chalsliu on Jun 26, 2023

3ec55faa

[cmake] fix Cache (#54764) · 0aca659b
Committed by gouzil on Jun 26, 2023

0aca659b

add squeeze2+matmul pass (#54779) · f1c8d3fa
Committed by wz1qqx on Jun 26, 2023

f1c8d3fa

support auto generation for gather (#54084) · ffeac6d5
Committed by cyberslack_lee on Jun 26, 2023

ffeac6d5

Support static graph code-gen for bincount (#54686) · b547c4ac
Committed by Sanbu on Jun 26, 2023

b547c4ac

remove ops from OpsWithFluidKernelNeedMoveToPhi set (#54007) · 733eca85

Committed by Sonder on Jun 26, 2023

* remove ops from OpsWithFluidKernelNeedMoveToPhi set

* open static build flag

* OpsWithFluidKernelNeedMoveToPhi

* open new_executor_static_build

* add infermate for cudnn_lstm

* fix

* update

* fix

* update

* update

* update

* fix pow2 decay

* fix pow2 decay

* recover analysis_predictor.cc

* fix pow2 decay

* fix cudnn lstm

* add output register info for svd

* fix pow2_decay_with_linear_warmup_kernel

* recover test lstm cudnn

* recover svg register codes

* fix register info

* fix reduce sum register info

* add output info for adadelta

* add output info for adadelta

* add output info for adamax

* fix complex abs register info

* add register info for cudnn_lstm_grad

* recover

* fix lstm cudnn

* fix

* fix xpu output registe info

* remove std::cout

* add backend

* remove output info in pow2_decay_with_linear_warmup_kernel

* add judgment in TensorShouldBeFakeInitialized

* recover power_

* close new_executor_static_build

* fix set_value_xpu

733eca85

Share workqueue cross-interpretercores (#54780) · 59dd97af
Committed by Ruibiao Chen on Jun 26, 2023
```
* Share workqueue cross-interpretercores

* Fix UT
```
59dd97af

delete_repeated_ops_pass and reshape_unstack_concat_fuse_pass (#54846) · 05bd4a89
Committed by zhupengyang on Jun 26, 2023

05bd4a89

PaddlePaddle / Paddle synced successfully about 1 year ago
