提交 · 104a518c0e3e54795fa7a18a7c7058258ee0cb73 · PaddlePaddle / Paddle

29 6月, 2023 1 次提交
- W
  
  [IR] polish the concrete op api (#54928) · 104a518c
  由 winter-wang 提交于 6月 29, 2023
  
  104a518c
28 6月, 2023 9 次提交
- Z
  [IR] Refine PhiKernelOp attributes name and delete some unused code (#54891) · 9137adb9
  由 zhangbo9674 提交于 6月 28, 2023
```
* refine code

* add some interface for phi kernel op

* fix compile bug

* delete unused code

* support code

* fix bug

* refine code

* delete unused code

* fix compile bug

* fix compile bug

* delete unused code

* add elementwise add op

* fix compile bug
```
  9137adb9
- C
  
  support some prim ops zero dim part3 (#54919) · 272ed912
  由 Charles-hit 提交于 6月 28, 2023
  
  272ed912
- G
  【Inplace】Add copy for inplace (#54683) · 98debaa8
  由 GGBond8488 提交于 6月 28, 2023
```
* add clone for inpalce

* fix name

* add inplace pow

* fix typro

* add note

* fix typro

* fix typro

* fix bug

* fix test error

* add type error test

* adjust indentation
```
  98debaa8
- R
  
  support auto generate for static op reduce_prod (#54316) · ac94b135
  由 RedContritio 提交于 6月 28, 2023
  
  ac94b135
- Z
  add gc for multi jobs (#54897) · fcffd84d
  由 zhaoyingli 提交于 6月 28, 2023
```
* add gc for multi jobs

* fix job.h

* update OpInfo to OpInOutInfo

* update get_skip_gc_vars algo order
```
  fcffd84d
- H
  Fix output vector type bug (#54865) · 9c2dae1a
  由 hong 提交于 6月 28, 2023
```
* add fetch kernel

* support fetch var in new ir

* fix bug

* polish code

* change array equal to np.testing

* support feed in new ir

* fix bug

* try to hack combine op

* add scope guard

* revert atan2 op

* polish code

* fix vector type bug

* modify feed data type
```
  9c2dae1a
- K
  [IR] complement ir type (#54911) · ffc1b027
  由 kangguangli 提交于 6月 28, 2023
```
* complement ir type

* fix ir_printer
```
  ffc1b027
- B
  [inference][trt]add Einsum op (#54860) · 69bf5ee8
  由 bukejiyu 提交于 6月 28, 2023
```
* add einsum layer
```
  69bf5ee8
- W
  
  [IR] add op_operand api for ir::Operation. (#54875) · c3077ec1
  由 winter-wang 提交于 6月 28, 2023
  
  c3077ec1
27 6月, 2023 10 次提交

W

[IR] rectify the verify api (#54895) · 96652265
由 winter-wang 提交于 6月 27, 2023

96652265
W
[Paddle Inference]Enhance the shape check of trt_embedding_eltwise_layernorm_fuse_pass,… (#54861) · e49c17d2
由 Wangzheee 提交于 6月 27, 2023
```
* Enhance the shape check of trt_embedding_eltwise_layernorm_fuse_pass, embedding_eltwise_layernorm_fuse_pass
```
e49c17d2
R

fix compiler error (#54883) · 8dc97857
由 risemeup1 提交于 6月 27, 2023

8dc97857

[Semi-Auto] SPMD Parallel Rule Base (#53863) · 6863e2ae

由 JZ-LIANG 提交于 6月 27, 2023

* base rule

* add sharidng merge

* add sharidng axis merge

* define unified data class for inferencing dist_attr

* test wrap DistTensorSpec in dygraph mode

* matmul main logic done

* define unified data class for inferencing dist_attr

---------
Co-authored-by: NYichen Zhang <zhangyichen03@baidu.com>

6863e2ae

X

add xpu_optimize_cachekv_initialization_pass (#54809) · 610a47dd
由 xinxinZi 提交于 6月 27, 2023

610a47dd
Y

fix xpu retry allocator bug (#54847) · 5bbbf5dd
由 ykkk2333 提交于 6月 27, 2023

5bbbf5dd
周

commit (#54894) · 70288456
由周周周提交于 6月 27, 2023

70288456

Code merge | Merge CINN into Paddle (#54749) · 67c69dca

由 6clc 提交于 6月 27, 2023

* feat(cmake): add cmake of cinn

* feat(cmake): add cmake of cinn python test

* feat(cmake): add jit

* feat(cmake): test/CMakeList.txt

* feat(cmake): rebase to develop

* feat(cmake): remove some flags

* fix(cmake): fix cinn's gflags depends

* feat(cmake): add ci scripts of cinn

* feat(cmake): copy code of cinn

* fix(cmake): fix cinn third_party model path

* gflags dynamic dependce

* fix ci build_demo

* tmp update to c++17 of cinn-only test

* fix cinn only with c++17

67c69dca

Z

delete_assign_op_pass (#54887) · 813266a2
由 zhupengyang 提交于 6月 27, 2023

813266a2

New ir support data transfer (#54763) · b58869fa

由 hong 提交于 6月 27, 2023

* add kernel dialect

* change DenseTensorTypeStorage to DenseTensorType

* add test case`

* add first pd_op to kernel dialect

* lower pd op to kernel dialect

* update

* update

* remove useless code

* add attrite print test

* fix bug

* update

* update

* update

* update

* polish code

* fix bug

* polish  code  and add python test

* add test

* fix test error

* add env flag

* fix bug

* revert test env

* change cc_test_old to cc_test

* fix build_static bug

* fix type test error

* udpate cmake

* disable test in windows

* update

* update

* fix bug

* split file

* fix conflict

* polish code and fix conflict

* support place transformer

* finish bug

* add gpu flags

* fix with cuda macro

* update

* add scope guard

* polish code

b58869fa

26 6月, 2023 9 次提交

[inference][trt] optimize set_value and top_k op (#54372) · e25e86f4

由 Zhang Jun 提交于 6月 26, 2023

* set_value update

* support ValueTensor's rank != Input'rank & update topk

* update range to avoid coredump

* fix addShape error

* Dims definition differ between 7.2 and 8.0+

* Update test_trt_convert_top_k_v2.py

* update top_k

* Update test_trt_convert_top_k_v2.py

e25e86f4

Support feed op new ir (#54840) · 1e323137

由 hong 提交于 6月 26, 2023

* add fetch kernel

* support fetch var in new ir

* fix bug

* polish code

* change array equal to np.testing

* support feed in new ir

* fix bug

* try to hack combine op

* add scope guard

* revert atan2 op

* polish code

1e323137

X
[XPU] support xpu runtime profiler: follow up (#54690) · 9c3f4b13
由 XiaociZhang 提交于 6月 26, 2023
```
* [XPU] support xpu runtime profiler: follow up

* fix compile issue
```
9c3f4b13
W

add squeeze2+matmul pass (#54779) · f1c8d3fa
由 wz1qqx 提交于 6月 26, 2023

f1c8d3fa
C

support auto generation for gather (#54084) · ffeac6d5
由 cyberslack_lee 提交于 6月 26, 2023

ffeac6d5
S

Support static graph code-gen for bincount (#54686) · b547c4ac
由 Sanbu 提交于 6月 26, 2023

b547c4ac

remove ops from OpsWithFluidKernelNeedMoveToPhi set (#54007) · 733eca85

由 Sonder 提交于 6月 26, 2023

* remove ops from OpsWithFluidKernelNeedMoveToPhi set

* open static build flag

* OpsWithFluidKernelNeedMoveToPhi

* open new_executor_static_build

* add infermate for cudnn_lstm

* fix

* update

* fix

* update

* update

* update

* fix pow2 decay

* fix pow2 decay

* recover analysis_predictor.cc

* fix pow2 decay

* fix cudnn lstm

* add output register info for svd

* fix pow2_decay_with_linear_warmup_kernel

* recover test lstm cudnn

* recover svg register codes

* fix register info

* fix reduce sum register info

* add output info for adadelta

* add output info for adadelta

* add output info for adamax

* fix complex abs register info

* add register info for cudnn_lstm_grad

* recover

* fix lstm cudnn

* fix

* fix xpu output registe info

* remove std::cout

* add backend

* remove output info in pow2_decay_with_linear_warmup_kernel

* add judgment in TensorShouldBeFakeInitialized

* recover power_

* close new_executor_static_build

* fix set_value_xpu

733eca85

R
Share workqueue cross-interpretercores (#54780) · 59dd97af
由 Ruibiao Chen 提交于 6月 26, 2023
```
* Share workqueue cross-interpretercores

* Fix UT
```
59dd97af
Z

delete_repeated_ops_pass and reshape_unstack_concat_fuse_pass (#54846) · 05bd4a89
由 zhupengyang 提交于 6月 26, 2023

05bd4a89

25 6月, 2023 3 次提交

Support fetch in new ir (#54826) · e66beb0b

由 hong 提交于 6月 25, 2023

* add fetch kernel

* support fetch var in new ir

* fix bug

* polish code

* change array equal to np.testing

e66beb0b

auto parallel support pipeline scheduler with standalone executor (#54727) · a702e170

由 zhaoyingli 提交于 6月 25, 2023

* auto parallel support pipeline scheduler with standalone executor

* rm check_fetch

* update cmakelist and flags env

* rm set micro batch id

* rm import

* update utils func

* raise error when merge tensor for return_numpy is False

* fix _pipeline_opt

* fix unittest

a702e170

[Prim] Fix batch_norm bias_grad loss in cinn (#54751) · 3e0f0a00

由 cyber-pioneer 提交于 6月 25, 2023

* fix batch_norm grad kernel nhwc error

* fix batch_norm bias_grad loss in cinn

* disable cinn

* fix cinn_atol

3e0f0a00

22 6月, 2023 1 次提交
- H
  [IR] Refactor op yaml info parser (#54790) · 1b8a1a98
  由 hong 提交于 6月 22, 2023
```
* update

* update

* polish code

* polish code
```
  1b8a1a98
21 6月, 2023 4 次提交
- C
  
  [XPU][Inference] Delete redundant squeeze/unsqueeze op. (#54754) · 7f6bb160
  由 csy0225 提交于 6月 21, 2023
  
  7f6bb160
- B
  [inference][trt]test_emb_eltwise_layernorm_fuse_pass cuda12 fix (#54757) · 55704db5
  由 bukejiyu 提交于 6月 21, 2023
```
* modify tensorrt ci timeout

* activation ci bug fix

* Update CMakeLists.txt

* Update CMakeLists.txt

* comment out  int8 mode test_trt_dynamic_shape_groupnorm

* fix test_sync_batch_norm_op Segmentation fault
```
  55704db5
- 周
  [Paddle Inference]add info in memory_optimize_pass.cc (#54789) · 3371f98b
  由周周周提交于 6月 21, 2023
```
* add info in memory_optimize_pass.cc
```
  3371f98b
- X
  
  add delete_xpu_unnecessary_cast_op_pass (#54663) · 98a165bf
  由 xinxinZi 提交于 6月 21, 2023
  
  98a165bf
20 6月, 2023 3 次提交
- W
  static graph autogen code support for matmul op (#54338) · ad80fbfe
  由 Wang Xin 提交于 6月 20, 2023
```
* static graph autogen code support for matmul op

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug
```
  ad80fbfe
- H
  
  [Fix Bug] Fix random fail of CustomOp due to python lock (#54772) · 87054fe3
  由 HongyuJia 提交于 6月 20, 2023
  
  87054fe3
- prepare for collective communicate upgrade in dygraph (#54417) · 46c57674
  由 TaoTao Li 提交于 6月 20, 2023
  
  46c57674

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功