提交 · cfa513f7ffdd3468c6aa6358fe3aa6be077a385c · PaddlePaddle / Paddle

12 7月, 2023 2 次提交

[ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7

由 YangQun 提交于 7月 12, 2023

* squash pick the poc code
* fix build after rebase
* fix int8 conv and fc uts
* Fix and clean-up Get_SRC_Scale_Memory
* fix floating point fc uts
* fix test_analyzer_int8_googlenet
* test_analyzer_int8_mobilenetv1
* fix int8 mobilenet v2 and v3
* fix build error after rebase
* [oneDNN] rename library version
* fix conv bias datatype
* try to fix import error
* fix rebase error
* [oneDNN] pack library into python wheel
* add MKLDNN_SHARED_LIB_3 to env_dict
* fix test_analyzer_bert
* fix fill_constant op kernel
* fix ernie and matmul op ut
* fix softplus ut
* fix conv+relu6 fusion ut
* fix hardswish fusion
* fix quant+transpose fusion ut
* fixsgd ut
* fix int8 matmul with flatten
* fix fc+scale fusion
* fix conv/matmul+gelu fusion uts
* fix rebase error
* Revert "fix conv/matmul+gelu fusion uts"
This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
* upgrade to onednn v3.1
* remove older version onednn
* use densetensor::data() for achieving mean and var in layernorm impl
* comments for atol of integer tests
* fix clang-format
* Revert "remove older version onednn"
This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
* improve binary handle
* fix expand kernel
* Revert "use densetensor::data() for achieving mean and var in layernorm impl"
* always use forward_inference for conv
* remove activation scales
* rollback changes to mkldnn.cmake
* address comments
* port changes to dequantize kernel
* fix merge error
* fix fused_elementwise_kernel
* upgrade onednn version to v3.1.1
* fix some approval error
* fix error msg format
* remove old onednn libs
* try to fix symbolic link issue
* fix cinn test case segfault
* do not explicit link test with onednn
* remove unnecessary changes
* integrate CINN with onednn v3
* link with mkldnn project
* fix cinn build file

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
Co-authored-by: NChen, Xinyu1 <xinyu1.chen@intel.com>
Co-authored-by: Ntianshuo78520a <707759223@qq.com>

cfa513f7

[clang-tidy] enable `readability-container-size-empty` check (#55279) · be3a6fa7

由 Wang Xin 提交于 7月 12, 2023

* [clang-tidy] enable readability-container-size-empty check

* fix test_custom_kernel Failed

* add clang-tid-10 in dockerfile

* add clang-tidy in dockerfile

* fix bug

be3a6fa7

07 7月, 2023 5 次提交
- W
  
  [XPU] Add layernorm fuse pass (#55154) · eb12739e
  由 wz1qqx 提交于 7月 07, 2023
  
  eb12739e
- W
  
  [XPU] Eliminate small ops (#55193) · b8f265d2
  由 wz1qqx 提交于 7月 07, 2023
  
  b8f265d2
- Y
  rename WITH_INFERENCE_NVTX to WITH_NVTX and fix compile bug (#55219) · 43843192
  由 Yuanle Liu 提交于 7月 07, 2023
```
* fix WITH_SHARED_IR option type

* rename WITH_INFERENCE_NVTX to WITH_NVTX and fix compile bug

* update
```
  43843192
- H
  
  fix exception bug (#55216) · 31edad21
  由 hong 提交于 7月 07, 2023
  
  31edad21
- E
  
  [CustomDevice]fix == error with place (#55173) · 70df3aa4
  由 engineer1109 提交于 7月 07, 2023
  
  70df3aa4
06 7月, 2023 2 次提交
- 傅
  [CINN] Re-Implement operator = for two Expr Tree (#55145) · af58cc37
  由傅剑寒提交于 7月 06, 2023
```
* optimize expr operator = implementation

* fix codestyle
```
  af58cc37
- Z
  [IR] Refine some code for NewIRInterpreter (#55169) · e9f9da14
  由 zhangbo9674 提交于 7月 06, 2023
```
* fix bug

* fix bug

* refien code

* refien code

* fix bug

* refine code
```
  e9f9da14
05 7月, 2023 3 次提交

[IR] New IR access InterpreterCore：add local scope logic (#55112) · 85831c32

由 zhangbo9674 提交于 7月 05, 2023

* add local scope

* refine code

* refien code

* refine code

* support local scope for BuildFuncList

* fix bug

* add log

* fix bug

* polish code

* fix bug

85831c32

W

[XPU] add reduce_max_fuse_pass (#54981) · 54a101d5
由 wz1qqx 提交于 7月 05, 2023

54a101d5

[NewIR]Fix tensor attribute translator bug (#55129) · bf92ccc7

由 hong 提交于 7月 05, 2023

* suport optional input in new_ir

* polish code

* add coverate test

* update

* update

* add unitest

* remove reduplicate code

* udpate

* fix assign error

* revert test arg min max

* update

* fix bug

* polish code

bf92ccc7

04 7月, 2023 2 次提交
- H
  
  posh code (#55114) · b4a149a5
  由 hong 提交于 7月 04, 2023
  
  b4a149a5
- H
  [NewIR]Fix null value and support some attribute (#55100) · a2903920
  由 hong 提交于 7月 04, 2023
```
* suport optional input in new_ir

* polish code

* add coverate test

* update

* update

* add unitest

* remove reduplicate code

* set test timeout
```
  a2903920
03 7月, 2023 2 次提交
- J
  [XPU] Fix the topk, set_value ops that using temporary tensors avoiding the... · cc2059a0
  由 jiangfan06 提交于 7月 03, 2023
```
[XPU] Fix the topk, set_value ops that using temporary tensors avoiding the memory overlaps during multi-stream inference (#54851)
```
  cc2059a0
- 周
  [Paddle-TRT] use hook to collect shape in CollectShapeRangeInfo API. (#54841) · 989f3dde
  由周周周提交于 7月 03, 2023
```
* commit

* commit

* commit

* commit

* final commit

* use hook to collect shape and shape value
```
  989f3dde
30 6月, 2023 3 次提交
- M
  
  [XPU] Add conv2d transpose fuse pass (#54904) · 12c15b89
  由 mjp9527 提交于 6月 30, 2023
  
  12c15b89
- Y
  
  [IR&PASS] add conv + bn fuse pattern, and other works (#54933) · 19345fa7
  由 Yuanle Liu 提交于 6月 30, 2023
  
  19345fa7
- J
  
  fix cachek_kv_layout pass (#54994) · 19d6a988
  由 JiangHao 提交于 6月 30, 2023
  
  19d6a988
29 6月, 2023 4 次提交

H
Refactor build attribute (#54968) · eef38db1
由 hong 提交于 6月 29, 2023
```
* update

* refactor build context

* fix bug

* polish code

* change func name
```
eef38db1

张

[CodeStyle][CINN] format cpp code via clang-format (#54961) · af127342

由张经纬提交于 6月 29, 2023

* fix clang-format

* 'fix_clang-format'

* fix remaining errors

* format

* empty commit, re-trigger all ci

* empty commit, re-trigger all ci

---------
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

af127342

Refactor op info parser (#54859) · f18d538b

由 hong 提交于 6月 29, 2023

* add kernel dialect

* change DenseTensorTypeStorage to DenseTensorType

* add test case`

* add first pd_op to kernel dialect

* lower pd op to kernel dialect

* update

* update

* remove useless code

* add attrite print test

* fix bug

* update

* update

* update

* update

* polish code

* fix bug

* polish  code  and add python test

* add test

* fix test error

* add env flag

* fix bug

* revert test env

* change cc_test_old to cc_test

* fix build_static bug

* fix type test error

* udpate cmake

* disable test in windows

* update

* update

* fix bug

* split file

* fix conflict

* polish code and fix conflict

* support place transformer

* finish bug

* add gpu flags

* fix with cuda macro

* add fetch kernel

* support fetch var in new ir

* fix bug

* polish code

* change array equal to np.testing

* support feed in new ir

* update

* fix bug

* try to hack combine op

* add scope guard

* revert atan2 op

* add scope guard

* update

* polish code

* update

* refactor build kernel context

* fix unitest bug

* polish code

* use original order

* remove useless code

* polish code

* fix bug

f18d538b

W

[XPU]add layer_norm fuse pass (#54930) · b94b3ac0
由 wz1qqx 提交于 6月 28, 2023

b94b3ac0

28 6月, 2023 2 次提交

[IR] Refine PhiKernelOp attributes name and delete some unused code (#54891) · 9137adb9

由 zhangbo9674 提交于 6月 28, 2023

* refine code

* add some interface for phi kernel op

* fix compile bug

* delete unused code

* support code

* fix bug

* refine code

* delete unused code

* fix compile bug

* fix compile bug

* delete unused code

* add elementwise add op

* fix compile bug

9137adb9

add gc for multi jobs (#54897) · fcffd84d

由 zhaoyingli 提交于 6月 28, 2023

* add gc for multi jobs

* fix job.h

* update OpInfo to OpInOutInfo

* update get_skip_gc_vars algo order

fcffd84d

27 6月, 2023 7 次提交

W
[Paddle Inference]Enhance the shape check of trt_embedding_eltwise_layernorm_fuse_pass,… (#54861) · e49c17d2
由 Wangzheee 提交于 6月 27, 2023
```
* Enhance the shape check of trt_embedding_eltwise_layernorm_fuse_pass, embedding_eltwise_layernorm_fuse_pass
```
e49c17d2
R

fix compiler error (#54883) · 8dc97857
由 risemeup1 提交于 6月 27, 2023

8dc97857
X

add xpu_optimize_cachekv_initialization_pass (#54809) · 610a47dd
由 xinxinZi 提交于 6月 27, 2023

610a47dd
周

commit (#54894) · 70288456
由周周周提交于 6月 27, 2023

70288456

Code merge | Merge CINN into Paddle (#54749) · 67c69dca

由 6clc 提交于 6月 27, 2023

* feat(cmake): add cmake of cinn

* feat(cmake): add cmake of cinn python test

* feat(cmake): add jit

* feat(cmake): test/CMakeList.txt

* feat(cmake): rebase to develop

* feat(cmake): remove some flags

* fix(cmake): fix cinn's gflags depends

* feat(cmake): add ci scripts of cinn

* feat(cmake): copy code of cinn

* fix(cmake): fix cinn third_party model path

* gflags dynamic dependce

* fix ci build_demo

* tmp update to c++17 of cinn-only test

* fix cinn only with c++17

67c69dca

Z

delete_assign_op_pass (#54887) · 813266a2
由 zhupengyang 提交于 6月 27, 2023

813266a2

New ir support data transfer (#54763) · b58869fa

由 hong 提交于 6月 27, 2023

* add kernel dialect

* change DenseTensorTypeStorage to DenseTensorType

* add test case`

* add first pd_op to kernel dialect

* lower pd op to kernel dialect

* update

* update

* remove useless code

* add attrite print test

* fix bug

* update

* update

* update

* update

* polish code

* fix bug

* polish  code  and add python test

* add test

* fix test error

* add env flag

* fix bug

* revert test env

* change cc_test_old to cc_test

* fix build_static bug

* fix type test error

* udpate cmake

* disable test in windows

* update

* update

* fix bug

* split file

* fix conflict

* polish code and fix conflict

* support place transformer

* finish bug

* add gpu flags

* fix with cuda macro

* update

* add scope guard

* polish code

b58869fa

26 6月, 2023 6 次提交

Support feed op new ir (#54840) · 1e323137

由 hong 提交于 6月 26, 2023

* add fetch kernel

* support fetch var in new ir

* fix bug

* polish code

* change array equal to np.testing

* support feed in new ir

* fix bug

* try to hack combine op

* add scope guard

* revert atan2 op

* polish code

1e323137

X
[XPU] support xpu runtime profiler: follow up (#54690) · 9c3f4b13
由 XiaociZhang 提交于 6月 26, 2023
```
* [XPU] support xpu runtime profiler: follow up

* fix compile issue
```
9c3f4b13
W

add squeeze2+matmul pass (#54779) · f1c8d3fa
由 wz1qqx 提交于 6月 26, 2023

f1c8d3fa

remove ops from OpsWithFluidKernelNeedMoveToPhi set (#54007) · 733eca85

由 Sonder 提交于 6月 26, 2023

* remove ops from OpsWithFluidKernelNeedMoveToPhi set

* open static build flag

* OpsWithFluidKernelNeedMoveToPhi

* open new_executor_static_build

* add infermate for cudnn_lstm

* fix

* update

* fix

* update

* update

* update

* fix pow2 decay

* fix pow2 decay

* recover analysis_predictor.cc

* fix pow2 decay

* fix cudnn lstm

* add output register info for svd

* fix pow2_decay_with_linear_warmup_kernel

* recover test lstm cudnn

* recover svg register codes

* fix register info

* fix reduce sum register info

* add output info for adadelta

* add output info for adadelta

* add output info for adamax

* fix complex abs register info

* add register info for cudnn_lstm_grad

* recover

* fix lstm cudnn

* fix

* fix xpu output registe info

* remove std::cout

* add backend

* remove output info in pow2_decay_with_linear_warmup_kernel

* add judgment in TensorShouldBeFakeInitialized

* recover power_

* close new_executor_static_build

* fix set_value_xpu

733eca85

R
Share workqueue cross-interpretercores (#54780) · 59dd97af
由 Ruibiao Chen 提交于 6月 26, 2023
```
* Share workqueue cross-interpretercores

* Fix UT
```
59dd97af
Z

delete_repeated_ops_pass and reshape_unstack_concat_fuse_pass (#54846) · 05bd4a89
由 zhupengyang 提交于 6月 26, 2023

05bd4a89

25 6月, 2023 1 次提交

Support fetch in new ir (#54826) · e66beb0b

由 hong 提交于 6月 25, 2023

* add fetch kernel

* support fetch var in new ir

* fix bug

* polish code

* change array equal to np.testing

e66beb0b

21 6月, 2023 1 次提交
- C
  
  [XPU][Inference] Delete redundant squeeze/unsqueeze op. (#54754) · 7f6bb160
  由 csy0225 提交于 6月 21, 2023
  
  7f6bb160

PaddlePaddle / Paddle 接近 2 年 前同步成功

PaddlePaddle / Paddle
接近 2 年前同步成功