提交 · b8d7f80142eaa96764a14d122ac7c2693d85f40f · PaddlePaddle / Paddle

23 8月, 2023 1 次提交
- T
  
  Add fuse pass to remove duplicated transpose ops (#56326) · b8d7f801
  由 Travis-Lee 提交于 8月 23, 2023
  
  b8d7f801
18 8月, 2023 1 次提交

[Inference] Make share_external_data supports bf16 and bool; fix while_op... · c65ef07c

由 lzy 提交于 8月 18, 2023

[Inference] Make share_external_data supports bf16 and bool; fix while_op cache_inference_while_scope when using fleet_executor. (#56055)

* 1. make share_external_data supports bf16 and bool; 2. don't drop_kids when cache_inference_while_scope

* fix FLAGS_cache_inference_while_scope

* add unitest

* add unitest

* skip unitest when cudnn_version < 8100

* skip test share_external_data_bf16 when CUDA_ARCH < 80

c65ef07c

17 8月, 2023 1 次提交

Add MarkTrtEngineOutputs API (#56188) · 2abf4326

由 ming1753 提交于 8月 17, 2023

* [paddle-TRT] support mark output

* [fix bug] hook function only call one in different predictor

* add api test

2abf4326

16 8月, 2023 1 次提交
- J
  
  [XPU] Add fast_layernorm_xpu_fuse_pass and fast_layernorm_xpu plugin (#56269) · f16e1869
  由 jiangfan06 提交于 8月 16, 2023
  
  f16e1869
14 8月, 2023 1 次提交
- C
  
  [clang-tidy] No.31 enable modernize-use-bool-literals (#56216) · 2c307457
  由 cyberslack_lee 提交于 8月 14, 2023
  
  2c307457
10 8月, 2023 1 次提交
- C
  
  [XPU] Add transfilter when conv2d op dilation > 1 (#55978) · 81c56e27
  由 csy0225 提交于 8月 10, 2023
  
  81c56e27
09 8月, 2023 2 次提交
- X
  [oneDNN]rename macro to PADDLE_WITH_DNNL (#52208) · 6ff4c130
  由 Xinyu Chen 提交于 8月 09, 2023
```
* onednn: rename macro to PADDLE_WITH_DNNL

* onednn: rename macro to CINN_WITH_DNNL
```
  6ff4c130
- R
  
  [clang-tidy] fix modernize-make-unique (#55764) · 9f04f2ac
  由 Ruibin Cheung 提交于 8月 09, 2023
  
  9f04f2ac
07 8月, 2023 3 次提交
- Y
  [Inference] save_optimized_model_pass support tensorrt (#55893) · 6b10c0e5
  由 Yuanle Liu 提交于 8月 07, 2023
```
* fix cudnn 8.7+ bug on cudnnConvolutionBiasActivationForward

* save_optimized_model_pass support tensorrt

* update

* update

* fix compile

* update

* fix ut timeout
```
  6b10c0e5
- G
  
  [clang-tidy] NO.6 enable `modernize-avoid-c-arrays` step: 2 (#55954) · 5ada98b8
  由 gouzil 提交于 8月 07, 2023
  
  5ada98b8
- R
  
  [clang-tidy] enable modernize-use-equals-default (#55983) · 30a02d27
  由 Ruibin Cheung 提交于 8月 07, 2023
  
  30a02d27
04 8月, 2023 1 次提交

[clang-tidy] enable modernize-use-emplace (#55799) · 469a0392

由 Ruibin Cheung 提交于 8月 04, 2023

* [clang-tidy] enable modernize-use-emplace

* Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into modernize_use_emplace

469a0392

03 8月, 2023 2 次提交
- W
  
  [clang-tidy] [No.4] enable `modernize-loop-convert` (#55704) · 81ccd99e
  由 Wang Xin 提交于 8月 03, 2023
  
  81ccd99e
- W
  
  eliminate small pattern (#55843) · dc4b48f6
  由 wz1qqx 提交于 8月 03, 2023
  
  dc4b48f6
02 8月, 2023 2 次提交
- W
  
  [XPU]Add conv1d fuse pass (#55719) · 22c7a6eb
  由 wz1qqx 提交于 8月 02, 2023
  
  22c7a6eb
- J
  
  [XPU] Add gather_squeeze_pass (#55605) · d13a49d6
  由 jiangfan06 提交于 8月 02, 2023
  
  d13a49d6
01 8月, 2023 1 次提交
- H
  
  [XPU] Add fast_where fusion op and XPU micro kernel (#55628) · 07e788f1
  由 hong19860320 提交于 8月 01, 2023
  
  07e788f1
27 7月, 2023 1 次提交
- M
  [Paddle-TRT] add flip op (#55688) · d608170a
  由 ming1753 提交于 7月 27, 2023
```
* [Paddle-TRT] add flip op
```
  d608170a
24 7月, 2023 2 次提交

[Paddle-TRT] Convert 0D tensor to 1D tensor, increase the shape tensor's... · a3cf25e3

由 chen 提交于 7月 24, 2023

[Paddle-TRT] Convert 0D tensor to 1D tensor, increase the shape tensor's number count when collecting shape (#55503)

* make 0-D tensor to 1-D tensor to support Grounding-SAM and add shape check

* recover identity_op_clean_pass.cc

a3cf25e3

onednn: remove fc_elementwise_add fusion (#55504) · bea1f04c

由 Xinyu Chen 提交于 7月 24, 2023

* onednn: remove fc+eltwiseadd fusion pass
* onednn: remove post-sum fusion in fc kernel
* onednn: tests: make unfused add run into f32

bea1f04c

21 7月, 2023 2 次提交
- Y
  [Inference] save_optimized_model_pass support gpu (#55551) · 4b3ac86d
  由 Yuanle Liu 提交于 7月 21, 2023
```
* fix cudnn 8.7+ bug on cudnnConvolutionBiasActivationForward

* save_optimized_model_pass support gpu
```
  4b3ac86d
- R
  
  [clang-tidy] enable modernize-make-unique (#55506) · 45d49619
  由 Ruibin Cheung 提交于 7月 21, 2023
  
  45d49619
20 7月, 2023 2 次提交
- L
  Fix UT failure (#55360) · 7eeff7b1
  由 Leo Chen 提交于 7月 20, 2023
```
* Fix TRT multihead matmul UT failure
```
  7eeff7b1
- Z
  
  [XPU] fuse cast to conv2d/fc in mixed precision model (#54493) · 4df00939
  由 zhupengyang 提交于 7月 20, 2023
  
  4df00939
19 7月, 2023 2 次提交
- C
  
  add TRT op unbind (#55476) · 4a55f5e7
  由 chen 提交于 7月 19, 2023
  
  4a55f5e7
- C
  
  Delete repeat ops add gather squeeze unsqueeze (#55371) · 552ed8d8
  由 csy0225 提交于 7月 19, 2023
  
  552ed8d8
17 7月, 2023 1 次提交
- M
  [Paddle-TRT] add assign op (#55426) · d778737e
  由 ming1753 提交于 7月 17, 2023
```
* [Paddle-TRT] add assign op
```
  d778737e
13 7月, 2023 1 次提交
- Y
  [BugFix] Replace include dense_tensor.h with forward declare in phi lib (#55396) · 9619443b
  由 Yuanle Liu 提交于 7月 13, 2023
```
* copy dense_tensor.h to inference lib

* update

* update
```
  9619443b
12 7月, 2023 3 次提交

Y
[Inference] rewrite identity_op_clean_pass (#55240) · 2363e623
由 Yuanle Liu 提交于 7月 12, 2023
```
* rewrite identity_op_clean_pass

* fix

* adjust identity_op_clean_pass order in gpu passes

* fix ut
```
2363e623

[ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7

由 YangQun 提交于 7月 12, 2023

* squash pick the poc code
* fix build after rebase
* fix int8 conv and fc uts
* Fix and clean-up Get_SRC_Scale_Memory
* fix floating point fc uts
* fix test_analyzer_int8_googlenet
* test_analyzer_int8_mobilenetv1
* fix int8 mobilenet v2 and v3
* fix build error after rebase
* [oneDNN] rename library version
* fix conv bias datatype
* try to fix import error
* fix rebase error
* [oneDNN] pack library into python wheel
* add MKLDNN_SHARED_LIB_3 to env_dict
* fix test_analyzer_bert
* fix fill_constant op kernel
* fix ernie and matmul op ut
* fix softplus ut
* fix conv+relu6 fusion ut
* fix hardswish fusion
* fix quant+transpose fusion ut
* fixsgd ut
* fix int8 matmul with flatten
* fix fc+scale fusion
* fix conv/matmul+gelu fusion uts
* fix rebase error
* Revert "fix conv/matmul+gelu fusion uts"
This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
* upgrade to onednn v3.1
* remove older version onednn
* use densetensor::data() for achieving mean and var in layernorm impl
* comments for atol of integer tests
* fix clang-format
* Revert "remove older version onednn"
This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
* improve binary handle
* fix expand kernel
* Revert "use densetensor::data() for achieving mean and var in layernorm impl"
* always use forward_inference for conv
* remove activation scales
* rollback changes to mkldnn.cmake
* address comments
* port changes to dequantize kernel
* fix merge error
* fix fused_elementwise_kernel
* upgrade onednn version to v3.1.1
* fix some approval error
* fix error msg format
* remove old onednn libs
* try to fix symbolic link issue
* fix cinn test case segfault
* do not explicit link test with onednn
* remove unnecessary changes
* integrate CINN with onednn v3
* link with mkldnn project
* fix cinn build file

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
Co-authored-by: NChen, Xinyu1 <xinyu1.chen@intel.com>
Co-authored-by: Ntianshuo78520a <707759223@qq.com>

cfa513f7

[clang-tidy] enable `readability-container-size-empty` check (#55279) · be3a6fa7

由 Wang Xin 提交于 7月 12, 2023

* [clang-tidy] enable readability-container-size-empty check

* fix test_custom_kernel Failed

* add clang-tid-10 in dockerfile

* add clang-tidy in dockerfile

* fix bug

be3a6fa7

07 7月, 2023 2 次提交
- W
  
  [XPU] Add layernorm fuse pass (#55154) · eb12739e
  由 wz1qqx 提交于 7月 07, 2023
  
  eb12739e
- W
  
  [XPU] Eliminate small ops (#55193) · b8f265d2
  由 wz1qqx 提交于 7月 07, 2023
  
  b8f265d2
05 7月, 2023 1 次提交
- W
  
  [XPU] add reduce_max_fuse_pass (#54981) · 54a101d5
  由 wz1qqx 提交于 7月 05, 2023
  
  54a101d5
03 7月, 2023 1 次提交
- 周
  [Paddle-TRT] use hook to collect shape in CollectShapeRangeInfo API. (#54841) · 989f3dde
  由周周周提交于 7月 03, 2023
```
* commit

* commit

* commit

* commit

* final commit

* use hook to collect shape and shape value
```
  989f3dde
30 6月, 2023 1 次提交
- M
  
  [XPU] Add conv2d transpose fuse pass (#54904) · 12c15b89
  由 mjp9527 提交于 6月 30, 2023
  
  12c15b89
29 6月, 2023 2 次提交
- W
  
  [XPU]add layer_norm fuse pass (#54930) · b94b3ac0
  由 wz1qqx 提交于 6月 28, 2023
  
  b94b3ac0
- W
  
  add lookup_table op for Paddle-TRT (#54882) · 7c89b972
  由 Wangzheee 提交于 6月 29, 2023
  
  7c89b972
28 6月, 2023 1 次提交
- B
  [inference][trt]add Einsum op (#54860) · 69bf5ee8
  由 bukejiyu 提交于 6月 28, 2023
```
* add einsum layer
```
  69bf5ee8
27 6月, 2023 1 次提交
- Z
  
  delete_assign_op_pass (#54887) · 813266a2
  由 zhupengyang 提交于 6月 27, 2023
  
  813266a2

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功