提交 · bbf2bc2b8664d00a669a70a800b534d1976c38d7 · PaddlePaddle / Paddle

28 2月, 2023 1 次提交
- Z
  forbid tensorrt_engine op's output is a persistable var (#50932) · bbf2bc2b
  由 zhoutianzi666 提交于 2月 28, 2023
```
* forbid tensorrt_engine op's output is a persistable var
```
  bbf2bc2b
27 2月, 2023 1 次提交
- W
  [TRT] Add sm version check for TensorRT flash attention and cross attention pass/plugin (#50830) · 38dad3b9
  由 Wang Bojun 提交于 2月 27, 2023
```
* add sm version check

* use GetGPUComputeCapability
```
  38dad3b9
24 2月, 2023 1 次提交

由 Sławomir Siwek 提交于 2月 24, 2023

* ConvertToFusedOp

* change static to inline
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

9429936c

23 2月, 2023 2 次提交
- C
  
  [XPU] Migrate xpu_embedding_with_eltwise_add_fuse_pass (#50590) · 8d325d82
  由 csy0225 提交于 2月 23, 2023
  
  8d325d82
- Z
  
  [XPU] optimize multi_encoder_xpu_pass (#50759) · 5c9299e5
  由 zhupengyang 提交于 2月 23, 2023
  
  5c9299e5
22 2月, 2023 1 次提交
- Z
  
  [XPU] link out_max to x_max between xpu_fusion_ops (#50690) · 1fd1c169
  由 zhupengyang 提交于 2月 22, 2023
  
  1fd1c169
21 2月, 2023 1 次提交

Support bw invoke fw (#50260) · d8845735

由 HappyHeavyRain 提交于 2月 21, 2023

* support bw invoke fw

* fix scale in static_backward.yaml

* fix the bug in tensorrt/convert

* move 'scale','sign' into ops.yaml

* add scale_grad of scale in op_compat.yaml

* change generated_static_op in CMakeLists.txt

d8845735

20 2月, 2023 1 次提交
- S
  
  [XPU] fix fc_xpu_fuse_pass (#50569) · 77606f5d
  由 shentanyue 提交于 2月 20, 2023
  
  77606f5d
17 2月, 2023 2 次提交

upgrade oneDNN to 2.7.3 (#46301) · f803b239

由 Sławomir Siwek 提交于 2月 17, 2023

* change SHA

* update to oneDNN 2.7

* update to 2.7.1

* update to 2.7.2

* add supported hardsigmoid

* update to 2.7.3

* limit cpu threads for int8 test

* group activations

f803b239

Z
[XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass,... · 61469eec
由 zhupengyang 提交于 2月 17, 2023
```
[XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass, generate_sequence_xpu kernel (#50570)
```
61469eec

16 2月, 2023 4 次提交

Add matmul_v2 and fused_matmul to the quantization process and adjust Ernie model test (#50354) · 8686a745

由 joanna.wozna.intel 提交于 2月 16, 2023

* Add matmul_v2 to the quantization process and adjust Ernie model test

* Correct cpu_quantize_pass test

* Move op to fuse transformation to placement pass

* Correct test

8686a745

Rewrite mkldnn conv bn fuse pass tester (#50034) · e2aacd21

由 Hulek 提交于 2月 16, 2023

* New onednn test

* checkopoint

* added new test, fixed issue with onednn bias

* fix bias check

* remove prints, refactor code

* delete old test

* update python tests cmake

* Delete depracated conv bias

* Delete outdated bias from convolution test

e2aacd21

S
[XPU][Fleet] Support multi-card infer for xpu (#50490) · 517d8074
由 shentanyue 提交于 2月 16, 2023
```
* support xpu multi-card infer

* add ut

* clean code

* clean code

* fix

* fix

* fix

* fix
```
517d8074
Z

[XPU] fix dropout pass; add multi_encoder_xpu_fuse_pass & multi_encoder_xpu kernel (#50499) · c8aa6405
由 zhupengyang 提交于 2月 16, 2023

c8aa6405

15 2月, 2023 2 次提交

Rewrite conv activation mkldnn fuse pass tester (#49278) · 84beef80

由 Hulek 提交于 2月 15, 2023

* Done

* Deleted old python test, fixed new python test, changed names in parallel_UT

* Revert parallel UT changes

* Revert parallel UT changes v2

* Review fixes and simplification of conv output shape calculation, disabled sqrt from conv_act_duse_pass

* delete sqrt from possible activations from conv_concat_relu test

* review refactor

* merge main

* delete sqrt from list of compatible activations

* Test with no outdated inputs

84beef80

[PHI Decoupling]Remove Profiler header (Part2) (#50183) · 8fabca11

由 YuanRisheng 提交于 2月 15, 2023

* move profiler

* add file

* fix mac compile bugs

* fix ci bugs

* fix mac bugs

* fix ci bugs

* fix compile bugs

* perfect code according comment

8fabca11

14 2月, 2023 1 次提交
- D
  Expand mixed_precision to custom device (#50378) · fcb746cb
  由 duanyanhui 提交于 2月 14, 2023
```
* expand mix_precision to custom_device

* fix bug

* fix bug

* fix comment

* fix DEFINE bug
```
  fcb746cb
13 2月, 2023 1 次提交

Upgrade protobuf to 4.21.x (#49168) · 15d93394

由 risemeup1 提交于 2月 13, 2023

* upgrade protobuf to 3.19.0 in cmake

* recover protobuf python version

* fix distribute compile

* fix

* fix framework.data_feed_pb2

* fix macos ifdef

* fix lite

* test

* update protoc from 3.19.0 t0 3.20.0

* test

* debug

* test

* test

* debug

* debug

* debug

* debug

* test

* debug

* update protocol from 3.20.0 to 4.21.12

* modify graph_brpc_client.h

* modify graph_brpc_client.h

* test

* test

* test

* fix third_party cache problem on build ci

* updata proto

* test

* test

* test

* test

* test

* test

* fix coverage failed test

* try to fix test_exe_fleet_model_run

* fix cinn bug

* fix windows compile problem

* fix python/requirements

---------
Co-authored-by: Npangyoki <pangyoki@126.com>

15d93394

11 2月, 2023 1 次提交

[TRT] elementwise_add+transpose fusion (#50081) · fd0d4fa4

由 Wang Bojun 提交于 2月 11, 2023

* eleadd_trans first version

log fix

* refine code for linear format, add pass check

* linear format refine and ut fix

* fix ut

* windows ut

* windows ut 2

* move tensorMeta and alloc to configure

fd0d4fa4

10 2月, 2023 1 次提交
- Z
  
  [XPU] add fc_xpu op&pass to optimize ernie model (#50277) · 945f918c
  由 zhupengyang 提交于 2月 10, 2023
  
  945f918c
09 2月, 2023 2 次提交
- J
  Adjust mkldnn_placement_pass to check library type and data type (#49899) · ebdf3ef9
  由 joanna.wozna.intel 提交于 2月 09, 2023
```
* Adjust mkldnn_placement_pass to check library type and data type

* Check if var has inputs

* Remove unrelated test

* Refactor
```
  ebdf3ef9
- W
  [TRT] Transpose layernorm fusion with different input format (#50082) · b2bb7ec9
  由 Wang Bojun 提交于 2月 09, 2023
```
* trans_layernorm
```
  b2bb7ec9
08 2月, 2023 3 次提交

fuse quantize+transpose and transpose+dequantize (#49509) · 197a4ffe

由 Paulina Gacek 提交于 2月 08, 2023

* QuantTranpose pattern is being found by pass

* quant + transpose fuse

* code style changes

* UT written, reorder fixed

* Dequantize + transpose2 fuse  added

* pass name changed

* UT added & shift corrected

* got rid of redundancy

* review changes

* AsIntermediate corrected

* compat added

197a4ffe

S
Add bf16 support for fused matmul (#50254) · b47923b4
由 Sławomir Siwek 提交于 2月 08, 2023
```
* add support for bf16 fused_ops

* fused_matmul only
```
b47923b4
Y

Fused attention pass mp support (#50320) · e44ff495
由 Yuang Liu 提交于 2月 08, 2023

e44ff495

07 2月, 2023 1 次提交
- Y
  
  fix op_desc set attr bug (#50281) · 2755507c
  由 Yuanle Liu 提交于 2月 07, 2023
  
  2755507c
06 2月, 2023 2 次提交

Delete extra input (Bias, ResidualData) in OpMaker of conv2d (#49121) · 2deada9a

由 zyfncg 提交于 2月 06, 2023

* remove extra input of conv2d

* fix bug

* fix unittest bug

* adjust conv2d.pbtxt

* fix cpu_quantize_pass_tester

* revert use_addto of conv2d

* fix runtime attribute

* fix bug

* recover force_fp32_output in conv2d

* refine error info

* fix bug

2deada9a

Y

Fused attn pass single ut (#50227) · fcec564c
由 Yuang Liu 提交于 2月 06, 2023

fcec564c

03 2月, 2023 3 次提交

Replace matmul(v2) with fused_matmul during oneDNN fuse passes (#49515) · 5cfe1645

由 Sławomir Siwek 提交于 2月 03, 2023

* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces

* clean attrs in python tests

* delete checkpoint and restore matmul version

* remove unused code

* matmul and reshape/transpose fuses migrated

* split MatmulOneDNN headers

* fuse activation and eltwise_add

* add fuse_activation

* matmul_transpose_reshape/reshape_transpose_matmul

* matmul + elementwise_add (fused)

* activation temporary modifciation

* merge newest develop

* remove depedency from other PR

* revert pbtxt

* remove placeholders from matmul_v2

* add description in OPMaker

* remove matmul_v2_op.h and all depedencies

* remove dims changing in base op

* add possibility to fuse already fused_matmul

* restart broken CI

* Empty-Commit

* revert matmul_utils.h

* codestyle

* adjust imports

* add pbtxt file

* 100% matmul unit tests coverage

* trigger CI with minimal changes to develop

* adjust changes to develop

* add fused_matmul op

* inherit base ops

* add "v2"

* move OPMaker

* Gradually add fused_matmul files

* second batch of fused_matmul changes

* split infershapes of matmul_v2 and fused_matmul

* inherit fused_matmul from matmul_v2

* Update paddle/phi/backends/onednn/onednn_reuse.h
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

* Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

5cfe1645

Rewrite conv testers from cpp to python (#49582) · aa8cef4a

由 Paulina Gacek 提交于 2月 03, 2023

* conv_bias_mkldnn_fuse_pass_tester rewritten

* conv_concat_relu_mkldnn_fuse_pass_tester rewritten

* conv_elementwise_add_fuse_pass_tester rewritten

* mkldnn changed to onednn

* tests added to cmakeLists, style fix

* got rid of unnecessary UT, some style changes

* changes in naming convention

* max_examples reduced

* time out added

aa8cef4a

Y

Fused attention pass backward op replace. (#50186) · 7e8ef328
由 Yuang Liu 提交于 2月 03, 2023

7e8ef328

01 2月, 2023 2 次提交

Y

Fused attention pass fwd, create the fused_attention op. (#50125) · 2b848aef
由 Yuang Liu 提交于 2月 01, 2023

2b848aef

Preln fix (#49802) · e03718f5

由 Wang Bojun 提交于 2月 01, 2023

* preln_residual 2 fused_bias_residual

* skip layernorm fix and ut

* code refine

* code style refine

* fix ut

* fix output

* add trt layer fall back info

* refine op teller and ut

* DropoutMaskOut output fix

e03718f5

31 1月, 2023 2 次提交
- W
  gn_silu (#49928) · 111075a3
  由 wenbin 提交于 1月 31, 2023
```
* gn_silu

* add ut

* set TIMEOUT

* correct comments

* comments

* disable windows ut

* rename parameter
```
  111075a3
- Z
  
  [pass] Upgrade Constant Folding Pass (#49908) · c3cd8502
  由 Zhang Jun 提交于 1月 31, 2023
  
  c3cd8502
30 1月, 2023 1 次提交
- G
  
  depthwise_conv 映射成 conv的逻辑中添加下cudnn版本的判断 (#50058) · 320958eb
  由 gem5 提交于 1月 30, 2023
  
  320958eb
29 1月, 2023 1 次提交
- Y
  
  Fused attention pass backward pattern (#49855) · 8e02f290
  由 Yuang Liu 提交于 1月 29, 2023
  
  8e02f290
18 1月, 2023 1 次提交

Handle repetitive code in oneDNN activation fuse passes (#49824) · a1b2e1e2

由 Sławomir Siwek 提交于 1月 18, 2023

* extract fuse pass logic to header file

* adjust namespaces

* Update paddle/fluid/framework/ir/mkldnn/activation_onednn_fuse_pass.h

update date
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

* add inline remove static
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

a1b2e1e2

17 1月, 2023 1 次提交

Rewrite mat reshape transpose testers (#49580) · d9d47dc6

由 Paulina Gacek 提交于 1月 17, 2023

* reshape_transpose_matmul_pass_tester rewritten

* matmul_transpose_reshape_pass_tester rewritten

* mkldnn to onednn

d9d47dc6

16 1月, 2023 1 次提交
- Y
  [Paddle-TRT] support nhwc (#49633) · e43f7102
  由 Yuanle Liu 提交于 1月 16, 2023
```
* add trt_support_nhwc_pass
```
  e43f7102

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功