提交 · db407bf0531ea8f3deb183c110a6e79cc2389344 · PaddlePaddle / Paddle

12 5月, 2023 1 次提交
- W
  
  fix gpu mem alloc: use phi::memory_utils::Alloc (#53721) · 6cd7609a
  由 Wangzheee 提交于 5月 12, 2023
  
  6cd7609a
08 5月, 2023 1 次提交
- Y
  [Paddle-TRT] add generic plugin for lookup_table_v2(embedding) op (#53539) · fca8595e
  由 Yuanle Liu 提交于 5月 08, 2023
```
* add embedding generic plugin， not enabled
```
  fca8595e
25 4月, 2023 1 次提交
- Y
  
  [Paddle Inference] add generic plugin for p_norm (#53278) · 00f747f2
  由 Yuanle Liu 提交于 4月 25, 2023
  
  00f747f2
13 4月, 2023 1 次提交
- Z
  [Paddle-TRT]fix bilinear_interp_v2 && some other bugs in trt 7011 (#52753) · dc8d6a1a
  由 zhoutianzi666 提交于 4月 13, 2023
```
* fix bilinear_interp_v2 && some other bugs in trt 7011

* add version check in test_trt_convert_bilinear_interp_v2.py
```
  dc8d6a1a
31 3月, 2023 1 次提交

FIX_LINUX_Wternimate (#52307) · ffff133b

由 Galaxy1458 提交于 3月 31, 2023

* this is a test pr, test=develop

* solve the four [-Wterminate] warning, test=develop

* solve the four [-Wterminate] warning, test=develop

* new fix [-Wterminate], test=delelop

* new fix [-Wterminate], test=delelop

* new fix [-Wterminate], test=delelop

* new , test = develop

* new , test = develop

* new , test = develop

* new , test = develop

* new , test = develop

* new , test = develop

ffff133b

28 3月, 2023 1 次提交

Add basic functionalities to support Scalar & Scalars in op attr (#51984) · 2e9fd5e4

由 Feiyu Chan 提交于 3月 28, 2023

Add basic functionalities to support Scalar & Scalars in operator attribute.

1. extend allowed types in operator's attribute type, add `paddle::experimental::Scalar`, add corresponding protobuf Message types;
2. Scalar enhancement, add formatting, equality;
3. add code to handle Scalar & Scalars in opmaker, conversion from paddle operator to phi kernel, opdesc construction and manipulation, tensorrt converter, tracer, operator construction, etc;
4. bind `paddle::experimental::Scalar` to python, as `libpaddle.Scalar`;
5. add functionality to canonicalize attribute map according to OpProto(if the op the attribute map used for has an OpProto);
6. add code to manipulate Scalar proto message via protobuffer python API;

Add unittests.

1. add test cases for formatting, equality for Scalars, and WrapAsScalars;
2. add test cases for 'casting' between different morphs of attributes;
3. add test cases for extracting scalar & scalars from attribute;
4. add test cases for CanonicalizeScalarAttrs(and fix a bug in type index offset);
5. fix gmock's library filename on windows platform.
6. clean code: use canonicalize_attrs instead of inlining the function;
7. add test cases for libpaddle.Scalar in python code.
8. add test cases for `make_scalar_proto`, which manipulate proto message `Scalar` via protobuffer python API.

2e9fd5e4

22 3月, 2023 1 次提交
- W
  fix embd for S (#51937) · 40115c7e
  由 Wangzheee 提交于 3月 22, 2023
```
fix embd plugin: S = mask_id.d[1]
```
  40115c7e
21 3月, 2023 1 次提交
- Z
  [Paddle-TRT] fix GN when params.c% params.cPerBlock != 0 (#51836) · e35afed7
  由 zhoutianzi666 提交于 3月 21, 2023
```
* fix GN when params.c% params.cPerBlock != 0

* fix GN when params.cnot divisable by params.cPerBlock
```
  e35afed7
16 3月, 2023 1 次提交
- X
  Add Deformable Conv Dynamic Shape Support (#50698) · 86bf8274
  由 xjmxyt 提交于 3月 16, 2023
```
* add dynamic support

* add more test

* fix bug

* change test

* change test
```
  86bf8274
24 2月, 2023 1 次提交
- Z
  [Paddle-TRT] Fix QkvToContextPluginDynamic bug (#50715) · 612d5da0
  由 zhoutianzi666 提交于 2月 24, 2023
```
* fix multihead

* fix multihead
```
  612d5da0
20 2月, 2023 1 次提交
- W
  
  fix mutable_data() (#50396) · c47f11f5
  由 Wang Bojun 提交于 2月 20, 2023
  
  c47f11f5
16 2月, 2023 1 次提交

[Phi decouple] move layer_norm_kernel.cu.h to phi (#50506) · 8910bb4a

由 Huang Jiyi 提交于 2月 16, 2023

* move layer_norm_kernel.cu.h to phi

* fix bugs

* fix namespace

* fix bugs

* fix CI-Windwos

* replace mutable_data

* fix bugs

* fix bugs

8910bb4a

11 2月, 2023 1 次提交

[TRT] elementwise_add+transpose fusion (#50081) · fd0d4fa4

由 Wang Bojun 提交于 2月 11, 2023

* eleadd_trans first version

log fix

* refine code for linear format, add pass check

* linear format refine and ut fix

* fix ut

* windows ut

* windows ut 2

* move tensorMeta and alloc to configure

fd0d4fa4

09 2月, 2023 2 次提交

[Paddle-TRT] GroupNorm int8 nchw32 fake kernel (#50146) · d93c63a0

由 zhoutianzi666 提交于 2月 09, 2023

* add fmha_flashattention oss plugin

* add fmhca

* add oss fmhca

* code reconstruct and add ut

* code style refine

* fix ut and enforce check

* refine trt version check

refine compile

fix compile

* fix cross ut

* code refine

* use runtime trt version check

* bug fix and code refine

* compile fix

* merge develop

* add GN QDQ kernel

* support GN int8 fake kernel

* add with_int8

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8  UT

* add verison > 8000  in GN int8  UT

* add some check in .cu

* add stdlib.h in UT

* little change  in .cu

* remove rand_r use rand

* remove use rand

* setAxis(1)

* when int8 is on allow fall back to fp16

---------
Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>

d93c63a0

W
[TRT] Transpose layernorm fusion with different input format (#50082) · b2bb7ec9
由 Wang Bojun 提交于 2月 09, 2023
```
* trans_layernorm
```
b2bb7ec9

31 1月, 2023 1 次提交

gn_silu (#49928) · 111075a3

由 wenbin 提交于 1月 31, 2023

* gn_silu

* add ut

* set TIMEOUT

* correct comments

* comments

* disable windows ut

* rename parameter

111075a3

12 1月, 2023 1 次提交
- W
  more preln_gn patterns (#49728) · adcb0039
  由 wenbin 提交于 1月 12, 2023
```
* compile fix

* fix compile

* compile fix

* add more preln
```
  adcb0039
11 1月, 2023 1 次提交
- W
  Compile fix (#49690) · 2fe896df
  由 wenbin 提交于 1月 11, 2023
```
* compile fix

* fix compile

* compile fix
```
  2fe896df
10 1月, 2023 3 次提交
- X
  
  fix_generic_fp16 (#49682) · b854f259
  由 xiaoxiaohehe001 提交于 1月 10, 2023
  
  b854f259
- W
  gn bug fix (#49658) · acab7daf
  由 wenbin 提交于 1月 10, 2023
```
* gn bug fix

* bug fix

* gn bug fix
```
  acab7daf
- Refine name style and MoeKernel (#49432) · 39210ed0
  由 MarDino 提交于 1月 10, 2023
  
  39210ed0
09 1月, 2023 1 次提交

Preln groupnorm (#49463) · 591be3bd

由 wenbin 提交于 1月 09, 2023

* skip_groupnorm

* init

* preln

* add ut

* more assert

* set timeout

* fix windows ci issue

591be3bd

23 12月, 2022 2 次提交
- L
  
  make FusedMultiTransformer supports RoPE (#48842) · 644dfc60
  由 lzy 提交于 12月 23, 2022
  
  644dfc60
- W
  [Paddle Inference]add ouutput(CLSInds) for fused_token_prune (#49271) · 4ed6eeab
  由 Wangzheee 提交于 12月 23, 2022
```
* add ouutput(CLSInds) for fused_token_prune
```
  4ed6eeab
21 12月, 2022 1 次提交
- W
  [Paddle Inference]optimize token prune for no varlen (#49094) · 65c17315
  由 Wangzheee 提交于 12月 21, 2022
```
* optimize token prune for no varlen
```
  65c17315
20 12月, 2022 1 次提交
- W
  groupnorm nhwc8 (#49160) · babd26ee
  由 wenbin 提交于 12月 20, 2022
```
* gn nhwc8

* remove error
```
  babd26ee
19 12月, 2022 1 次提交
- W
  [Paddle Inference] General optimization for no_varlen skiplayernorm (#49039) · b50dbe0b
  由 Wangzheee 提交于 12月 19, 2022
```
* General optimization for no_varlen embedding layernorm
```
  b50dbe0b
15 12月, 2022 1 次提交
- W
  
  fix embedding multihead (#49085) · 439b2b94
  由 Wangzheee 提交于 12月 15, 2022
  
  439b2b94
13 12月, 2022 2 次提交
- W
  
  Enable Generic-Plugin support FP16 (#48807) · 5d49e3e9
  由 weishengying 提交于 12月 13, 2022
  
  5d49e3e9
- W
  [Paddle Inference]fix some transformer unitest (#48929) · cb7f736f
  由 Wangzheee 提交于 12月 13, 2022
```
* fix some transformer unitest
```
  cb7f736f
08 12月, 2022 1 次提交
- W
  [Paddle Inference] General optimization for no_varlen embedding layernorm (#48580) · 22bfa579
  由 Wangzheee 提交于 12月 08, 2022
```
* general optimization no_varlen embedding layernorm
```
  22bfa579
05 12月, 2022 1 次提交

Reverse roll fuse (#46914) · feb68dd1

由 Wang Bojun 提交于 12月 05, 2022

* pass

* pass

* draft version

* share mem opt

* remove sharemem

* add pattern for the case with circle_shift=0

* add UT

* pass opt

* test_fix

* code-commit

* code-style

* code style

* code-style

* ut-fix

* op teller refine

* resolve conflict

* adjust position op_teller list and pass order for swin

* ut code style update

* adjust paddle pass order

* refine pass order

* refine pass order

* refine pass order

feb68dd1

01 12月, 2022 3 次提交
- W
  [Paddle Inference] General optimization for no_varlen multihead (#48469) · e5cf75d8
  由 Wangzheee 提交于 12月 01, 2022
```
* general optimization for no_varlen multihead
```
  e5cf75d8
- Z
  [inference][trt] dynamic shape support for Instance norm (#47998) · 758fccfe
  由 Zhang Jun 提交于 12月 01, 2022
```
* instance norm support dynamic shape
* update unittest
```
  758fccfe
- Z
  [inference][trt] Fp16 support for Generic plugin (#48253) · 2bdad6cd
  由 Zhang Jun 提交于 12月 01, 2022
```
* Support FP16 in generic TensorRT plugin.
* Support FP16 for Pad3D.
```
  2bdad6cd
28 11月, 2022 1 次提交
- W
  fix: multihead matmul biasqk broadcast support for [1,1,seq,seq] shape (#47975) · 11b9d85f
  由 Wang Bojun 提交于 11月 28, 2022
```
* add trt support
```
  11b9d85f
25 11月, 2022 3 次提交
- Z
  fix loopup_table plugin deserialize size error (#48379) · 128ef1ae
  由 zhangxin81 提交于 11月 25, 2022
```
* fix loopup_table plugin deserialize size error
```
  128ef1ae
- W
  [Paddle Inference]fix token prune plugin (#48367) · c6de4342
  由 Wangzheee 提交于 11月 25, 2022
```
* fix
```
  c6de4342
- W
  Group norm fp16 support (#48222) · 34fd65cf
  由 Wang Bojun 提交于 11月 25, 2022
```
* group norm fp16 support
```
  34fd65cf
24 11月, 2022 1 次提交
- W
  [Paddle Inference]optimize token prune for Paddle-TensorRT (#48241) · 29782728
  由 Wangzheee 提交于 11月 24, 2022
```
* optimize token prune
```
  29782728

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功