1. 09 Feb 2023 (2 commits)
    • [Paddle-TRT] GroupNorm int8 nchw32 fake kernel (#50146) · d93c63a0
      committed by zhoutianzi666
      * add fmha_flashattention oss plugin
      
      * add fmhca
      
      * add oss fmhca
      
      * code reconstruct and add ut
      
      * code style refine
      
      * fix ut and enforce check
      
      * refine trt version check
      
      refine compile
      
      fix compile
      
      * fix cross ut
      
      * code refine
      
      * use runtime trt version check
      
      * bug fix and code refine
      
      * compile fix
      
      * merge develop
      
      * add GN QDQ kernel
      
      * support GN int8 fake kernel
      
      * add with_int8
      
      * add GN int8 fake kernel
      
      * add GN int8 UT
      
      * add version > 8000 check in GN int8 UT
      
      * add some check in .cu
      
      * add stdlib.h in UT
      
      * little change  in .cu
      
      * replace rand_r with rand
      
      * remove use of rand
      
      * setAxis(1)
      
      * when int8 is on, allow fallback to fp16
      
      ---------
      Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>
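      The "fake" int8 kernel above simulates int8 quantization while keeping the math in floating point. A minimal numpy sketch of the idea (this is an illustration, not Paddle's actual TRT plugin; the scale value and group count are made up):

      ```python
      import numpy as np

      def fake_quant_int8(x, scale):
          # "Fake" int8: quantize -> clamp -> dequantize, math stays in fp32.
          q = np.clip(np.round(x / scale), -128, 127)
          return q * scale

      def group_norm(x, groups, gamma, beta, eps=1e-5):
          # x is NCHW; normalize each group of channels independently.
          n, c, h, w = x.shape
          g = x.reshape(n, groups, c // groups, h, w)
          mean = g.mean(axis=(2, 3, 4), keepdims=True)
          var = g.var(axis=(2, 3, 4), keepdims=True)
          g = (g - mean) / np.sqrt(var + eps)
          out = g.reshape(n, c, h, w)
          return out * gamma.reshape(1, c, 1, 1) + beta.reshape(1, c, 1, 1)

      # Fake-quantized input fed into GroupNorm, as the fake kernel would.
      x = np.random.randn(2, 8, 4, 4).astype(np.float32)
      y = group_norm(fake_quant_int8(x, scale=0.05), groups=4,
                     gamma=np.ones(8, np.float32), beta=np.zeros(8, np.float32))
      ```

      A real int8 kernel would operate on nchw32-packed int8 data; the fake kernel only has to match its numerics, which is what the UTs above exercise.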
    • [TRT] Transpose layernorm fusion with different input format (#50082) · b2bb7ec9
      committed by Wang Bojun
      * trans_layernorm
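      The fused pattern replaces a transpose (NCHW to NHWC) followed by a layernorm over the channel axis. A numpy sketch of the reference computation the fusion must reproduce (illustrative only; shapes are made up):

      ```python
      import numpy as np

      def transpose_layernorm(x, gamma, beta, eps=1e-5):
          # NCHW input; layernorm runs over the channel axis of the
          # transposed NHWC view, which the fused kernel does in one pass.
          n, c, h, w = x.shape
          t = x.transpose(0, 2, 3, 1).reshape(-1, c)   # (N*H*W, C)
          mean = t.mean(axis=1, keepdims=True)
          var = t.var(axis=1, keepdims=True)
          out = (t - mean) / np.sqrt(var + eps) * gamma + beta
          return out.reshape(n, h, w, c)

      x = np.random.randn(1, 16, 8, 8).astype(np.float32)
      y = transpose_layernorm(x, np.ones(16, np.float32),
                              np.zeros(16, np.float32))
      ```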
  2. 08 Feb 2023 (2 commits)
  3. 07 Feb 2023 (1 commit)
  4. 06 Feb 2023 (2 commits)
  5. 01 Feb 2023 (1 commit)
    • Preln fix (#49802) · e03718f5
      committed by Wang Bojun
      * preln_residual to fused_bias_residual
      
      * skip layernorm fix and ut
      
      * code refine
      
      * code style refine
      
      * fix ut
      
      * fix output
      
      * add trt layer fall back info
      
      * refine op teller and ut
      
      * DropoutMaskOut output fix
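      In the PreLN pattern, the fused op returns two tensors: the raw bias + residual sum (fed to the next residual branch) and its layernorm (fed to the next block). A numpy sketch of that contract, assuming a standard PreLN layout (illustrative, not Paddle's fused kernel):

      ```python
      import numpy as np

      def preln_fused_bias_residual(x, residual, bias, gamma, beta, eps=1e-5):
          # Fusing bias add + residual add + layernorm into one op avoids
          # three separate kernel launches; both outputs are needed.
          summed = x + residual + bias
          mean = summed.mean(axis=-1, keepdims=True)
          var = summed.var(axis=-1, keepdims=True)
          normed = (summed - mean) / np.sqrt(var + eps) * gamma + beta
          return normed, summed

      x = np.random.randn(2, 4, 16).astype(np.float32)
      normed, summed = preln_fused_bias_residual(
          x, x, np.zeros(16, np.float32),
          np.ones(16, np.float32), np.zeros(16, np.float32))
      ```

      The "fix output" item above is the kind of bug this dual-output contract invites: returning the normed tensor where the raw sum was expected, or vice versa.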
  6. 31 Jan 2023 (3 commits)
  7. 18 Jan 2023 (1 commit)
  8. 17 Jan 2023 (1 commit)
    • [PHI] Change feed_op to phi kernel (#49116) · f7f1dc03
      committed by YuanRisheng
      * change feed_op to phi kernel
      
      * fix ci bugs
      
      * fix build bugs
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix ci bugs
      
      * polish code
      
      * polish code comments
      
      * fix install bugs
      
      * modify code according to review comments
      
      * remove visitor in feed_op
      
      * modify according to review comments
      
      * polish code according to review comments
      
      * add infershape
      
      * fix py3 bugs
      
      * fix GetExpectedKernelType
      
      * fix ci bugs
      
      * add registry for custom device
      
      * fix py3 bugs
      
      * fix floating point error
      
      * fix py3 test bugs
  9. 16 Jan 2023 (1 commit)
  10. 13 Jan 2023 (2 commits)
    • add oss flash fmha and fmhca support (#49438) · a48b8e2c
      committed by Wang Bojun
      * add fmha_flashattention oss plugin
    • [inference][trt] set output data type of trt network (#49712) · 690d7a69
      committed by Zhang Jun
      * update trt engine to set in/out data type
      
      * update
      
      * Update engine.cc
      
      * update
      
      * set engine output type before freeze the network
      
      * update
      
      * update trt autoscan ut
      
      * update
      
      * update ut
      
      * fix equal bug, update ut
      
      * fix cast and equal ut
      
      * update cast ut using TRT < 8.4
      
      * set datatype from scope
      
      * check output var is nullptr
      
      * Update op_converter.h
      
      * update tensorrt_engine_op_test ut
      
      * update
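      The "fix equal bug" and "fix cast" items stem from comparison ops producing bool outputs, which must be cast to the declared network output type before the network is frozen. A toy numpy analogue of the issue (this is not the TensorRT API, just the dtype behavior the fix guards against):

      ```python
      import numpy as np

      a = np.array([1, 2, 3], dtype=np.int32)
      b = np.array([1, 0, 3], dtype=np.int32)

      eq = np.equal(a, b)         # comparison yields dtype=bool
      out = eq.astype(np.int32)   # explicit cast, like setting the
                                  # engine output type before freezing
      ```

      In TensorRT terms, the fix sets each output tensor's data type on the network definition before building, so downstream consumers see the declared dtype rather than the op's default.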
  11. 12 Jan 2023 (2 commits)
  12. 11 Jan 2023 (2 commits)
  13. 10 Jan 2023 (5 commits)
  14. 09 Jan 2023 (2 commits)
  15. 05 Jan 2023 (2 commits)
  16. 04 Jan 2023 (1 commit)
  17. 03 Jan 2023 (2 commits)
  18. 30 Dec 2022 (2 commits)
  19. 28 Dec 2022 (1 commit)
  20. 23 Dec 2022 (3 commits)
  21. 22 Dec 2022 (2 commits)