提交 · 1007690b253190e432340aa950af7982f637f750 · PaddlePaddle / Paddle

18 5月, 2023 1 次提交

[inference][trt]Remove trt sparse weight api (#53905) · 1007690b

由 Zhang Jun 提交于 5月 18, 2023

* Revert "[inference][trt]add trt sparse weights switch (#53562)"

This reverts commit 4a69a536.

* remove kSPARSE_WEIGHTS

* remove kFASTER_DYNAMIC_SHAPES_0805 and add 'TrtMajorVersion' function

1007690b

16 5月, 2023 1 次提交
- Y
  [Inference] clean unused code/target for reduce inference so volume (PART I) (#53762) · 51ecd933
  由 Yuanle Liu 提交于 5月 16, 2023
```
* remove prelu land ookuip_table plugin, adjust .h include location

* clean code and adjust some .h

* update
```
  51ecd933
11 5月, 2023 1 次提交
- Z
  
  [inference][trt]add trt sparse weights switch (#53562) · 4a69a536
  由 Zhang Jun 提交于 5月 11, 2023
  
  4a69a536
09 5月, 2023 1 次提交
- W
  
  Support trt cuda graph. (#53406) · ea0abf93
  由 Wilber 提交于 5月 09, 2023
  
  ea0abf93
25 4月, 2023 1 次提交

[PHI]Add flags macro for PHI (#52991) · 22e96bde

由 YuanRisheng 提交于 4月 25, 2023

* add flags for phi

* fix compile bugs

* fix ci bugs

* fix inference bugs

* fix cinn' bugs

* fix cinn bugs

* perfect code according comment

* fix ci bugs

* fix ci bugs

22e96bde

21 4月, 2023 1 次提交

support 0-D output and 0-D as indice in __getitem__/__setitem__ (#52814) · 4e939c89

由 JYChen 提交于 4月 21, 2023

* support 0-D output and 0-D as indice in __getitem__

* fix tests

* fix inference and UT

* add unittest for setitem

* fix xpu test

* fix xpu 0-d

4e939c89

17 4月, 2023 1 次提交
- J
  
  Support trt engine auto build in runtime for dynamic shape (#52162) · ebc58548
  由 JingZhuangzhuang 提交于 4月 17, 2023
  
  ebc58548
27 2月, 2023 1 次提交
- G
  
  change message info (#50546) · 097402d9
  由 gaoziyuan 提交于 2月 27, 2023
  
  097402d9
24 2月, 2023 1 次提交
- Z
  [Paddle-TRT] allow plugin fall back to fp16 when int8 (#50554) · f24eadd9
  由 zhoutianzi666 提交于 2月 24, 2023
```
* allow fall back to fp16 when int8

* refine code

* refine code

* refine code
```
  f24eadd9
09 2月, 2023 1 次提交

[Paddle-TRT] GroupNorm int8 nchw32 fake kernel (#50146) · d93c63a0

由 zhoutianzi666 提交于 2月 09, 2023

* add fmha_flashattention oss plugin

* add fmhca

* add oss fmhca

* code reconstruct and add ut

* code style refine

* fix ut and enforce check

* refine trt version check

refine compile

fix compile

* fix cross ut

* code refine

* use runtime trt version check

* bug fix and code refine

* compile fix

* merge develop

* add GN QDQ kernel

* support GN int8 fake kernel

* add with_int8

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8  UT

* add verison > 8000  in GN int8  UT

* add some check in .cu

* add stdlib.h in UT

* little change  in .cu

* remove rand_r use rand

* remove use rand

* setAxis(1)

* when int8 is on allow fall back to fp16

---------
Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>

d93c63a0

13 1月, 2023 1 次提交

[inference][trt]set output data type of trt network (#49712) · 690d7a69

由 Zhang Jun 提交于 1月 13, 2023

* update trt engine to set in/out data type

* update

* Update engine.cc

* Update engine.cc

* update

* set engine output type before freeze the network

* update

* update trt autoscan ut

* update

* update ut

* fix equal bug, update ut

* fix cast and equal ut

* update cast ut using TRT < 8.4

* set datatype from scope

* check output var is nullptr

* Update op_converter.h

* update tensorrt_engine_op_test ut

* update

690d7a69

05 1月, 2023 1 次提交
- X
  
  [Paddle Inference] Add ci flags for a persistent IBuilder. (#49538) · fcd6d675
  由 xiaoxiaohehe001 提交于 1月 05, 2023
  
  fcd6d675
04 1月, 2023 1 次提交
- Y
  
  update vlog output (#49541) · bbc6dd94
  由 Yuanle Liu 提交于 1月 04, 2023
  
  bbc6dd94
28 12月, 2022 1 次提交
- Y
  
  update some trt log (#49330) · 02019804
  由 Yuanle Liu 提交于 12月 28, 2022
  
  02019804
20 12月, 2022 1 次提交
- J
  
  reset context clear profile_num (#49188) · d5366c47
  由 JingZhuangzhuang 提交于 12月 20, 2022
  
  d5366c47
15 12月, 2022 1 次提交
- Z
  
  Add a persistent ibuilder to speedup unit test (#48906) · 9bd20aa0
  由 zlsh80826 提交于 12月 15, 2022
  
  9bd20aa0
10 12月, 2022 1 次提交
- Z
  [Paddle-TRT] add cast between int64 tensor and Paddle-TRT (#45547) · fd373579
  由 zhoutianzi666 提交于 12月 10, 2022
```
* Add cast between int64 tensor and Paddle-TRT
* Add Unit testing.
```
  fd373579
05 12月, 2022 1 次提交
- X
  [Paddle Inference] Support range trt converter and add scalar interface. (#48697) · aee2db01
  由 xiaoxiaohehe001 提交于 12月 05, 2022
```
* add_range

* add_range
```
  aee2db01
14 11月, 2022 1 次提交
- X
  
  [Paddle Inference] Add where trt converter (#47820) · dac0f7dd
  由 xiaoxiaohehe001 提交于 11月 14, 2022
  
  dac0f7dd
12 10月, 2022 1 次提交
- Z
  
  [Paddle-TRT]support shape tensor is the input of trt-subgraph (#46482) · f2a778c9
  由 zhoutianzi666 提交于 10月 12, 2022
  
  f2a778c9
28 9月, 2022 1 次提交

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

由 Chen Weihang 提交于 9月 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

22 9月, 2022 1 次提交
- Y
  
  TensorRT engine context memory sharing (#45842) · 173b39bb
  由 Yuanle Liu 提交于 9月 22, 2022
  
  173b39bb
20 9月, 2022 1 次提交
- Z
  [Paddle-TRT] Full support for ops with persistable input (#45545) · 668ffd59
  由 zhoutianzi666 提交于 9月 20, 2022
```
* Move ITensor construction for Weight (persistable variable) from OpConvert to TensorRTEngine.
```
  668ffd59
29 8月, 2022 1 次提交
- Y
  
  TensorRT Engine context memory bind with predictor id (#45468) · 02621079
  由 Yuanle Liu 提交于 8月 29, 2022
  
  02621079
15 8月, 2022 1 次提交
- Y
  
  fused_embedding_eltwise_layernorm_op and skip_layernorm_op support fp16 (#44969) · ac0553a0
  由 Yuanle Liu 提交于 8月 15, 2022
  
  ac0553a0
05 8月, 2022 1 次提交

update trt workspace size param (#44469) · bdce552b

由 Zhang Jun 提交于 8月 05, 2022

* update trt workspace size param

* update

* update

* update

* use int64_t

* use int64_t

* upate

* update

bdce552b

01 8月, 2022 1 次提交
- W
  [Paddle Inference] add varlen_token_prune plugin, pass, convert (#44733) · 24187fcb
  由 Wangzheee 提交于 8月 01, 2022
```
* add varlen_token_prune plugin, pass, convert
```
  24187fcb
08 7月, 2022 1 次提交
- W
  
  Inference support mixed-precision model [3] (#44057) · 7f958728
  由 Wilber 提交于 7月 08, 2022
  
  7f958728
06 7月, 2022 1 次提交
- Z
  [Paddle-TRT] support inpus is weight (#44051) · 3fd6f09f
  由 zhoutianzi666 提交于 7月 06, 2022
```
* support inpus is weight
```
  3fd6f09f
01 7月, 2022 1 次提交
- Z
  [inference TRT]template GetWeightCPUData (#43993) · 76156d12
  由 zhoutianzi666 提交于 7月 01, 2022
```
* template GetWeightCPUData
```
  76156d12
26 6月, 2022 1 次提交
- S
  
  format all files in fluid using new config (#43776) · 576236a0
  由 Sing_chan 提交于 6月 26, 2022
  
  576236a0
13 6月, 2022 1 次提交
- Z
  
  add only split (#43424) · 30b10630
  由 zhoutianzi666 提交于 6月 13, 2022
  
  30b10630
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
02 6月, 2022 1 次提交
- W
  [Paddle-Inference] new general transformer inference support (#43077) · 2810dfea
  由 Wangzheee 提交于 6月 02, 2022
```
* new general transformer inference support
```
  2810dfea
02 4月, 2022 1 次提交
- W
  [Paddle inference] support new quant_model (#41049) · 1b58ce14
  由 Wangzheee 提交于 4月 02, 2022
```
* paddle inference support new quant_model
```
  1b58ce14
03 3月, 2022 1 次提交
- W
  EmbEltwiseLayernorm fix (#40015) · c3f3643b
  由 wenbin 提交于 3月 03, 2022
```
* emb fix

* fix trt6 compile

* fix half

* absolute error fix
```
  c3f3643b
11 2月, 2022 1 次提交
- L
  
  Add TensorRT inspector into Paddle-TRT (#38362) · 69793a27
  由 Leo Chen 提交于 2月 11, 2022
  
  69793a27
18 1月, 2022 1 次提交

[Unify Tensors PR ] Merged Tensor into DenseTensor, test=allcases (#38914) · 2052f1e3

由 Zhanlue Yang 提交于 1月 18, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Patched python level LoDTensor

* Merge Tensor into DenseTensor

* Fixed namespace issues,test=allcases

* Fixed merge issues

* Fixed inference issues

* Fixed NPU test issues

* Fixed merge issues

2052f1e3

17 1月, 2022 1 次提交
- W
  disable unsupported trt dimension (#38962) · 55e9087f
  由 wenbin 提交于 1月 17, 2022
```
* develop test

* throw

* ne

* wrong cnt
```
  55e9087f
13 1月, 2022 1 次提交
- W
  [Paddle-Inference] add Paddle Trt config: with_interleaved (#38884) · dccdc719
  由 Wangzheee 提交于 1月 13, 2022
```
* add Paddle Trt config: with_interleaved
```
  dccdc719

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功