Commits · 4a09da02441a1b0c2afd83d3cdc83aa57e9040ad · 机器未来 / Paddle

04 Mar, 2022 1 commit

[paddle-inference]support setting fully connected in multi-head attention... · 8dbfc2ae

Committed by ceci3 on Mar 04, 2022

[paddle-inference]support setting fully connected in multi-head attention static shape branch to int8 (#39660)

* fix inference int

* update

* add unittest

8dbfc2ae

11 Feb, 2022 1 commit

[Paddle Inference] support ernie quant model with interleaved (#39424) · 1c44d3e2

Committed by Wangzheee on Feb 11, 2022

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

1c44d3e2

24 Sep, 2021 1 commit

add multihead_matmul trt converter test case (#36023) · fcaa64b3

Committed by baoachun on Sep 24, 2021

* add multihead_matmul trt converter test case

* move attribute check to op_teller

fcaa64b3

07 Sep, 2021 1 commit

fix int8 (#35504) · ed97be09

Committed by ceci3 on Sep 07, 2021

ed97be09

30 Aug, 2021 1 commit

[paddle-TRT]support matmul set to int8 in multihead (#34917) · 0043fa8c

Committed by ceci3 on Aug 30, 2021

* update ernie int8

0043fa8c

17 Jun, 2021 1 commit

[Inference Tensorrt] Add attr for trt engine and handle the input seq problem... · 67bec55c

Committed by Wilber on Jun 17, 2021

[Inference Tensorrt] Add attr for trt engine and handle the input seq problem for ernie var len. (#33575)

67bec55c

16 Apr, 2021 1 commit

support ernie trt-int8 for inference (#32232) · 6da043eb

Committed by ceci3 on Apr 16, 2021

* support ernie trt-int8 for inference

* fix reshape

6da043eb

02 Apr, 2021 1 commit

update trt engine addplugin name. (#32018) · d9187869

Committed by Wilber on Apr 02, 2021

* update trt engine addplugin name.

* update

d9187869

23 Mar, 2021 1 commit

fix tensorrt output varible reshape (#31733) · 9d04ef73

Committed by Shang Zhizhou on Mar 23, 2021

* fix tensorrt output varible reshape

* move padding shape x 1 x 1 in ernie to qkv and fc

* update layer name

* fix softmax when input is dynamic, fc not padding any more

* fix varlen

* move fc x_dim assert to op_teller

9d04ef73

10 Mar, 2021 1 commit

fix ernie_varlen when cutting head (#31497) · f57739be

Committed by Shang Zhizhou on Mar 10, 2021

f57739be

27 Nov, 2020 1 commit

detect tensorRT plugin fp16 in runtime (#27933) · b9e76a01

Committed by Shang Zhizhou on Nov 27, 2020

* remove -DSUPPORTS_CUDA_FP16 in cuda.cmake

* comile with cuda9

* add some unittest

* notest;test=coverage

* add unittest for trt plugin swish && split

* update ernie unittest

* fix some error message

* remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter

* fix comile errror when CUDA_ARCH_NAME < Pascal"

* fix comile error

* update unittest timeout

* compile with cuda9

* update error msg

* fix code style

* add some comments

* add define IF_CUDA_ARCH_SUPPORT_FP16

* rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED

b9e76a01

03 Nov, 2020 1 commit

Optimize ernie model inference performance in TensorRT and support variable-length input (#28367) · ea851796

Committed by Shang Zhizhou on Nov 03, 2020

* fp16 result ok

* change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS

* auto detect special slice op converter for ernie with trt oss

* ernie oss only support fp16

* fix special_slice_plugin serialize bug

* matmul in tensorrt ok

* ernie unittest ok

* add matmul tensorrt unittest

* remove demo code

ea851796

11 May, 2020 1 commit

Add macro BOOST_GET to enrich the error information of boost::get (#24175) · aa0f254f

Committed by Chen Weihang on May 11, 2020

* add new macro BOOST_GET_SAFELY & unittests, test=develop

* add different macro type, test=develop

* fix get macro type in executor, test=develop

* four macro part change backup

* using one macro for all case, test=develop

* revert attribute change, test=develop

* change to three func to solve gcc4.8 bug, test=develop

* polish some details, test=develop

aa0f254f

26 Mar, 2020 1 commit

[Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099

Committed by Zhaolong Xing on Mar 26, 2020

* add dynamic plugin support.
test=develop

* change emb eltwise layernorm to math function
test=develop

* add emb eltwise layernorm
test=develop

* can run dynamic shape ernie
test=develop

* fix ci
test=develop

* add ut for trt ernie dynamic

test=develop

* refine dynamic shape c++ interface.
test=develop

* fix comments
test=develop

* fix comments
test=develop

430b0099

06 Jan, 2020 1 commit

Add TRT support for BERT (#21135) · 0a51098a

Committed by Pei Yang on Jan 06, 2020

* add gelu plugin

* align trt bert with gpu

* add support for fused fc with relu,

* add unittest for bert trt

0a51098a
