Commits · 35c5b23f68bb4259ac8153fd85e650a11a3d5e24 · PaddlePaddle / Paddle

04 Feb, 2021 · 1 commit

use iwyu clean include second time, test=develop (#30829) · 35c5b23f

Committed by wanghuancoder on Feb 04, 2021

* use iwyu clean include second time, test=develop

35c5b23f

13 Jan, 2021 · 1 commit

Added support for inference using quantization aware trained dygraph (#30288) · 7bbf3ac5

Committed by alncat on Jan 13, 2021

* added support for inference using quantization aware trained dygraph

* added support for inference using quantization aware trained dygraph
correct boost get usage

* Delete incorrect warning message (#30196)

* fix warning and no grad

* clean redundant API alias in 2.0 - part 2 (#30013)

* delete paddle.nn.functional.assign

* fix dynamic to static error

* just add the op error message for the matmul xpu (#30246)

 add the op error message for the matmul xpu

* Add Static Variable Clone (#30208)

Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat

* use wget to replace curl to download the lcov file (#30229)

* use wget to replace curl to download the lcov file

* add cache for lcov

* fix test_pool3d_op timeout issue (#30248)

* Fix unittests bugs. (#30250)

* modify error message based on comments (#30189)

* modify error message based on comments

* edit code according to review.

* Correct spelling according to review.

* Fix bug for 'save multiple method' (#30218)

* Fix bug for 'save multiple method'

* To pass coverage.

* edit code to pass coverage.

* edit code to pass coverage.

* add unittest for coverage.

* change for coverage.

* edit for coverage.

* added support for inference using quantization aware trained dygraph

* Alias from  paddle.fluid.layers.auc to paddle.static.auc (#30206)

* add alias from  fluid.layers.auc to static.auc

* Update __init__.py

* added support for inference using quantization aware trained dygraph
correct boost get usage

* corrected boost get usage

* corrected naming issues and enforcing zero check

* correct paddle enforce message

* added more error checkings

* corrected error report message and optimized code

* corrected findvar usage

* corrected paddle_enforce in scope

* correct error messages

* correct error reporting format
Co-authored-by: LielinJiang <50691816+LielinJiang@users.noreply.github.com>
Co-authored-by: XiaoguangHu <46782768+XiaoguangHu01@users.noreply.github.com>
Co-authored-by: wawltor <fangzeyang0904@hotmail.com>
Co-authored-by: Huihuang Zheng <zhhsplendid@gmail.com>
Co-authored-by: YUNSHEN XIE <1084314248@qq.com>
Co-authored-by: Bai Yifan <me@ethanbai.com>
Co-authored-by: gongweibao <weibao.gong@gmail.com>
Co-authored-by: WeiXin <weixin10@baidu.com>
Co-authored-by: Jiaqi Liu <liujiaqi06@baidu.com>

7bbf3ac5

07 Dec, 2020 · 1 commit

support clip op trt converter (#29411) · f860de4a

Committed by Pei Yang on Dec 07, 2020

f860de4a

03 Nov, 2020 · 1 commit

Optimize ernie model inference performance in TensorRT and support variable-length input (#28367) · ea851796

Committed by Shang Zhizhou on Nov 03, 2020

* fp16 result ok

* change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS

* auto detect special slice op converter for ernie with trt oss

* ernie oss only support fp16

* fix special_slice_plugin serialize bug

* matmul in tensorrt ok

* ernie unittest ok

* add matmul tensorrt unittest

* remove demo code

ea851796

28 Sep, 2020 · 1 commit

Add unittests and OP version registry for tensorrt_subgraph_pass (#27544) · ae6e40a7

Committed by Pei Yang on Sep 28, 2020

* add unittests and op version register for tensorrt_subgraph_pass

* rename to test_trt_subgraph_pass.py

* fix softmax converter diff when padding dim=1

ae6e40a7

24 Sep, 2020 · 1 commit

use iwyu clean include (#27267) · df43905f

Committed by wanghuancoder on Sep 24, 2020

* use iwyu clean include, test=develop, test=win

* compilation error, test=develop

* fix compilation error2, test=develop

* fix compilation error3, test=develop

* fix compilation error4, test=develop

* fix compilation error5, test=develop

* fix compilation error6, test=develop

* fix compilation error7, test=develop

* fix compilation error8, test=develop

* fix compilation error8, test=develop

* fix compilation error10, test=develop

* fix compilation error11, test=develop

df43905f

15 Sep, 2020 · 1 commit

Optimize slice trt plugin (#26970) · 47fdc60e

Committed by Shang Zhizhou on Sep 15, 2020

* optimize slice TRT plugin

This patch removes an unnecessary barrier on the transfer of the required offsets,
so the data transfer can overlap with GPU kernel execution.

This patch also fixes the incorrect name of the slice plugin by replacing
"layernorm" with "slice".

test=develop

* add serialize/deserialize to slice plugin

* add static shape slice trt plugin

* fix slice trt op converter dynamic shape bug

* fix format by clang-format

* fix pylint format error

* fix problems commented by peiyang
Co-authored-by: Ryan Jeng <rjeng@nvidia.com>

47fdc60e

01 Sep, 2020 · 1 commit

[Paddle-TRT] Stack op plugin (#25605) · ad6e3dd6

Committed by zlsh80826 on Sep 01, 2020

* add stack_op to CMakeLists

* add dim=3 support for scale op

* add trt stack op, test=develop

* remove debug message

* add stack plugin serialize

* remove slice, scale op, will add later

* enhance error message

* revise trt ernie test to cover the stack op CI test, test=develop

* add stack op serialization

* fix test shape after adding stack op

* remove slice op, will add after implementing serialization

* roll back to min_graph=5 to avoid using slice op

* fix scale op output layer

* implement stack op createPlugin

* use workspace and move the definition to .cu

* move stack plugin creator definition to .cu, test=develop

ad6e3dd6

31 Aug, 2020 · 1 commit

[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models (#26536) · 78a530c2

Committed by Pei Yang on Aug 31, 2020

* support trt dynamic shape int8

* add unittest

* add support for sigmoid; adapt to trt6+ api

78a530c2

21 Aug, 2020 · 1 commit

add output scale and trt op teller support for hard_swish and hard_sigmoid (#26499) · 379222c3

Committed by Pei Yang on Aug 21, 2020

379222c3

03 Aug, 2020 · 1 commit

add trt int8 support for elementwise_mul and scale (#25676) · 9e9a569d

Committed by Pei Yang on Aug 03, 2020

9e9a569d

23 Jun, 2020 · 1 commit

[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097) · b2f5a149

Committed by Pei Yang on Jun 23, 2020

* Paddle-TensorRT support slim QAT. test=develop

* add comments. test=develop

* use RenameInput instead of ResetInputs. test=develop

b2f5a149

15 May, 2020 · 1 commit

fix bert bug using trt6 when compile with CUDA_ARCH_NAME=All (#24517) · f68d4fb3

Committed by Zhaolong Xing on May 15, 2020

test=develop

f68d4fb3

11 May, 2020 · 1 commit

Add macro BOOST_GET to enrich the error information of boost::get (#24175) · aa0f254f

Committed by Chen Weihang on May 11, 2020

* add new macro BOOST_GET_SAFELY & unittests, test=develop

* add different macro type, test=develop

* fix get macro type in executor, test=develop

* four macro part change backup

* using one macro for all case, test=develop

* revert attribute change, test=develop

* change to three func to solve gcc4.8 bug, test=develop

* polish some details, test=develop

aa0f254f

19 Apr, 2020 · 1 commit

[Ernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723) · 133f1fc1

Committed by Zhaolong Xing on Apr 19, 2020

* refine ernie trt dynamic shape support
1. add slice op converter
2. add emb eltwise layernorm fp16 support
test=develop

* fix dynamic shape test ut
test=develop

* fix comments.
test=develop

* fix comments
test=develop

133f1fc1

14 Apr, 2020 · 1 commit

[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23672) · c528f1d4

Committed by Pei Yang on Apr 14, 2020

* add hard_sigmoid trt op converter

* add hard_swish op converter and plugin. test=develop

* add macro to adapt lower trt version. test=develop

c528f1d4

12 Apr, 2020 · 1 commit

[Paddle-TRT]: add eltwise,pool2d, prelu, scale, concat, gelu dynamic shape support (#23396) · 3acb047a

Committed by Zhaolong Xing on Apr 12, 2020

* add elementwise pool2d, prelu, shuffle channel
test=develop

* add scale and refine concat eltwise converter
test=develop

* refine elementwise converter
test=develop

* refine ut test and enforce error.
test=develop

* modify const cast
test=develop

3acb047a

08 Apr, 2020 · 2 commits

Revert "[Paddle-TRT] Add hard_sigmoid and hard_swish support(support... · 3d5d2170

Committed by Pei Yang on Apr 08, 2020

Revert "[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)", test=develop (#23642)

This reverts commit cdc6d4e2.

3d5d2170

[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536) · cdc6d4e2

Committed by Pei Yang on Apr 08, 2020

* add hard_sigmoid trt op converter

* add hard_swish op converter and plugin. test=develop

cdc6d4e2

26 Mar, 2020 · 1 commit

[Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099

Committed by Zhaolong Xing on Mar 26, 2020

* add dynamic plugin support.
test=develop

* change emb eltwise layernorm to math function
test=develop

* add emb eltwise layernorm
test=develop

* can run dynamic shape ernie
test=develop

* fix ci
test=develop

* add ut for trt ernie dynamic

test=develop

* refine dynamic shape c++ interface.
test=develop

* fix comments
test=develop

* fix comments
test=develop

430b0099

10 Feb, 2020 · 1 commit

[Refine Paddle-TRT INT8]: Support PaddleSlim's Resnet50, Mobilenetv1, Yolov3... · 54a325a5

Committed by Zhaolong Xing on Feb 10, 2020

[Refine Paddle-TRT INT8]: Support PaddleSlim's Resnet50, Mobilenetv1, Yolov3 models for Inference. (#22483)

* add int8 op teller for trt.

* refine trt int8

* add int8 op teller for trt.
test=develop

54a325a5

07 Jan, 2020 · 1 commit

add TRT support for instance_norm op (#21928) · 50bee83f

Committed by Pei Yang on Jan 07, 2020

* add TRT support for instance_norm op

50bee83f

06 Jan, 2020 · 1 commit

Add TRT support for BERT (#21135) · 0a51098a

Committed by Pei Yang on Jan 06, 2020

* add gelu plugin

* align trt bert with gpu

* add support for fused fc with relu,

* add unittest for bert trt

0a51098a

04 Dec, 2019 · 1 commit

add conv, depthwise_conv, pooling (#20966) · da7748c5

Committed by Zhaolong Xing on Dec 04, 2019

test=develop

da7748c5

18 Nov, 2019 · 1 commit

TRT int8: refine trt int8 for dynamic range set (#21112) · 65f70525

Committed by Zhaolong Xing on Nov 18, 2019

* refine trt int8 for dynamic range set
test=develop

* refine trt int8
test=develop

65f70525

24 Jul, 2019 · 1 commit

Update trt5 for paddle-trt (#18645) · 26ae6d49

Committed by Zhaolong Xing on Jul 24, 2019

* update paddle-trt for:
    1. fix bug: when batch > 2, core in split plugin.
    2. add leaky_relu trt5.0 support (yolov3 from 65ms to 42ms.)
    3. add new attr to dropout.
    4. shuffle channel, swish, relu6 support
    test=develop

* 1. fix ci
test=develop

26ae6d49

06 Jun, 2019 · 1 commit

fix: when use the load model from memory mode, the RAM occupy is high (#17788) · ae576f3c

Committed by Zhaolong Xing on Jun 06, 2019

test=develop

ae576f3c

25 May, 2019 · 1 commit

TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc

Committed by Zhaolong Xing on May 25, 2019

* fluid int8 train and trt int8 predict align.
trt int8 predict init
op converter

* 2. align fluid int8 train and trt int8 inference.
enhance quant dequant fuse pass
enhance op converter, trt engine, trt engine op, trt subgraph pass.

* 3. add delete_quant_dequant_pass for trt

test=develop

* 4. add the missing file
test=develop

* 5. i modify the c++ interface, but forget to modify the pybind code
fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
test=develop

61221ebc

07 Jan, 2019 · 1 commit

refactor tensorrt node teller (#15181) · 6ccf8685

Committed by Yan Chunwei on Jan 07, 2019

6ccf8685

PaddlePaddle / Paddle · synced successfully about 1 year ago
