提交 · 372ac08a171d76c745deaab0feed2d587798f734 · PaddlePaddle / Paddle

04 2月, 2021 1 次提交
- W
  use iwyu clean include second time, test=develop (#30829) · 35c5b23f
  由 wanghuancoder 提交于 2月 04, 2021
```
* use iwyu clean include second time, test=develop
```
  35c5b23f
25 1月, 2021 1 次提交

add DLA support：C++&&Python api (#30165) · ae0f88a9

由 Shang Zhizhou 提交于 1月 25, 2021

* add dla

* add dla done

* add python api
Co-authored-by: Nshangzhizhou <root@szth-rp-fanyi-opera49.szth.baidu.com>

ae0f88a9

19 1月, 2021 1 次提交
- L
  unify calling cudaSetDevice (#30470) · 81217a94
  由 Leo Chen 提交于 1月 19, 2021
```
* unify calling cudaSetDevice

* fix compile
```
  81217a94
24 9月, 2020 1 次提交

use iwyu clean include (#27267) · df43905f

由 wanghuancoder 提交于 9月 24, 2020

* use iwyu clean include, test=develop, test=win

* compilation error, test=develop

* fix compilation error2, test=develop

* fix compilation error3, test=develop

* fix compilation error4, test=develop

* fix compilation error5, test=develop

* fix compilation error6, test=develop

* fix compilation error7, test=develop

* fix compilation error8, test=develop

* fix compilation error8, test=develop

* fix compilation error10, test=develop

* fix compilation error11, test=develop

df43905f

14 9月, 2020 1 次提交
- P
  
  refine error message related to paddle-TRT (#27256) · aae41c6f
  由 Pei Yang 提交于 9月 14, 2020
  
  aae41c6f
31 8月, 2020 1 次提交
- P
  [Paddle-TRT] TRT dynamic shape support PaddleSlim quant models (#26536) · 78a530c2
  由 Pei Yang 提交于 8月 31, 2020
```
* support trt dynamic shape int8

* add unittest

* add support for sigmoid; adapt to trt6+ api
```
  78a530c2
28 7月, 2020 1 次提交
- P
  
  add macro check for using TRT api dynamicRangeIsSet() (#25694) · eef98b7f
  由 Pei Yang 提交于 7月 28, 2020
  
  eef98b7f
23 6月, 2020 1 次提交

[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097) · b2f5a149

由 Pei Yang 提交于 6月 23, 2020

* Paddle-TensorRT support slim QAT. test=develop

* add comments. test=develop

* use RenameInput instead of ResetInputs. test=develop

b2f5a149

15 6月, 2020 1 次提交

bugfix for unique_ptr of IOptimizationProfile (#23917) · bef4afa6

由 Jeng Bai-Cheng 提交于 6月 15, 2020

This commit fixs the compiling bug regarding unique_ptr of IOptimizationProfile.

IOptimizationProfile has protected dtor and is controlled by TensorRT
internally. Application shouldn't delete the pointer of IOptimizationProfile.
See TensorRT document: https://docs.nvidia.com/deeplearning/sdk/tensorrt-api/c_api/classnvinfer1_1_1_i_builder.html#a9ac47e100454151d8206ac91d543299a
test=develop

bef4afa6

01 4月, 2020 1 次提交
- Z
  add swish split gelu plugin dynamic support (#23305) · 1a6ce8b9
  由 Zhaolong Xing 提交于 4月 01, 2020
```
test=develop
```
  1a6ce8b9
26 3月, 2020 1 次提交

[Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099

由 Zhaolong Xing 提交于 3月 26, 2020

* add dynamic plugin support.
test=develop

* change emb eltwise layernorm to math function
test=develop

* add emb eltwise layernorm
test=develop

* can run dynamic shape ernie
test=develop

* fix ci
test=develop

* add ut for trt ernie dynamic

test=develop

* refine dynamic shape c++ interface.
test=develop

* fix comments
test=develop

* fix comments
test=develop

430b0099

09 3月, 2020 1 次提交

[Paddle-TRT] : (Part1) Dynamic shape support (#22868) · dd67d44a

由 Zhaolong Xing 提交于 3月 09, 2020

* change the ci trt from version 5. to 6.0

* paddle-trt dynamic shape support init

* conv+bias or conv+bn dynamic shape support
test=develop

* modity trt engine opconvert
test=develop

* fix ci error
test=develop

dd67d44a

23 2月, 2020 1 次提交
- T
  
  fix typo words (#22653) · d2ba91aa
  由 tianshuo78520a 提交于 2月 23, 2020
  
  d2ba91aa
05 2月, 2020 1 次提交
- Z
  [Fix BUG]: Core when multi thread + clone + paddle-trt (#22442) · ceda0b9b
  由 Zhaolong Xing 提交于 2月 05, 2020
```
* add mutex for trt engine
test=develop

* add the test for copy_to_cpu
test=develop
```
  ceda0b9b
20 11月, 2019 1 次提交
- P
  fix trt weight bug (#21231) · 2e2f92a5
  由 Pei Yang 提交于 11月 20, 2019
```
added splitter "__" between weight name and suffix number to avoid conflicts.
```
  2e2f92a5
18 11月, 2019 1 次提交
- Z
  TRT int8: refine trt int8 for dynamic range set (#21112) · 65f70525
  由 Zhaolong Xing 提交于 11月 18, 2019
```
* refine trt int8 for dynamic range set
test=develop

* refine trt int8
test=develop
```
  65f70525
21 9月, 2019 1 次提交
- P
  Fix BUGS: paddle-TRT repeatedly sets weight_map and overdeletes repetitive_params (#19825) · 74812d1c
  由 Pei Yang 提交于 9月 21, 2019
```
* fix trt bugs when sharing params, test=develop

* add unittest for cascade_rcnn
```
  74812d1c
20 9月, 2019 1 次提交
- 石
  
  fix multi-thread exec of trt, test=develop (#19338) · d004a0f5
  由石晓伟提交于 9月 20, 2019
  
  d004a0f5
19 8月, 2019 1 次提交

Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213) · 76c95af0

由 Zhaolong Xing 提交于 8月 19, 2019

* fix mask rcnn bug:
1. affine channel fuse (diff)
2. condition block op (memory leak)
3. merge lod tensor op (diff)
4. memroy optim (diff)
test=develop

* fix ci aboud PADDLE_ENFOCE
fix merge lod infer op ut
test=develop

76c95af0

02 8月, 2019 1 次提交

Fix the CE error which caused by paddle-trt version (#18941) · 3816d221

由 Zhaolong Xing 提交于 8月 02, 2019

* Fix Mask rcnn predictor
    1. refine memory optim algorithm to support the model with the block op.
    2. output diff : modify the affine channel fuse
    3. add condition_block_infer op
add interface for setting trt calib table dir
test=develop

* add the missing files.
test=develop

* 1 add trt fp16 support
test=develop

* fix trt fp16 ce error
test=develop

* add an vlog if the user use trt4 and specify fp16.
test=develop

3816d221

31 7月, 2019 1 次提交

Trt fp16 support (#18860) · 61238d31

由 Zhaolong Xing 提交于 7月 31, 2019

* Fix Mask rcnn predictor
    1. refine memory optim algorithm to support the model with the block op.
    2. output diff : modify the affine channel fuse
    3. add condition_block_infer op
add interface for setting trt calib table dir
test=develop

* add the missing files.
test=develop

* 1 add trt fp16 support
test=develop

61238d31

25 5月, 2019 1 次提交

TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc

由 Zhaolong Xing 提交于 5月 25, 2019

* fluid int8 train and trt int8 predict align.
trt int8 predict init
op converter

* 2. align fluid int8 train and trt int8 inference.
enhance quant dequant fuse pass
enhance op converter, trt engine, trt engine op, trt subgraph pass.

* 3. add delete_quant_dequant_pass for trt

test=develop

* 4. add the missing file
test=develop

* 5. i modify the c++ interface, but forget to modify the pybind code
fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
test=develop

61221ebc

08 3月, 2019 3 次提交
- N
  5. add static trt load model · f3d164fa
  由 nhzlx 提交于 2月 22, 2019
```
1). add static trt load model
2). fix bug: when device_id is not 0, the trt will have a bug
test=develop
```
  f3d164fa
- N
  
  2. TRTEngine using stream only when execute. · 8c171902
  由 nhzlx 提交于 2月 14, 2019
  
  8c171902
- N
  add static model load for trt · 88c24baa
  由 nhzlx 提交于 2月 14, 2019
```
1. bind trt input and output to fluid tensors
```
  88c24baa
22 2月, 2019 1 次提交

5. add static trt load model · 1d5ef7c9

由 nhzlx 提交于 2月 22, 2019

1). add static trt load model
2). fix bug: when device_id is not 0, the trt will have a bug
test=develop

1d5ef7c9

14 2月, 2019 2 次提交
- N
  
  2. TRTEngine using stream only when execute. · 9cc6249c
  由 nhzlx 提交于 2月 14, 2019
  
  9cc6249c
- N
  add static model load for trt · 034ba1c2
  由 nhzlx 提交于 2月 14, 2019
```
1. bind trt input and output to fluid tensors
```
  034ba1c2
22 1月, 2019 1 次提交

fix trt stream bug. · ec213730

由 nhzlx 提交于 1月 22, 2019

BUG: After continuing to input different data, the output cannot be aligned
test=develop

ec213730

16 1月, 2019 1 次提交
- N
  add trt int8 calibration support · 312fe0ec
  由 nhzlx 提交于 1月 16, 2019
```
fix comments

test=develop
```
  312fe0ec
09 1月, 2019 1 次提交
- N
  add trt int8 support · 4e3522e5
  由 nhzlx 提交于 1月 09, 2019
```
test=develop
```
  4e3522e5
20 11月, 2018 1 次提交
- Y
  Implement the Tensorrt plugin for elementwise op (#14487) · 8bc1c5d2
  由 Yiqun Liu 提交于 11月 20, 2018
```
* Initialize the elementwise plugin.

* Implement the basic CUDA kernel of elementwise plugin.
test=develop
```
  8bc1c5d2
16 11月, 2018 1 次提交
- H
  
  Complete PRelu plugin and Conv2d transpose op converter · 21f33b42
  由 hjchen2 提交于 11月 15, 2018
  
  21f33b42
14 11月, 2018 1 次提交
- Y
  
  Combine Inference Analysis with IR (#13914) · 9f252e00
  由 Yan Chunwei 提交于 11月 14, 2018
  
  9f252e00
13 11月, 2018 2 次提交
- N
  fix comments · 0b962680
  由 nhzlx 提交于 11月 13, 2018
```
test=develop
```
  0b962680
- N
  
  add plugin support and offer an simple split sample · d38fd6a0
  由 nhzlx 提交于 11月 13, 2018
  
  d38fd6a0
06 11月, 2018 1 次提交
- N
  
  fix comments and fix bug · 86b99ac9
  由 nhzlx 提交于 11月 06, 2018
  
  86b99ac9
17 8月, 2018 1 次提交
- N
  
  1. change tensorrt op from cpu to gpu · 1600ba86
  由 nhzlx 提交于 8月 17, 2018
  
  1600ba86
24 7月, 2018 1 次提交
- N
  
  add assert for GetOutput · bcd67bdd
  由 nhzlx 提交于 7月 24, 2018
  
  bcd67bdd
23 7月, 2018 1 次提交
- N
  
  there is no batchsize concept in tensorrt's tensor · 2372daff
  由 nhzlx 提交于 7月 23, 2018
  
  2372daff

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功