提交 · 6b10c0e5dc83113a1102984f0bfd7edddf121db9 · PaddlePaddle / Paddle

07 8月, 2023 1 次提交

[Inference] save_optimized_model_pass support tensorrt (#55893) · 6b10c0e5

由 Yuanle Liu 提交于 8月 07, 2023

* fix cudnn 8.7+ bug on cudnnConvolutionBiasActivationForward

* save_optimized_model_pass support tensorrt

* update

* update

* fix compile

* update

* fix ut timeout

6b10c0e5

11 10月, 2022 1 次提交
- C
  Remove LoDTensor using in fluid (Part 1) (#46663) · 940d8f25
  由 Chen Weihang 提交于 10月 11, 2022
```
* remove using lodtensor part1

* polish history code format
```
  940d8f25
05 8月, 2022 1 次提交

update trt workspace size param (#44469) · bdce552b

由 Zhang Jun 提交于 8月 05, 2022

* update trt workspace size param

* update

* update

* update

* use int64_t

* use int64_t

* upate

* update

bdce552b

01 8月, 2022 1 次提交

unify gpu context (#44740) · 86763023

由 Leo Chen 提交于 8月 01, 2022

* remove cudaDeviceContext

* remove more template

* fix rocm compile

* remove alias name CUDADeviceContext

* fix compile

* fix tests

* revert changes

86763023

26 6月, 2022 1 次提交
- S
  
  format all files in fluid using new config (#43776) · 576236a0
  由 Sing_chan 提交于 6月 26, 2022
  
  576236a0
20 2月, 2022 1 次提交

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

由 Chen Weihang 提交于 2月 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

19 2月, 2022 1 次提交

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

由 Aurelius84 提交于 2月 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile sucessfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix tesst file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

2fe04264

18 1月, 2022 1 次提交

[Unify Tensors PR #8] Merged Tensor into DenseTensor, test=allcases (#38914) · 2052f1e3

由 Zhanlue Yang 提交于 1月 18, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Patched python level LoDTensor

* Merge Tensor into DenseTensor

* Fixed namespace issues,test=allcases

* Fixed merge issues

* Fixed inference issues

* Fixed NPU test issues

* Fixed merge issues

2052f1e3

15 9月, 2020 1 次提交

Optimize error report (#27254) · e6e2e537

由 Shang Zhizhou 提交于 9月 15, 2020

* optimize errror report

* add test case for pad op converter

* fix some spelling mistake commented by peiyang

e6e2e537

23 2月, 2020 1 次提交
- T
  
  fix typo words (#22653) · d2ba91aa
  由 tianshuo78520a 提交于 2月 23, 2020
  
  d2ba91aa
31 7月, 2019 1 次提交

Trt fp16 support (#18860) · 61238d31

由 Zhaolong Xing 提交于 7月 31, 2019

* Fix Mask rcnn predictor
    1. refine memory optim algorithm to support the model with the block op.
    2. output diff : modify the affine channel fuse
    3. add condition_block_infer op
add interface for setting trt calib table dir
test=develop

* add the missing files.
test=develop

* 1 add trt fp16 support
test=develop

61238d31

25 5月, 2019 1 次提交

TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc

由 Zhaolong Xing 提交于 5月 25, 2019

* fluid int8 train and trt int8 predict align.
trt int8 predict init
op converter

* 2. align fluid int8 train and trt int8 inference.
enhance quant dequant fuse pass
enhance op converter, trt engine, trt engine op, trt subgraph pass.

* 3. add delete_quant_dequant_pass for trt

test=develop

* 4. add the missing file
test=develop

* 5. i modify the c++ interface, but forget to modify the pybind code
fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
test=develop

61221ebc

23 5月, 2019 1 次提交
- Z
  fix trt ci bug temporary. (#17565) · 38da1030
  由 Zhaolong Xing 提交于 5月 23, 2019
```
ban all trt ut. will fix it later.

test=develop
```
  38da1030
08 3月, 2019 4 次提交
- N
  fix comments and fix cpplint · 4b59646e
  由 nhzlx 提交于 2月 27, 2019
```
test=develop
```
  4b59646e
- N
  5. add static trt load model · f3d164fa
  由 nhzlx 提交于 2月 22, 2019
```
1). add static trt load model
2). fix bug: when device_id is not 0, the trt will have a bug
test=develop
```
  f3d164fa
- N
  
  2. TRTEngine using stream only when execute. · 8c171902
  由 nhzlx 提交于 2月 14, 2019
  
  8c171902
- N
  add static model load for trt · 88c24baa
  由 nhzlx 提交于 2月 14, 2019
```
1. bind trt input and output to fluid tensors
```
  88c24baa
27 2月, 2019 1 次提交
- N
  fix comments and fix cpplint · 06a088a1
  由 nhzlx 提交于 2月 27, 2019
```
test=develop
```
  06a088a1
22 2月, 2019 1 次提交

5. add static trt load model · 1d5ef7c9

由 nhzlx 提交于 2月 22, 2019

1). add static trt load model
2). fix bug: when device_id is not 0, the trt will have a bug
test=develop

1d5ef7c9

14 2月, 2019 2 次提交
- N
  
  2. TRTEngine using stream only when execute. · 9cc6249c
  由 nhzlx 提交于 2月 14, 2019
  
  9cc6249c
- N
  add static model load for trt · 034ba1c2
  由 nhzlx 提交于 2月 14, 2019
```
1. bind trt input and output to fluid tensors
```
  034ba1c2
22 1月, 2019 1 次提交

fix trt stream bug. · ec213730

由 nhzlx 提交于 1月 22, 2019

BUG: After continuing to input different data, the output cannot be aligned
test=develop

ec213730

20 11月, 2018 1 次提交
- Y
  Implement the Tensorrt plugin for elementwise op (#14487) · 8bc1c5d2
  由 Yiqun Liu 提交于 11月 20, 2018
```
* Initialize the elementwise plugin.

* Implement the basic CUDA kernel of elementwise plugin.
test=develop
```
  8bc1c5d2
21 8月, 2018 1 次提交
- N
  
  add comments for execute in ut_helper · c6a5c4b0
  由 nhzlx 提交于 8月 21, 2018
  
  c6a5c4b0
20 8月, 2018 1 次提交
- N
  
  fix comments · 1bf9d9e9
  由 nhzlx 提交于 8月 20, 2018
  
  1bf9d9e9
18 8月, 2018 1 次提交
- N
  
  add batch norm op converter · 144b20c1
  由 nhzlx 提交于 8月 18, 2018
  
  144b20c1
17 8月, 2018 1 次提交
- N
  
  1. change tensorrt op from cpu to gpu · 1600ba86
  由 nhzlx 提交于 8月 17, 2018
  
  1600ba86
09 8月, 2018 1 次提交
- N
  
  add softmax op converter · 641f32da
  由 nhzlx 提交于 8月 09, 2018
  
  641f32da
01 8月, 2018 1 次提交
- N
  
  increase the test batch · 64a08f84
  由 nhzlx 提交于 8月 01, 2018
  
  64a08f84
26 7月, 2018 1 次提交
- N
  
  fix a bug · 4f71a3b1
  由 nhzlx 提交于 7月 26, 2018
  
  4f71a3b1
25 7月, 2018 2 次提交
- N
  
  fix comments · 55334007
  由 nhzlx 提交于 7月 25, 2018
  
  55334007
- N
  
  1. support mutil batch utest 2. support pool op · 01566fb6
  由 nhzlx 提交于 7月 25, 2018
  
  01566fb6
24 7月, 2018 1 次提交
- N
  
  add assert for GetOutput · bcd67bdd
  由 nhzlx 提交于 7月 24, 2018
  
  bcd67bdd
23 7月, 2018 1 次提交
- N
  
  1. we delelte mul op, 2.modify fc and action op 3. modify the test inferface · 82527696
  由 nhzlx 提交于 7月 23, 2018
  
  82527696
03 7月, 2018 2 次提交
- X
  
  fix · d70a38d8
  由 Xin Pan 提交于 7月 03, 2018
  
  d70a38d8
- X
  
  hide utils to legacy · 94cb59ad
  由 Xin Pan 提交于 7月 03, 2018
  
  94cb59ad
11 6月, 2018 1 次提交
- G
  
  Add brpc surpport. (#11263) · d9de6b86
  由 gongweibao 提交于 6月 11, 2018
  
  d9de6b86
08 6月, 2018 1 次提交
- Y
  
  loose threshold of TRT for CI in different model (#11305) · 145aaa4b
  由 Yan Chunwei 提交于 6月 08, 2018
  
  145aaa4b
07 6月, 2018 1 次提交
- Y
  
  feature/trt engine op test (#11182) · 4f95bc94
  由 Yan Chunwei 提交于 6月 07, 2018
  
  4f95bc94
01 6月, 2018 1 次提交
- Y
  
  feature/add TRT fc converter (#11043) · 0c0c5df4
  由 Yan Chunwei 提交于 6月 01, 2018
  
  0c0c5df4

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功