Commits · ba82757e69147b521d953383f3efe0852b8052b2 · 机器未来 / Paddle

24 Mar, 2021 · 1 commit

fix ernie fc shape error (#31803) · ba82757e

Committed by Shang Zhizhou on Mar 24, 2021

* fix conflict

* fix compile error

* cherry-pick #31316

* Refine cudnn softmax (#25757)

* refine cudnn softmax

* Trt elementwise plugin serialize (#31587)

* add serialize unittest

* fix element_op trt plugin serialize bug

* remove PassVersionChecker.IsCompatible

* fix unittest
Co-authored-by: Pei Yang <peiyang@baidu.com>
Co-authored-by: GaoWei8 <53294385+GaoWei8@users.noreply.github.com>

07 Dec, 2020 · 1 commit

cherry-pick PR #27933 (#29377) · 9a6ecb03

Committed by Shang Zhizhou on Dec 07, 2020

* cherry-pick PR #27933

* fix: cuda version is in variable CUDA_VERSION in 1.8 cuda.cmake

* close unittest failed temporarily

* cherry-pick PR #27544, fix layer_norm and softmax bug in tensorRT

10 Feb, 2020 · 1 commit

[Refine Paddle-TRT INT8]: Support PaddleSlim's Resnet50, Mobilenetv1, Yolov3... · 54a325a5

Committed by Zhaolong Xing on Feb 10, 2020

[Refine Paddle-TRT INT8]: Support PaddleSlim's Resnet50, Mobilenetv1, Yolov3 models for Inference. (#22483)

* add int8 op teller for trt.

* refine trt int8

* add int8 op teller for trt.
test=develop

25 May, 2019 · 1 commit

TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc

Committed by Zhaolong Xing on May 25, 2019

* fluid int8 train and trt int8 predict align.
trt int8 predict init
op converter

* 2. align fluid int8 train and trt int8 inference.
enhance quant dequant fuse pass
enhance op converter, trt engine, trt engine op, trt subgraph pass.

* 3. add delete_quant_dequant_pass for trt

test=develop

* 4. add the missing file
test=develop

* 5. i modify the c++ interface, but forget to modify the pybind code
fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
test=develop

12 Nov, 2018 · 1 commit
- add serial to trt test and do not print log for unused trt logs · d6ff0069
  Committed by nhzlx on Nov 12, 2018
08 Nov, 2018 · 1 commit
- Change the origin VLOG level to 10 times · 0c3227a5
  Committed by minqiyang on Nov 08, 2018
```
Fix code to support cpplint syntax check

test=develop
```
09 Aug, 2018 · 1 commit
- add softmax op converter · 641f32da
  Committed by nhzlx on Aug 09, 2018
25 Jul, 2018 · 1 commit
- unify libpaddle_inference_api into libpaddle_fluid · 5ba43376
  Committed by Luo Tao on Jul 25, 2018
24 Jul, 2018 · 2 commits
- fix comments · 4d49e61a
  Committed by nhzlx on Jul 24, 2018
- 1. set ut batch > 1 2. readd the mul op(utest will be added later) · 7382f986
  Committed by nhzlx on Jul 24, 2018
07 Jun, 2018 · 2 commits
- add test_mode in trt/activation_op · f6fb51a1
  Committed by Luo Tao on Jun 07, 2018
- feature/trt engine op test (#11182) · 4f95bc94
  Committed by Yan Chunwei on Jun 07, 2018
06 Jun, 2018 · 1 commit
- rewrite unittest of trt_activation_op · e116129f
  Committed by Luo Tao on Jun 06, 2018
01 Jun, 2018 · 1 commit
- fix compile errors · 31f0533c
  Committed by fengjiayi on Jun 01, 2018
14 May, 2018 · 1 commit
- OpConverter change BlockDesc to proto::BlockDesc (#10623) · 674bd839
  Committed by Yan Chunwei on May 14, 2018
03 May, 2018 · 1 commit
- add relu converter and unit-test · beb12455
  Committed by Luo Tao on May 03, 2018
27 Apr, 2018 · 1 commit
- update the register method · 6f6f3304
  Committed by Luo Tao on Apr 27, 2018
25 Apr, 2018 · 2 commits
- use template to do registry · c4e3010b
  Committed by Luo Tao on Apr 25, 2018
- auto registray op converters · d599de5c
  Committed by Luo Tao on Apr 25, 2018
23 Apr, 2018 · 1 commit
- tensorrt convert init · 42febfa9
  Committed by Luo Tao on Apr 23, 2018
26 Feb, 2018 · 2 commits
- Fix version date. · 9bbce493
  Committed by Xin Pan on Feb 26, 2018
- Extend current profiler for timeline and more features. · b9ec24c6
  Committed by Xin Pan on Feb 24, 2018
12 Feb, 2018 · 1 commit
- Fix the grammar in copyright. (#8403) · 24509f4a
  Committed by qingqing01 on Feb 12, 2018
10 Feb, 2018 · 2 commits
- Correct #include path · fc374821
  Committed by Yi Wang on Feb 09, 2018
- Move file to fluid/; Edit CMakeLists.txt · 90648f33
  Committed by Yi Wang on Feb 09, 2018
09 Jan, 2018 · 1 commit

Port WarpCTC Operator (#5107) · b5fda272

Committed by Yiqun Liu on Jan 09, 2018

* Add Seq2BatchFunctor, which will be used in WarpCTCOp.

* Implement WrapCTCFunctor and WrapCTCKernel.

* Add unittest of warpctc_op.

* Modify the check_output interface in python unittest framework to allow checking a subset of outputs.

* Use absolute offset lod in warpctc_op and related functors.

* Refine the comments of warpctc_op.

* The new python unittest supports checking a subset of the outputs, so revoke the previous change.

* Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor.

* Update to the newest codes.

* Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and remove the computation of dimensions out of the functors.

04 Aug, 2017 · 1 commit
- Add cpplint for *.h and cuda *.cu · b58725bd
  Committed by liaogang on Aug 04, 2017
11 Jul, 2017 · 1 commit
- Refine CUDA Related libraries · a0466053
  Committed by Yu Yang on Jul 11, 2017
