1. 30 Mar, 2021 1 commit
    • Scale 1.8 (#31940) · c36c22fe
      Committed by Shang Zhizhou
      * add n-d input support for trt scale converter (#31316)
      
      * add n-d input support for trt scale converter
      
      * add flatten for ut
      
      * fix dims
      
      * fix batchnorm when input dims < 3 (#31933)
      
      * fix batchnorm when input dims < 3
      
      * add unittest for batchnorm dims = 2
      
      * fix unittest
      Co-authored-by: Pei Yang <peiyang@baidu.com>
  2. 24 Mar, 2021 1 commit
  3. 11 Mar, 2021 1 commit
  4. 04 Jan, 2021 1 commit
  5. 07 Dec, 2020 1 commit
    • cherry-pick PR #27933 (#29377) · 9a6ecb03
      Committed by Shang Zhizhou
      * cherry-pick PR #27933
      
      * fix: cuda version is in variable CUDA_VERSION in 1.8 cuda.cmake
      
      * close unittest failed temporarily
      
      * cherry-pick PR #27544, fix layer_norm and softmax bug in tensorRT
  6. 03 Dec, 2020 2 commits
  7. 01 Dec, 2020 1 commit
  8. 25 Nov, 2020 1 commit
  9. 20 Nov, 2020 1 commit
  10. 13 Nov, 2020 1 commit
    • Skip layernorm to 1.8 (#28583) · ec672e88
      Committed by Shang Zhizhou
      * Add TRT support for pruned transformer models; fix the bug where TensorRT does not support DeletePass (#28517)
      
      * skip_layernorm_op done
      
      * add unittest
      
      * slice op converter supports trt < 6
      
      * skip_layernorm only works in ernie
      
      * fix unittest
      
      * fix unittest
  11. 09 Nov, 2020 1 commit
  12. 05 Nov, 2020 1 commit
    • Ernie varlen to 1.8 (#28400) · 78d68d59
      Committed by Shang Zhizhou
      * Fix TRT plugin registry without TRT lib (#25982)
      
      * fix trt plugin registry without trt lib
      
      * support trt4
      
      * refine code style
      
      * pick ea851796 from develop
      
      * cherry-pick develop PR  #26273 && #27796
      
      * fix unittest error
      
      * fix unittest error
      
      * remove const_cast
      Co-authored-by: Pei Yang <peiyang@baidu.com>
  13. 23 Sep, 2020 1 commit
    • Optimize slice trt plugin (#26970) (#27456) · 8e1712a7
      Committed by Pei Yang
      * optimize slice TRT plugin
      
      This patch removes an unnecessary barrier on the transfer of the needed offsets,
      so the data transfer can overlap with GPU kernel execution.
      
      This patch also fixes the incorrect name of the slice plugin, replacing
      "layernorm" with "slice".
      
      test=develop
      
      * add serialize/deserialize to slice plugin
      
      * add static shape slice trt plugin
      
      * fix slice trt op converter dynamic shape bug
      
      * fix format by clang-format
      
      * fix pylint format error
      
      * fix problems commented by peiyang
      Co-authored-by: Ryan Jeng <rjeng@nvidia.com>
      Co-authored-by: Shang Zhizhou <shangzhizhou@baidu.com>
  14. 21 Sep, 2020 1 commit
  15. 18 Sep, 2020 1 commit
    • [cherry-pick][Paddle-TRT] Stack op plugin (#25605) (#27365) · 4283be52
      Committed by Pei Yang
      * [Paddle-TRT] Stack op plugin (#25605)
      
      * add stack_op to CMakeLists
      
      * add dim=3 support for scale op
      
      * add trt stack op, test=develop
      
      * remove debug message
      
      * add stack plugin serialize
      
      * remove slice, scale op, will add later
      
      * enhance error message
      
      * revise trt ernie test to cover the stack op CI test, test=develop
      
      * add stack op serialization
      
      * fix test shape after adding stack op
      
      * remove slice op, will add after implementing serialization
      
      * roll back to min_graph=5 to avoid using slice op
      
      * fix scale op output layer
      
      * implement stack op createPlugin
      
      * use workspace and move the definition to .cu
      
      * move stack plugin creator definition to .cu, test=develop
      
      * sync ut with develop
      Co-authored-by: zlsh80826 <zlsh80826@gmail.com>
  16. 01 Sep, 2020 1 commit
  17. 11 Aug, 2020 1 commit
  18. 07 Aug, 2020 2 commits
  19. 06 Aug, 2020 3 commits
  20. 04 Aug, 2020 2 commits
  21. 30 Jul, 2020 1 commit
    • Cherry-pick of lite engine, test=release/1.8 (#25817) · 45fa6861
      Committed by 石晓伟
      * ignore warnings of external libraries, test=develop (#24193)
      
      * fix repeat definitions in liengine.cc, test=develop (#25020)
      
      * remove paddle_use_kernel and paddle_use_op. test=develop (#25189)
      
      * fix compile for lite subgraph. test=develop (#25285)
      
      * [CI] [Lite-Subgraph] CI add lite subgraph check. (#25346)
      
      * supports xpu runtime, test=develop (#25554)
      
      * fix cmake of lite, test=develop (#25680)
      
      * change commit files, test=release/1.8
      Co-authored-by: Wilber <jiweibo@baidu.com>
  22. 27 Jul, 2020 1 commit
  23. 06 Jul, 2020 1 commit
  24. 01 Jul, 2020 2 commits
  25. 15 May, 2020 1 commit
  26. 30 Apr, 2020 1 commit
  27. 25 Apr, 2020 1 commit
  28. 24 Apr, 2020 1 commit
  29. 23 Apr, 2020 2 commits
    • 1b45847e
    • [Cherry-pick]: 23974, 23723, 23984 (#24084) · 26a1def9
      Committed by Zhaolong Xing
      * Cherry-pick: [Ernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723)
      
      * refine ernie trt dynamic shape support
      1. add slice op converter
      2. add emb eltwise layernorm fp16 support
      test=develop
      
      * fix dynamic shape test ut
      test=develop
      
      * fix comments.
      test=develop
      
      * fix comments
      test=develop
      
      * cherry-pick [BUG]: Head number can only be > 1 on multihead op (#23974)
      
      * support the head number == 1
      test=develop
      
      * fix slice op error.
      test=develop
      
      * cherry-pick :disable trt test, test=develop (#23984)
      
      test=release/2.0-beta
  30. 21 Apr, 2020 1 commit
  31. 20 Apr, 2020 1 commit
  32. 17 Apr, 2020 2 commits