1. 28 9月, 2020 1 次提交
  2. 27 9月, 2020 2 次提交
  3. 23 9月, 2020 1 次提交
    • P
      Optimize slice trt plugin (#26970) (#27456) · 8e1712a7
      Pei Yang 提交于
      * optimize slice TRT plugin
      
      This patch removes unnecessary barrier for data transfer of needed offset,
      so data transfer can be overlap with GPU kernel execution.
      
      This patch also fixes incorrect name of slice plugin. That is, replaces
      "layernorm" with "slice"
      
      test=develop
      
      * add serialize/deserialize to slice plugin
      
      * add static shape slice trt plugin
      
      * fix slice trt op convertor dynamic shape bug
      
      * fix format by clang-format
      
      * fix pylint format error
      
      * fix problems commented by peiyang
      Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
      Co-authored-by: NShang Zhizhou <shangzhizhou@baidu.com>
      Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
      8e1712a7
  4. 21 9月, 2020 2 次提交
  5. 18 9月, 2020 1 次提交
    • P
      [cherry-pick][Paddle-TRT] Stack op plugin (#25605) (#27365) · 4283be52
      Pei Yang 提交于
      * [Paddle-TRT] Stack op plugin (#25605)
      
      * add stack_op to CMakeLists
      
      * add dim=3 support for scale op
      
      * add trt stack op, test=develop
      
      * remove debug message
      
      * add stack plugin serialize
      
      * remove slice, scale op, will add later
      
      * enhence error message
      
      * revise trt ernie test to conver the stack op CI testi, test=develop
      
      * add stack op serialization
      
      * fix test shape after adding stack op
      
      * remove slice op, will add after implementing serialization
      
      * roll back to min_graph=5 to avoid using slice op
      
      * fix scale op output layer
      
      * implement stack op createPlugin
      
      * use workspace and move the defination to .cu
      
      * move stack plugin creator definition to .cu, test=develop
      
      * sync ut with develop
      Co-authored-by: Nzlsh80826 <zlsh80826@gmail.com>
      4283be52
  6. 16 9月, 2020 1 次提交
  7. 14 9月, 2020 1 次提交
  8. 11 9月, 2020 1 次提交
  9. 10 9月, 2020 1 次提交
  10. 03 9月, 2020 1 次提交
  11. 02 9月, 2020 2 次提交
  12. 01 9月, 2020 1 次提交
  13. 24 8月, 2020 1 次提交
  14. 19 8月, 2020 1 次提交
  15. 18 8月, 2020 1 次提交
  16. 12 8月, 2020 1 次提交
  17. 11 8月, 2020 1 次提交
  18. 10 8月, 2020 1 次提交
  19. 07 8月, 2020 3 次提交
  20. 06 8月, 2020 5 次提交
  21. 05 8月, 2020 3 次提交
  22. 04 8月, 2020 3 次提交
  23. 30 7月, 2020 2 次提交
    • Cherry-pick of lite engine, test=release/1.8 (#25817) · 45fa6861
      石晓伟 提交于
      * ignore warnings of external libraries, test=develop (#24193)
      
      * fix repeat definitions in liengine.cc, test=develop (#25020)
      
      * remove paddle_use_kernel and paddle_use_op. test=develop (#25189)
      
      * fix compile for lite subgraph. test=develop (#25285)
      
      * [CI] [Lite-Subgraph] CI add lite subgraph check. (#25346)
      
      * supports xpu runtime, test=develop (#25554)
      
      * fix cmake of lite, test=develop (#25680)
      
      * change commit files, test=release/1.8
      Co-authored-by: NWilber <jiweibo@baidu.com>
      45fa6861
    • C
      Fix index overflow bug of the CUDA kernel loop increment (#25435) (#25727) · e947d11e
      Chen Weihang 提交于
      * fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop
      
      * replace old macro & for condition, test=develop
      
      * polish details, test=develop
      e947d11e
  24. 29 7月, 2020 1 次提交
  25. 27 7月, 2020 2 次提交
    • A
      Fix FC + GRU fuse pass (#25733) · 71e350c5
      Adam 提交于
      71e350c5
    • C
      [Cherry-pick] Fix Cudnn lib load problem & polish install error message (#25706) · 2d7e7759
      Chen Weihang 提交于
      * Add default cudnn lib path (#25175)
      
      * add default cudnn lib path, test=develop
      
      * change default path in func, test=develop
      
      * move to linux branch, test=develop
      
      * fix var error in other plat, test=develop
      
      * Refactor dynamic dso search functions (#25214)
      
      * refactor dynamic dso search func, test=develop
      
      * polish details, test=develop
      
      * polish detail based review comments, test=develop
      
      * revert string type change, test=develop
      
      * Polish install error hint message (#25531)
      
      * polish install error hint msg, test=develop
      
      * fix variable error, test=develop
      
      * polish hint messgae again
      2d7e7759