1. 03 Nov, 2020 1 commit
    • Optimize inference performance of the ernie model in TensorRT and support variable-length input (#28367) · ea851796
      Shang Zhizhou committed
      * fp16 result ok
      
      * change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS
      
      * auto detect special slice op converter for ernie with trt oss
      
      * ernie oss only support fp16
      
      * fix special_slice_plugin serialize bug
      
      * matmul in tensorrt ok
      
      * ernie unittest ok
      
      * add matmul tensorrt unittest
      
      * remove demo code
  2. 29 Oct, 2020 1 commit
  3. 26 Oct, 2020 1 commit
  4. 21 Oct, 2020 1 commit
  5. 20 Oct, 2020 1 commit
  6. 13 Oct, 2020 2 commits
  7. 10 Oct, 2020 1 commit
  8. 24 Sep, 2020 3 commits
    • Fix memory leak for mkldnn. (#27493) · df7fabee
      Wilber committed
    • Crop Windows lib size from 5.4G to 3.9G (#27477) · ec4155d7
      Wilber committed
    • Use iwyu to clean includes (#27267) · df43905f
      wanghuancoder committed
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
  9. 23 Sep, 2020 2 commits
  10. 22 Sep, 2020 1 commit
  11. 21 Sep, 2020 1 commit
  12. 16 Sep, 2020 1 commit
  13. 14 Sep, 2020 1 commit
  14. 11 Sep, 2020 2 commits
  15. 10 Sep, 2020 1 commit
  16. 09 Sep, 2020 1 commit
  17. 07 Sep, 2020 1 commit
  18. 02 Sep, 2020 1 commit
  19. 01 Sep, 2020 2 commits
    • [Paddle-TRT] Stack op plugin (#25605) · ad6e3dd6
      zlsh80826 committed
      * add stack_op to CMakeLists
      
      * add dim=3 support for scale op
      
      * add trt stack op, test=develop
      
      * remove debug message
      
      * add stack plugin serialize
      
      * remove slice, scale op, will add later
      
      * enhance error message
      
      * revise trt ernie test to cover the stack op CI test, test=develop
      
      * add stack op serialization
      
      * fix test shape after adding stack op
      
      * remove slice op, will add after implementing serialization
      
      * roll back to min_graph=5 to avoid using slice op
      
      * fix scale op output layer
      
      * implement stack op createPlugin
      
      * use workspace and move the definition to .cu
      
      * move stack plugin creator definition to .cu, test=develop
    • Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854) · ced6e87e
      石晓伟 committed
      This reverts commit 02083bda.
  20. 29 Aug, 2020 1 commit
  21. 28 Aug, 2020 2 commits
  22. 25 Aug, 2020 1 commit
  23. 08 Aug, 2020 1 commit
  24. 22 Jul, 2020 1 commit
    • supports xpu runtime, test=develop (#25554) · 72064172
      石晓伟 committed
      * update ResetHolder, test=develop
      
      * add TensorShare for lite engine, test=develop
      
      * tensor data changed from copying to sharing, test=develop
      
      * supports xpu runtime, test=develop
      
      * fix code styles, test=develop
  25. 21 Jul, 2020 1 commit
  26. 14 Jul, 2020 1 commit
  27. 02 Jul, 2020 1 commit
  28. 30 Jun, 2020 1 commit
  29. 23 Jun, 2020 1 commit
  30. 10 Jun, 2020 1 commit
  31. 09 Jun, 2020 1 commit
  32. 08 Jun, 2020 1 commit
  33. 05 Jun, 2020 1 commit