1. 20 10月, 2021 1 次提交
  2. 11 10月, 2021 3 次提交
  3. 30 9月, 2021 1 次提交
  4. 17 9月, 2021 2 次提交
  5. 08 9月, 2021 1 次提交
    • S
      Slice bug (#35357) · 28abd5d8
      Shang Zhizhou 提交于
      * update slice plugin
      
      * add test
      
      * fix code style
      
      * fix trt6
      
      * update test
      
      * fix test
      
      * add timeout
      
      * update trt version
      
      * update cmake
      28abd5d8
  6. 23 8月, 2021 1 次提交
  7. 20 7月, 2021 1 次提交
    • Z
      [Paddle-TRT] Add noexcept on methods inherited from TensorRT (#34157) · b5aab4f0
      zlsh80826 提交于
      * add trt noexcept definition
      
      * add trt noexcept on trt plugin
      
      * add trt noexcept on trt int8 calibrator
      
      * remove noexcept on base serialize
      
      * add trt noexcept on split plugin
      
      * add trt noexcept on elementwise plugin
      
      * add trt noexcept on prelu plugin
      
      * add trt noexcept on pool plugin
      
      * add trt noexcept on swish plugin
      
      * add trt noexcept on gelu plugin
      
      * add trt noexcept on layer norm plugin
      
      * add trt noexcept on instance norm plugin
      
      * add trt noexcept on emb eltwise layernorm plugin
      
      * add trt noexcept on qkv2context plugin
      
      * add trt noexcept on skip layernorm plugin
      
      * add trt noexcept on slice plugin
      
      * add trt noexcept on hard swish plugin
      
      * add trt noexcept on stack plugin
      
      * add trt noexcept on special slice plugin
      
      * add trt noexcept on anchor generator plugin
      
      * add trt noexcept on yolobox plugin
      
      * add trt noexcept on roi align plugin
      
      * add trt noexcept on gather nd plugin
      b5aab4f0
  8. 12 7月, 2021 1 次提交
    • Z
      [Paddle-TRT] IPluginExt -> IPluginV2 (#33680) · 394f92aa
      zlsh80826 提交于
      * add trt LT version helper
      
      * upgrade PluginTensorRT to IPluginV2Ext
      
      * trt plugin factory is not usable in IPluginV2
      
      * upgrade add plugin api to use IPluginV2
      
      * remove IPlugin register and adapt getSerializeSize(), serialize()
      
      * adapt IPluginV2Layer
      
      * downgrade to IPluginV2
      
      * implement elementwise clone
      
      * add gelu plugin creator and fix gelu serialization bug
      
      * add swish plugin creator and fix swish serialization bug
      
      * format
      
      * fix typo
      
      * add elementwise plugin creator and fix serialization
      
      * add base creator class
      
      * add gelu plugin creator
      
      * add hard swish creator and fix serialization
      
      * add instance norm creator and fix serialization
      
      * add layer norm creator and fix serialization
      
      * add pool creator and fix serialization
      
      * add prelu creator and fix serialization
      
      * add slice creator and fix serialization
      
      * add swish creator and fix serialization
      
      * add instance norm op unittest
      
      * remove redundent api
      
      * fix wrong graph size to enable trt
      
      * instance norm function move to cc
      
      * add trt elementwise ut to trigger coverage
      
      * remove opt cahce to hit serialization coverage
      
      * remove opt cahce to hit serialization coverage
      
      * remove unused code
      
      * remove unused inputs_
      
      * add dbg info
      
      * remove dbg info
      
      * add instance norm serialization
      
      * roll back
      
      * remove comment code
      
      * remove trt plugin registery
      
      * fix prelu dynamic serialization
      
      * add prelu ut and reduce the input size to reduce memory usage
      
      * fix pool dynamic plugin serialization and add ut
      
      * refine pool ut with subtest
      
      * add env for avoiding oom
      
      * reduce test input size & increase pool op ut to 45s
      
      * add the contributor
      
      * remove copyright (will add in contributor)
      
      * remove copyright (will add in contributor)
      394f92aa
  9. 09 7月, 2021 1 次提交
  10. 28 6月, 2021 1 次提交
  11. 25 6月, 2021 1 次提交
  12. 24 6月, 2021 1 次提交
  13. 21 6月, 2021 1 次提交
  14. 15 6月, 2021 1 次提交
  15. 08 6月, 2021 1 次提交
    • S
      add dynamic layer_norm plugin (#33293) · 45d1ae21
      Shang Zhizhou 提交于
      * add dynamic layer_norm plugin
      
      * fix bug
      
      * fix numpy.allclose
      
      * fix format
      
      * fix code style
      
      * remove shepe in dynamic shape
      
      * code format
      
      * remove layer norm fp16
      
      * fix format
      45d1ae21
  16. 05 6月, 2021 1 次提交
  17. 29 4月, 2021 1 次提交
  18. 06 4月, 2021 1 次提交
  19. 02 4月, 2021 2 次提交
  20. 01 4月, 2021 1 次提交
    • Z
      [Paddle-TRT] add anchor generator op plugin (#31730) · b807e408
      zlsh80826 提交于
      * add anchor generator op plugin
      
      * add anchor generator unit_test
      
      * remove dbg info
      
      * remove redundant line
      
      * replace assertion with paddle enforce
      
      * dynamic plugin replaces assertion with paddle enforce
      
      * anchor generator support dynamic shape on spatial axis
      
      * anchor generator test with fp16, dynamic shape
      
      * add anchor generator test all
      
      * add back main
      
      * reduce test input size to not exceed the timelimit of ci
      
      * change super to InferencePassTest for python2 compatibility
      
      * reuse paddle operator anchor generator
      
      * move creator construct to header with default
      
      * add cuda ifdef
      
      * reduce line
      
      * change super to InferencePassTest for python2 compatibility
      
      * fix anchor generator fp16 serialize setting
      
      * split unittest from test_all
      
      * restrict anchor generator input format before version 7234
      
      * anchor generator only support greater than trt7.1
      
      * change min_graph_size to 2
      
      * min_graph size to 3 if dynamic shape
      
      * reduce dynamic shape size to avoid trt search tactic too long to exceed time limit
      
      * remove anchor from fetch list
      
      * anchor generator support all trt version
      
      * fix memory not allocated but if serialized
      b807e408
  21. 30 3月, 2021 1 次提交
  22. 29 3月, 2021 1 次提交
    • Z
      [Paddle-TRT] roi_align_plugin (#31732) · e3a38d79
      zlsh80826 提交于
      * add roi_align_plugin
      
      * add roi align unit_test
      
      * add roi align serialization
      
      * remove roi align static plugin because of batch dim issue
      
      * refine roi align unittest and add fp16/serialization
      
      * add trt roi align condition to op_teller
      
      * refine error message
      
      * remove unnecessary reshape layer
      e3a38d79
  23. 23 3月, 2021 2 次提交
  24. 12 3月, 2021 1 次提交
  25. 10 3月, 2021 1 次提交
  26. 02 3月, 2021 1 次提交
  27. 04 2月, 2021 2 次提交
  28. 02 2月, 2021 1 次提交
  29. 27 11月, 2020 1 次提交
    • S
      detect tensorRT plugin fp16 in runtime (#27933) · b9e76a01
      Shang Zhizhou 提交于
      * remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
      
      * comile with cuda9
      
      * add some unittest
      
      * notest;test=coverage
      
      * add unittest for trt plugin swish && split
      
      * update ernie unittest
      
      * fix some error message
      
      * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
      
      * fix comile errror when CUDA_ARCH_NAME < Pascal"
      
      * fix comile error
      
      * update unittest timeout
      
      * compile with cuda9
      
      * update error msg
      
      * fix code style
      
      * add some comments
      
      * add define IF_CUDA_ARCH_SUPPORT_FP16
      
      * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
      b9e76a01
  30. 03 11月, 2020 1 次提交
    • S
      TensorRT中ernie模型推理性能优化,支持变长输入 (#28367) · ea851796
      Shang Zhizhou 提交于
      * fp16 result ok
      
      * change -DWITH_NVINFER_PLUGIN toconfig.EnableTensorRtOSS
      
      * auto detect special slice op converter for ernie with trt oss
      
      * ernie oss only support fp16
      
      * fix special_slice_plugin serialize bug
      
      * matmul in tensorrt ok
      
      * ernie unittest ok
      
      * add matmul tensorrt unittest
      
      * remove demo code
      ea851796
  31. 28 9月, 2020 1 次提交
  32. 24 9月, 2020 1 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
  33. 23 9月, 2020 1 次提交
  34. 22 9月, 2020 1 次提交