1. 20 7月, 2021 1 次提交
    • Z
      [Paddle-TRT] Add noexcept on methods inherited from TensorRT (#34157) · b5aab4f0
      zlsh80826 提交于
      * add trt noexcept definition
      
      * add trt noexcept on trt plugin
      
      * add trt noexcept on trt int8 calibrator
      
      * remove noexcept on base serialize
      
      * add trt noexcept on split plugin
      
      * add trt noexcept on elementwise plugin
      
      * add trt noexcept on prelu plugin
      
      * add trt noexcept on pool plugin
      
      * add trt noexcept on swish plugin
      
      * add trt noexcept on gelu plugin
      
      * add trt noexcept on layer norm plugin
      
      * add trt noexcept on instance norm plugin
      
      * add trt noexcept on emb eltwise layernorm plugin
      
      * add trt noexcept on qkv2context plugin
      
      * add trt noexcept on skip layernorm plugin
      
      * add trt noexcept on slice plugin
      
      * add trt noexcept on hard swish plugin
      
      * add trt noexcept on stack plugin
      
      * add trt noexcept on special slice plugin
      
      * add trt noexcept on anchor generator plugin
      
      * add trt noexcept on yolobox plugin
      
      * add trt noexcept on roi align plugin
      
      * add trt noexcept on gather nd plugin
      b5aab4f0
  2. 12 7月, 2021 1 次提交
    • Z
      [Paddle-TRT] IPluginExt -> IPluginV2 (#33680) · 394f92aa
      zlsh80826 提交于
      * add trt LT version helper
      
      * upgrade PluginTensorRT to IPluginV2Ext
      
      * trt plugin factory is not usable in IPluginV2
      
      * upgrade add plugin api to use IPluginV2
      
      * remove IPlugin register and adapt getSerializeSize(), serialize()
      
      * adapt IPluginV2Layer
      
      * downgrade to IPluginV2
      
      * implement elementwise clone
      
      * add gelu plugin creator and fix gelu serialization bug
      
      * add swish plugin creator and fix swish serialization bug
      
      * format
      
      * fix typo
      
      * add elementwise plugin creator and fix serialization
      
      * add base creator class
      
      * add gelu plugin creator
      
      * add hard swish creator and fix serialization
      
      * add instance norm creator and fix serialization
      
      * add layer norm creator and fix serialization
      
      * add pool creator and fix serialization
      
      * add prelu creator and fix serialization
      
      * add slice creator and fix serialization
      
      * add swish creator and fix serialization
      
      * add instance norm op unittest
      
      * remove redundent api
      
      * fix wrong graph size to enable trt
      
      * instance norm function move to cc
      
      * add trt elementwise ut to trigger coverage
      
      * remove opt cahce to hit serialization coverage
      
      * remove opt cahce to hit serialization coverage
      
      * remove unused code
      
      * remove unused inputs_
      
      * add dbg info
      
      * remove dbg info
      
      * add instance norm serialization
      
      * roll back
      
      * remove comment code
      
      * remove trt plugin registery
      
      * fix prelu dynamic serialization
      
      * add prelu ut and reduce the input size to reduce memory usage
      
      * fix pool dynamic plugin serialization and add ut
      
      * refine pool ut with subtest
      
      * add env for avoiding oom
      
      * reduce test input size & increase pool op ut to 45s
      
      * add the contributor
      
      * remove copyright (will add in contributor)
      
      * remove copyright (will add in contributor)
      394f92aa
  3. 24 6月, 2021 1 次提交
  4. 23 3月, 2021 1 次提交
  5. 04 2月, 2021 1 次提交
  6. 27 11月, 2020 1 次提交
    • S
      detect tensorRT plugin fp16 in runtime (#27933) · b9e76a01
      Shang Zhizhou 提交于
      * remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
      
      * comile with cuda9
      
      * add some unittest
      
      * notest;test=coverage
      
      * add unittest for trt plugin swish && split
      
      * update ernie unittest
      
      * fix some error message
      
      * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
      
      * fix comile errror when CUDA_ARCH_NAME < Pascal"
      
      * fix comile error
      
      * update unittest timeout
      
      * compile with cuda9
      
      * update error msg
      
      * fix code style
      
      * add some comments
      
      * add define IF_CUDA_ARCH_SUPPORT_FP16
      
      * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
      b9e76a01
  7. 20 4月, 2020 1 次提交
    • Z
      Optimize the error messages of paddle CUDA API (#23816) · 78170037
      Zhou Wei 提交于
      * Optimize the error messages of paddle CUDA API, test=develop
      
      * fix the error messages of paddle CUDA API, test=develop
      
      * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop
      
      * remove build_ex_string,test=develop
      
      * merge conflict,test=develop
      78170037
  8. 14 4月, 2020 1 次提交
  9. 01 4月, 2020 1 次提交
  10. 05 9月, 2019 1 次提交
  11. 24 7月, 2019 1 次提交
    • Z
      Update trt5 for paddle-trt (#18645) · 26ae6d49
      Zhaolong Xing 提交于
      * update paddle-trt for:
          1. fix bug: when batch > 2, core in split plugin.
          2. add leaky_relu trt5.0 support (yolov3 from 65ms to 42ms.)
          3. add new attr to dropout.
          4. shuffle channel, swish, relu6 support
          test=develop
      
      * 1. fix ci
      test=develop
      26ae6d49
  12. 08 3月, 2019 1 次提交
    • N
      5. add static trt load model · f3d164fa
      nhzlx 提交于
      1). add static trt load model
      2). fix bug: when device_id is not 0, the trt will have a bug
      test=develop
      f3d164fa
  13. 22 2月, 2019 1 次提交
    • N
      5. add static trt load model · 1d5ef7c9
      nhzlx 提交于
      1). add static trt load model
      2). fix bug: when device_id is not 0, the trt will have a bug
      test=develop
      1d5ef7c9
  14. 21 11月, 2018 2 次提交
  15. 20 11月, 2018 1 次提交
  16. 16 11月, 2018 1 次提交
  17. 13 11月, 2018 3 次提交