1. 12 Jul, 2021 1 commit
    • [Paddle-TRT] IPluginExt -> IPluginV2 (#33680) · 394f92aa
      zlsh80826 committed
      * add trt LT version helper
      
      * upgrade PluginTensorRT to IPluginV2Ext
      
      * trt plugin factory is not usable in IPluginV2
      
      * upgrade add plugin api to use IPluginV2
      
      * remove IPlugin register and adapt getSerializeSize(), serialize()
      
      * adapt IPluginV2Layer
      
      * downgrade to IPluginV2
      
      * implement elementwise clone
      
      * add gelu plugin creator and fix gelu serialization bug
      
      * add swish plugin creator and fix swish serialization bug
      
      * format
      
      * fix typo
      
      * add elementwise plugin creator and fix serialization
      
      * add base creator class
      
      * add gelu plugin creator
      
      * add hard swish creator and fix serialization
      
      * add instance norm creator and fix serialization
      
      * add layer norm creator and fix serialization
      
      * add pool creator and fix serialization
      
      * add prelu creator and fix serialization
      
      * add slice creator and fix serialization
      
      * add swish creator and fix serialization
      
      * add instance norm op unittest
      
      * remove redundant api
      
      * fix wrong graph size to enable trt
      
      * instance norm function move to cc
      
      * add trt elementwise ut to trigger coverage
      
      * remove opt cache to hit serialization coverage
      
      * remove opt cache to hit serialization coverage
      
      * remove unused code
      
      * remove unused inputs_
      
      * add dbg info
      
      * remove dbg info
      
      * add instance norm serialization
      
      * roll back
      
      * remove comment code
      
      * remove trt plugin registry
      
      * fix prelu dynamic serialization
      
      * add prelu ut and reduce the input size to reduce memory usage
      
      * fix pool dynamic plugin serialization and add ut
      
      * refine pool ut with subtest
      
      * add env for avoiding oom
      
      * reduce test input size & increase pool op ut to 45s
      
      * add the contributor
      
      * remove copyright (will add in contributor)
      
      * remove copyright (will add in contributor)
  2. 24 Jun, 2021 1 commit
    • [Paddle-TRT] Remove TensorRT deprecated API (#33654) · b6e9498e
      zlsh80826 committed
      * add trt LT version helper
      
      * remove deprecated nvinfer1::DimsCHW and replace it with nvinfer1::Dims3
      
      * remove deprecated nvinfer1::DimsNCHW and replace it with nvinfer1::Dims4
      
      * update deserialize engine
      
      * update to createNetworkV2
      
      * update to createNetworkV2
      
      * update buildWithConfig and remove redundant config settings
      
      * replace createNetwork to createNetworkV2
      
      * fix int8
      
      * addMatrixMultiply
      
      * remove unnecessary const cast
      
      * IBuilder->setInt8Calibrator() is deprecated
      
      * auto enable fp16 when using int8
      
      * remove the redundant line
  3. 23 Apr, 2021 1 commit
    • move semantic checks to op_teller (#32279) · 7c38114f
      wenbin committed
      * move semantic checks to op_teller
      
      * more ops
      
      * more ops
      
      * revert block related change
      
      * part1
      
      * revert activation
      
      * remove if
      
      * remove const_cast
      
      * resolve conflict
      
      * remove const_cast
      
      * delete useless var
      
      * replace vlog(1) with vlog(3), replace assert with PADDLE_ENFORCE
      
      * down to 19 files
  4. 04 Feb, 2021 1 commit
  5. 24 Sep, 2020 1 commit
    • use iwyu clean include (#27267) · df43905f
      wanghuancoder committed
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
  6. 11 May, 2020 1 commit
    • Add macro BOOST_GET to enrich the error information of boost::get (#24175) · aa0f254f
      Chen Weihang committed
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
  7. 12 Apr, 2020 1 commit
  8. 10 Apr, 2020 1 commit
  9. 24 Jul, 2019 1 commit
    • Update trt5 for paddle-trt (#18645) · 26ae6d49
      Zhaolong Xing committed
      * update paddle-trt for:
          1. fix bug: when batch > 2, core in split plugin.
          2. add leaky_relu trt5.0 support (yolov3 from 65ms to 42ms.)
          3. add new attr to dropout.
          4. shuffle channel, swish, relu6 support
          test=develop
      
      * 1. fix ci
      test=develop
  10. 25 May, 2019 1 commit
    • TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc
      Zhaolong Xing committed
      * fluid int8 train and trt int8 predict align.
      trt int8 predict init
      op converter
      
      * 2. align fluid int8 train and trt int8 inference.
      enhance quant dequant fuse pass
      enhance op converter, trt engine, trt engine op, trt subgraph pass.
      
      * 3. add delete_quant_dequant_pass for trt
      
      test=develop
      
      * 4. add the missing file
      test=develop
      
      * 5. I modified the C++ interface but forgot to modify the pybind code
      fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
      test=develop
  11. 13 Nov, 2018 1 commit
  12. 12 Nov, 2018 1 commit
  13. 08 Nov, 2018 1 commit
  14. 13 Sep, 2018 2 commits
  15. 24 Aug, 2018 1 commit
  16. 25 Jul, 2018 1 commit
  17. 24 Jul, 2018 2 commits
  18. 07 Jun, 2018 2 commits
  19. 06 Jun, 2018 1 commit
  20. 01 Jun, 2018 1 commit
  21. 14 May, 2018 1 commit
  22. 03 May, 2018 1 commit
  23. 27 Apr, 2018 1 commit
  24. 25 Apr, 2018 2 commits
  25. 23 Apr, 2018 1 commit
  26. 26 Feb, 2018 2 commits
  27. 12 Feb, 2018 1 commit
  28. 10 Feb, 2018 2 commits
  29. 09 Jan, 2018 1 commit
    • Port WarpCTC Operator (#5107) · b5fda272
      Yiqun Liu committed
      * Add Seq2BatchFunctor, which will be used in WarpCTCOp.
      
      * Implement WrapCTCFunctor and WrapCTCKernel.
      
      * Add unittest of warpctc_op.
      
      * Modify the check_output interface in the Python unittest framework to allow checking a subset of outputs.
      
      * Use absolute offset lod in warpctc_op and related functors.
      
      * Refine the comments of warpctc_op.
      
      * The new python unittest supports checking a subset of the outputs, so revoke the previous change.
      
      * Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor.
      
      * Update to the newest codes.
      
      * Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and remove the computation of dimensions out of the functors.
  30. 04 Aug, 2017 1 commit
  31. 11 Jul, 2017 1 commit