1. 26 Aug, 2022 1 commit
  2. 16 Aug, 2022 1 commit
    • convert multihead to oss (#45019) · f706d95d
      Committed by feng_shuai
      * convert multihead to oss
      
      * fix:bug
      
      * fix:delete const cast
      
      * fix:don't support bias_qk
      
      * add vit pass
      
      * fix:convert bug and add preln_residual_bias
      
      * support length=-1
      
      * add UT for convert
      
      * add no_bias_qk support for gpu_multihead_op
      
      * delete infer_shape depends on bias_qk
      
      * oss just can be used in T4 and A*
      
      * fix:change api for ROCM CI
  3. 01 Aug, 2022 1 commit
  4. 19 Jul, 2022 1 commit
  5. 08 Jul, 2022 1 commit
  6. 22 Jun, 2022 1 commit
  7. 20 Jun, 2022 1 commit
  8. 02 Jun, 2022 1 commit
  9. 25 May, 2022 1 commit
  10. 02 Apr, 2022 1 commit
  11. 04 Mar, 2022 1 commit
  12. 11 Feb, 2022 1 commit
    • [Paddle Inference] support ernie quant model with interleaved (#39424) · 1c44d3e2
      Committed by Wangzheee
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
  13. 24 Sep, 2021 1 commit
  14. 07 Sep, 2021 1 commit
  15. 30 Aug, 2021 1 commit
  16. 17 Jun, 2021 1 commit
  17. 16 Apr, 2021 1 commit
  18. 02 Apr, 2021 1 commit
  19. 23 Mar, 2021 1 commit
    • fix tensorrt output variable reshape (#31733) · 9d04ef73
      Committed by Shang Zhizhou
      * fix tensorrt output variable reshape
      
      * move padding shape x 1 x 1 in ernie to qkv and fc
      
      * update layer name
      
      * fix softmax when input is dynamic, fc not padding any more
      
      * fix varlen
      
      * move fc x_dim assert to op_teller
  20. 10 Mar, 2021 1 commit
  21. 27 Nov, 2020 1 commit
    • detect tensorRT plugin fp16 in runtime (#27933) · b9e76a01
      Committed by Shang Zhizhou
      * remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
      
      * compile with cuda9
      
      * add some unittest
      
      * notest;test=coverage
      
      * add unittest for trt plugin swish && split
      
      * update ernie unittest
      
      * fix some error message
      
      * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
      
      * fix compile error when CUDA_ARCH_NAME < Pascal
      
      * fix compile error
      
      * update unittest timeout
      
      * compile with cuda9
      
      * update error msg
      
      * fix code style
      
      * add some comments
      
      * add define IF_CUDA_ARCH_SUPPORT_FP16
      
      * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
  22. 03 Nov, 2020 1 commit
    • Optimize ernie model inference performance in TensorRT and support variable-length input (#28367) · ea851796
      Committed by Shang Zhizhou
      * fp16 result ok
      
      * change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS
      
      * auto detect special slice op converter for ernie with trt oss
      
      * ernie oss only support fp16
      
      * fix special_slice_plugin serialize bug
      
      * matmul in tensorrt ok
      
      * ernie unittest ok
      
      * add matmul tensorrt unittest
      
      * remove demo code
  23. 11 May, 2020 1 commit
    • Add macro BOOST_GET to enrich the error information of boost::get (#24175) · aa0f254f
      Committed by Chen Weihang
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
  24. 26 Mar, 2020 1 commit
    • [Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099
      Committed by Zhaolong Xing
      * add dynamic plugin support.
      test=develop
      
      * change emb eltwise layernorm to math function
      test=develop
      
      * add emb eltwise layernorm
      test=develop
      
      * can run dynamic shape ernie
      test=develop
      
      * fix ci
      test=develop
      
      * add ut for trt ernie dynamic
      test=develop
      
      * refine dynamic shape c++ interface.
      test=develop
      
      * fix comments
      test=develop
      
      * fix comments
      test=develop
  25. 06 Jan, 2020 1 commit