1. 22 Sep, 2022 1 commit
  2. 05 Sep, 2022 2 commits
    • New format quant model support for MKLDNN (#45416) · 4e4f4586
      yeliang2258 committed
      * support onnx format quantized model
      
      * update code
      
      * add test
      
      * add test
      
      * fix
      
      * fix test
      
      * fix cmake
      
      * update code
      
      * change scale file path to calibration file path
      
      * update code
      
      * update code
      
      * fix build bug
      
      * fix build bugs
      
      * fix
      
      * fix
    • Update DlNNE engine (#45027) · 638965c5
      denglin-github committed
      * add config param for enable_dlnne and support calibration mode
      * remove useless file
      * refine code and add annotation
      * refine code of Warning tips
  3. 05 Aug, 2022 1 commit
  4. 08 Jul, 2022 1 commit
  5. 05 Jul, 2022 1 commit
  6. 29 Jun, 2022 1 commit
  7. 24 Jun, 2022 1 commit
  8. 02 Jun, 2022 1 commit
  9. 14 Apr, 2022 1 commit
  10. 17 Mar, 2022 1 commit
  11. 23 Feb, 2022 1 commit
  12. 11 Feb, 2022 1 commit
  13. 26 Jan, 2022 1 commit
  14. 13 Jan, 2022 1 commit
  15. 15 Dec, 2021 1 commit
  16. 22 Oct, 2021 1 commit
  17. 22 Sep, 2021 1 commit
  18. 14 Sep, 2021 1 commit
  19. 25 Apr, 2021 2 commits
    • update lite subgraph api. (#32513) · 92dc9b2b
      Wilber committed
    • Nne integration (#32255) · feb2e476
      denglin-github committed
      * Add dlnne engine runtime
      
      * Fix log
      
      * Remove <const_cast> and remove unrelated modify with dlnne, +clang-format
      
      * Fix CMakeList format error
      
      * Add copyright message
      
      * Fix dlnne CMakeList.txt
      
      * Add some paddlepaddle_pass to support more networks
      
      * Fix some format bug
  20. 25 Jan, 2021 1 commit
  21. 06 Jan, 2021 1 commit
  22. 03 Nov, 2020 1 commit
    • Optimize ernie model inference performance in TensorRT, support variable-length input (#28367) · ea851796
      Shang Zhizhou committed
      * fp16 result ok
      
      * change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS
      
      * auto detect special slice op converter for ernie with trt oss
      
      * ernie oss only support fp16
      
      * fix special_slice_plugin serialize bug
      
      * matmul in tensorrt ok
      
      * ernie unittest ok
      
      * add matmul tensorrt unittest
      
      * remove demo code
  23. 13 Oct, 2020 1 commit
  24. 16 Sep, 2020 1 commit
  25. 11 Sep, 2020 1 commit
  26. 22 Jul, 2020 1 commit
    • supports xpu runtime, test=develop (#25554) · 72064172
      石晓伟 committed
      * update ResetHolder, test=develop
      
      * add TensorShare for lite engine, test=develop
      
      * tensor data changed from copying to sharing, test=develop
      
      * supports xpu runtime, test=develop
      
      * fix code styles, test=develop
  27. 26 Mar, 2020 1 commit
    • [Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099
      Zhaolong Xing committed
      * add dynamic plugin support.
      test=develop
      
      * change emb eltwise layernorm to math function
      test=develop
      
      * add emb eltwise layernorm
      test=develop
      
      * can run dynamic shape ernie
      test=develop
      
      * fix ci
      test=develop
      
      * add ut for trt ernie dynamic
      test=develop
      
      * refine dynamic shape c++ interface.
      test=develop
      
      * fix comments
      test=develop
      
      * fix comments
      test=develop
  28. 09 Mar, 2020 1 commit
  29. 24 Feb, 2020 1 commit
  30. 04 Feb, 2020 1 commit
  31. 09 Jan, 2020 1 commit
  32. 04 Dec, 2019 1 commit
  33. 25 Sep, 2019 1 commit
  34. 19 Aug, 2019 1 commit
  35. 11 Jul, 2019 1 commit
  36. 08 Jul, 2019 1 commit
  37. 06 Jun, 2019 1 commit
  38. 25 May, 2019 1 commit
    • TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc
      Zhaolong Xing committed
      * fluid int8 train and trt int8 predict align.
      trt int8 predict init
      op converter
      
      * 2. align fluid int8 train and trt int8 inference.
      enhance quant dequant fuse pass
      enhance op converter, trt engine, trt engine op, trt subgraph pass.
      
      * 3. add delete_quant_dequant_pass for trt
      test=develop
      
      * 4. add the missing file
      test=develop
      
      * 5. I modified the C++ interface but forgot to modify the pybind code;
      fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
      test=develop