1. 24 3月, 2021 1 次提交
  2. 04 1月, 2021 1 次提交
  3. 03 12月, 2020 1 次提交
  4. 13 11月, 2020 1 次提交
    • S
      Skip layernorm to 1.8 (#28583) · ec672e88
      Shang Zhizhou 提交于
      * 裁剪transformer模型trt支持;修复tensorRT不支持DeletePass的bug (#28517)
      
      * skip_layernorm_op done
      
      * add unittest
      
      * slice op convertor support trt < 6
      
      * skip_layernorm only work in ernie
      
      * fix unittest
      
      * fix unittest
      ec672e88
  5. 09 11月, 2020 1 次提交
  6. 05 11月, 2020 1 次提交
    • S
      Ernie varlen to 1.8 (#28400) · 78d68d59
      Shang Zhizhou 提交于
      * Fix TRT plugin registry without TRT lib (#25982)
      
      * fix trt plugin registry without trt lib
      
      * support trt4
      
      * refine code style
      
      * pick ea851796 from develop
      
      * cherry-pick develop PR  #26273 && #27796
      
      * fix unittest error
      
      * fix unittest error
      
      * remove const_cast
      Co-authored-by: NPei Yang <peiyang@baidu.com>
      78d68d59
  7. 07 8月, 2020 1 次提交
  8. 30 7月, 2020 1 次提交
    • Cherry-pick of lite engine, test=release/1.8 (#25817) · 45fa6861
      石晓伟 提交于
      * ignore warnings of external libraries, test=develop (#24193)
      
      * fix repeat definitions in liengine.cc, test=develop (#25020)
      
      * remove paddle_use_kernel and paddle_use_op. test=develop (#25189)
      
      * fix compile for lite subgraph. test=develop (#25285)
      
      * [CI] [Lite-Subgraph] CI add lite subgraph check. (#25346)
      
      * supports xpu runtime, test=develop (#25554)
      
      * fix cmake of lite, test=develop (#25680)
      
      * change commit files, test=release/1.8
      Co-authored-by: NWilber <jiweibo@baidu.com>
      45fa6861
  9. 09 4月, 2020 1 次提交
    • M
      Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a
      mozga-intel 提交于
      * Remove the NGraph engine from PDPD repository
      1. Each operator was removed from the operator's directory
      2. Each test was removed from the unittest directory
      3. The parallel executor support was removed from the PDPD
      4. The CMake file was removed from the PDPD
      5. The NG flags were removed from the repository
      test=develop
      
      * Remove ngraph from:
      1. Cmake file
      2. Python file
      test=develop
      3baaee9a
  10. 26 3月, 2020 1 次提交
    • Z
      [Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099
      Zhaolong Xing 提交于
      * add dynamic plugin support.
      test=develop
      
      * change emb eltwise layernorm to math function
      test=develop
      
      * add emb eltwise layernorm
      test=develop
      
      * can run dynamic shape ernie
      test=develop
      
      * fix ci
      test=develop
      
      * add ut for trt ernie dynamic
      
      test=develop
      
      * refine dynamic shape c++ interface.
      test=develop
      
      * fix comments
      test=develop
      
      * fix comments
      test=develop
      430b0099
  11. 09 3月, 2020 1 次提交
  12. 24 2月, 2020 1 次提交
  13. 23 2月, 2020 1 次提交
  14. 10 2月, 2020 1 次提交
  15. 04 2月, 2020 1 次提交
  16. 14 1月, 2020 1 次提交
  17. 09 1月, 2020 1 次提交
  18. 07 1月, 2020 1 次提交
  19. 03 1月, 2020 1 次提交
  20. 04 12月, 2019 1 次提交
  21. 26 11月, 2019 1 次提交
    • G
      Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972) · 234060f8
      GaoWei8 提交于
      * Add fc padding to solve mkl performance
      test=develop
      
      * fix gpu pass and error information
      test=develop
      
      * fix fc_fuse_pass_test
      test=develop
      
      * fix error information
      test=develop
      
      * fix error information
      test=develop
      
      * fix name and add fc op padding test
      test=develop
      
      * fix attributes
      test=develop
      
      * optimize fc padding
      test=develop
      
      * fix test
      test=develop
      234060f8
  22. 25 11月, 2019 1 次提交
  23. 23 10月, 2019 1 次提交
  24. 25 9月, 2019 1 次提交
  25. 21 9月, 2019 2 次提交
  26. 17 9月, 2019 1 次提交
  27. 04 9月, 2019 1 次提交
    • B
      Enable ngraph through build_strategy (#19266) · a3a4b6e5
      baojun 提交于
      * enable ngraph throught build_strategy test=develop
      
      * add unittest test=develop
      
      * put use_ngraph unconditional test=develop
      
      * remove paddle_enforce test=develop
      
      * remove paddle_enforce test=develop
      
      * fix copyright test=develop
      
      * limit for ngraph only test=develop
      a3a4b6e5
  28. 03 9月, 2019 1 次提交
    • Y
      A a pass to enable the use of cudnn (#19346) · c5548178
      Yiqun Liu 提交于
      * Add a interface to enable cudnn for inference.
      
      * Add cudnn_placement_pass.
      test=develop
      
      * Set the default value of cudnn_enabled_op_types to null.
      test=develop
      
      * Write the common basic class, placement_pass_base, to refine the codes.
      test=develop
      
      * Call EnableCUDNN in unittest.
      test=develop
      
      * Refine cudnn_placement_pass tester.
      
      * Enable the testing of cudnn_placement_pass in inference's unittest.
      test=develop
      
      * Add the check of op kernels.
      test=develop
      c5548178
  29. 19 8月, 2019 1 次提交
  30. 31 7月, 2019 1 次提交
    • Z
      Trt fp16 support (#18860) · 61238d31
      Zhaolong Xing 提交于
      * Fix Mask rcnn predictor
          1. refine memory optim algorithm to support the model with the block op.
          2. output diff : modify the affine channel fuse
          3. add condition_block_infer op
      add interface for setting trt calib table dir
      test=develop
      
      * add the missing files.
      test=develop
      
      * 1 add trt fp16 support
      test=develop
      61238d31
  31. 11 7月, 2019 1 次提交
  32. 08 7月, 2019 1 次提交
  33. 16 6月, 2019 1 次提交
  34. 11 6月, 2019 2 次提交
    • Update the Anakin interfaces for content-dnn and MLU (#17890) · bce259e5
      石晓伟 提交于
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
      bce259e5
    • Z
      Light mem reuse strategy for inference. (#17925) · 4e8d5a03
      Zhaolong Xing 提交于
      * fix: when use the load model from memory mode, the RAM occupy is high
      
      test=develop
      
      * ligth mem reuse
      test=develop
      
      * fix cpplint
      test=develop
      4e8d5a03
  35. 06 6月, 2019 1 次提交
  36. 29 5月, 2019 1 次提交
  37. 25 5月, 2019 1 次提交
    • Z
      TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc
      Zhaolong Xing 提交于
      * fluid int8 train and trt int8 predict align.
      trt int8 predict init
      op converter
      
      * 2. align fluid int8 train and trt int8 inference.
      enhance quant dequant fuse pass
      enhance op converter, trt engine, trt engine op, trt subgraph pass.
      
      * 3. add delete_quant_dequant_pass for trt
      
      test=develop
      
      * 4. add the missing file
      test=develop
      
      * 5. i modify the c++ interface, but forget to modify the pybind code
      fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
      test=develop
      61221ebc
  38. 16 5月, 2019 1 次提交