1. 05 11月, 2020 1 次提交
    • S
      Ernie varlen to 1.8 (#28400) · 78d68d59
      Shang Zhizhou 提交于
      * Fix TRT plugin registry without TRT lib (#25982)
      
      * fix trt plugin registry without trt lib
      
      * support trt4
      
      * refine code style
      
      * pick ea851796 from develop
      
      * cherry-pick develop PR  #26273 && #27796
      
      * fix unittest error
      
      * fix unittest error
      
      * remove const_cast
      Co-authored-by: NPei Yang <peiyang@baidu.com>
      78d68d59
  2. 07 8月, 2020 1 次提交
  3. 30 7月, 2020 1 次提交
    • Cherry-pick of lite engine, test=release/1.8 (#25817) · 45fa6861
      石晓伟 提交于
      * ignore warnings of external libraries, test=develop (#24193)
      
      * fix repeat definitions in liengine.cc, test=develop (#25020)
      
      * remove paddle_use_kernel and paddle_use_op. test=develop (#25189)
      
      * fix compile for lite subgraph. test=develop (#25285)
      
      * [CI] [Lite-Subgraph] CI add lite subgraph check. (#25346)
      
      * supports xpu runtime, test=develop (#25554)
      
      * fix cmake of lite, test=develop (#25680)
      
      * change commit files, test=release/1.8
      Co-authored-by: NWilber <jiweibo@baidu.com>
      45fa6861
  4. 09 4月, 2020 1 次提交
    • M
      Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a
      mozga-intel 提交于
      * Remove the NGraph engine from PDPD repository
      1. Each operator was removed from the operator's directory
      2. Each test was removed from the unittest directory
      3. The parallel executor support was removed from the PDPD
      4. The CMake file was removed from the PDPD
      5. The NG flags were removed from the repository
      test=develop
      
      * Remove ngraph from:
      1. Cmake file
      2. Python file
      test=develop
      3baaee9a
  5. 26 3月, 2020 1 次提交
    • Z
      [Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099
      Zhaolong Xing 提交于
      * add dynamic plugin support.
      test=develop
      
      * change emb eltwise layernorm to math function
      test=develop
      
      * add emb eltwise layernorm
      test=develop
      
      * can run dynamic shape ernie
      test=develop
      
      * fix ci
      test=develop
      
      * add ut for trt ernie dynamic
      
      test=develop
      
      * refine dynamic shape c++ interface.
      test=develop
      
      * fix comments
      test=develop
      
      * fix comments
      test=develop
      430b0099
  6. 09 3月, 2020 1 次提交
  7. 24 2月, 2020 1 次提交
  8. 23 2月, 2020 1 次提交
  9. 10 2月, 2020 1 次提交
  10. 04 2月, 2020 1 次提交
  11. 14 1月, 2020 1 次提交
  12. 09 1月, 2020 1 次提交
  13. 07 1月, 2020 1 次提交
  14. 03 1月, 2020 1 次提交
  15. 04 12月, 2019 1 次提交
  16. 26 11月, 2019 1 次提交
    • G
      Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972) · 234060f8
      GaoWei8 提交于
      * Add fc padding to solve mkl performance
      test=develop
      
      * fix gpu pass and error information
      test=develop
      
      * fix fc_fuse_pass_test
      test=develop
      
      * fix error information
      test=develop
      
      * fix error information
      test=develop
      
      * fix name and add fc op padding test
      test=develop
      
      * fix attributes
      test=develop
      
      * optimize fc padding
      test=develop
      
      * fix test
      test=develop
      234060f8
  17. 25 11月, 2019 1 次提交
  18. 23 10月, 2019 1 次提交
  19. 25 9月, 2019 1 次提交
  20. 21 9月, 2019 2 次提交
  21. 17 9月, 2019 1 次提交
  22. 04 9月, 2019 1 次提交
    • B
      Enable ngraph through build_strategy (#19266) · a3a4b6e5
      baojun 提交于
      * enable ngraph throught build_strategy test=develop
      
      * add unittest test=develop
      
      * put use_ngraph unconditional test=develop
      
      * remove paddle_enforce test=develop
      
      * remove paddle_enforce test=develop
      
      * fix copyright test=develop
      
      * limit for ngraph only test=develop
      a3a4b6e5
  23. 03 9月, 2019 1 次提交
    • Y
      A a pass to enable the use of cudnn (#19346) · c5548178
      Yiqun Liu 提交于
      * Add a interface to enable cudnn for inference.
      
      * Add cudnn_placement_pass.
      test=develop
      
      * Set the default value of cudnn_enabled_op_types to null.
      test=develop
      
      * Write the common basic class, placement_pass_base, to refine the codes.
      test=develop
      
      * Call EnableCUDNN in unittest.
      test=develop
      
      * Refine cudnn_placement_pass tester.
      
      * Enable the testing of cudnn_placement_pass in inference's unittest.
      test=develop
      
      * Add the check of op kernels.
      test=develop
      c5548178
  24. 19 8月, 2019 1 次提交
  25. 31 7月, 2019 1 次提交
    • Z
      Trt fp16 support (#18860) · 61238d31
      Zhaolong Xing 提交于
      * Fix Mask rcnn predictor
          1. refine memory optim algorithm to support the model with the block op.
          2. output diff : modify the affine channel fuse
          3. add condition_block_infer op
      add interface for setting trt calib table dir
      test=develop
      
      * add the missing files.
      test=develop
      
      * 1 add trt fp16 support
      test=develop
      61238d31
  26. 11 7月, 2019 1 次提交
  27. 08 7月, 2019 1 次提交
  28. 16 6月, 2019 1 次提交
  29. 11 6月, 2019 2 次提交
    • Update the Anakin interfaces for content-dnn and MLU (#17890) · bce259e5
      石晓伟 提交于
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
      bce259e5
    • Z
      Light mem reuse strategy for inference. (#17925) · 4e8d5a03
      Zhaolong Xing 提交于
      * fix: when use the load model from memory mode, the RAM occupy is high
      
      test=develop
      
      * ligth mem reuse
      test=develop
      
      * fix cpplint
      test=develop
      4e8d5a03
  30. 06 6月, 2019 1 次提交
  31. 29 5月, 2019 1 次提交
  32. 25 5月, 2019 1 次提交
    • Z
      TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc
      Zhaolong Xing 提交于
      * fluid int8 train and trt int8 predict align.
      trt int8 predict init
      op converter
      
      * 2. align fluid int8 train and trt int8 inference.
      enhance quant dequant fuse pass
      enhance op converter, trt engine, trt engine op, trt subgraph pass.
      
      * 3. add delete_quant_dequant_pass for trt
      
      test=develop
      
      * 4. add the missing file
      test=develop
      
      * 5. i modify the c++ interface, but forget to modify the pybind code
      fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
      test=develop
      61221ebc
  33. 16 5月, 2019 1 次提交
  34. 09 5月, 2019 1 次提交
    • Z
      fix: (#17279) · 7a3bb061
      Zhaolong Xing 提交于
      1. infernce multi card occupy
      2. facebox model inference occupy too much
      test=develop
      7a3bb061
  35. 07 5月, 2019 1 次提交
    • Cherry-pick benchmark related changes from release/1.4 (#17156) · a72dbe9a
      石晓伟 提交于
      * cherry-pick commit from 88770542
      
      * cherry-pick commit from 3f0b97df
      
      * cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn
      
      (cherry picked from commit 8643dbc2)
      
      * Cherry-Pick from 16662 : Anakin subgraph cpu support
      
      (cherry picked from commit 7ad182e1)
      
      * Cherry-pick from 1662, 16797.. : add anakin int8 support
      
      (cherry picked from commit e14ab180)
      
      * Cherry-pick from 16813 : change singleton to graph RegistBlock
      test=release/1.4
      
      (cherry picked from commit 4b9fa423)
      
      * Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2
      
      Support ShuffleNet and MobileNet-v2, test=release/1.4
      
      (cherry picked from commit a6fb066f)
      
      * Cherry-pick : anakin subgraph add opt config layout argument #16846
      test=release/1.4
      
      (cherry picked from commit 8121b3ec)
      
      * 1. add shuffle_channel_detect
      
      (cherry picked from commit 6efdea89)
      
      * update shuffle_channel op convert, test=release/1.4
      
      (cherry picked from commit e4726a06)
      
      * Modify symbol export rules
      
      test=develop
      a72dbe9a
  36. 29 3月, 2019 2 次提交
  37. 28 3月, 2019 1 次提交
    • N
      Anakin ssd support · d065b5bf
      nhzlx 提交于
      refine trt first run
      add quant dequant fuse pass
      omit simplify_anakin_priorbox_detection template
      omit transpose_flatten_concat_fuse template
      test=develop
      d065b5bf