1. 17 9月, 2019 1 次提交
  2. 03 9月, 2019 1 次提交
    • Y
      A a pass to enable the use of cudnn (#19346) · c5548178
      Yiqun Liu 提交于
      * Add a interface to enable cudnn for inference.
      
      * Add cudnn_placement_pass.
      test=develop
      
      * Set the default value of cudnn_enabled_op_types to null.
      test=develop
      
      * Write the common basic class, placement_pass_base, to refine the codes.
      test=develop
      
      * Call EnableCUDNN in unittest.
      test=develop
      
      * Refine cudnn_placement_pass tester.
      
      * Enable the testing of cudnn_placement_pass in inference's unittest.
      test=develop
      
      * Add the check of op kernels.
      test=develop
      c5548178
  3. 31 7月, 2019 1 次提交
    • Z
      Trt fp16 support (#18860) · 61238d31
      Zhaolong Xing 提交于
      * Fix Mask rcnn predictor
          1. refine memory optim algorithm to support the model with the block op.
          2. output diff : modify the affine channel fuse
          3. add condition_block_infer op
      add interface for setting trt calib table dir
      test=develop
      
      * add the missing files.
      test=develop
      
      * 1 add trt fp16 support
      test=develop
      61238d31
  4. 11 7月, 2019 1 次提交
  5. 08 7月, 2019 1 次提交
  6. 06 6月, 2019 2 次提交
    • Z
      ae576f3c
    • INT8 MKL-DNN v2 integrate to slim (#17634) · 993c703b
      翟飞跃 提交于
      * refactor PR 16865
      
      * delete mergetool files
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * create dir for int8 model before call SaveOptimModel
      
      * test=develop
      
      * mkldnn int8 only support linux; test=develop
      
      * refine code; test=develop
      
      * remove comment; test=develop
      
      * refine code; test=develop
      
      * fix bug; test=develop
      
      * add exception for mkldnn_post_training_strategy
      
      * reuse int8v2 CAPI dataset; test=develop
      
      * fix accuracy check bug; test=develop
      
      * remove tab
      
      * convert files to unix format
      
      * test=develop
      
      * reduce CI time;test=develop
      
      * reduce CI time and refine code;test=develop
      
      * refine comment; test=develop
      
      * add cmake FLAGS;test=develop
      
      * remove predict_num;test=develop
      993c703b
  7. 29 5月, 2019 1 次提交
  8. 25 5月, 2019 1 次提交
    • Z
      TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc
      Zhaolong Xing 提交于
      * fluid int8 train and trt int8 predict align.
      trt int8 predict init
      op converter
      
      * 2. align fluid int8 train and trt int8 inference.
      enhance quant dequant fuse pass
      enhance op converter, trt engine, trt engine op, trt subgraph pass.
      
      * 3. add delete_quant_dequant_pass for trt
      
      test=develop
      
      * 4. add the missing file
      test=develop
      
      * 5. i modify the c++ interface, but forget to modify the pybind code
      fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
      test=develop
      61221ebc
  9. 07 5月, 2019 1 次提交
    • Cherry-pick benchmark related changes from release/1.4 (#17156) · a72dbe9a
      石晓伟 提交于
      * cherry-pick commit from 88770542
      
      * cherry-pick commit from 3f0b97df
      
      * cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn
      
      (cherry picked from commit 8643dbc2)
      
      * Cherry-Pick from 16662 : Anakin subgraph cpu support
      
      (cherry picked from commit 7ad182e1)
      
      * Cherry-pick from 1662, 16797.. : add anakin int8 support
      
      (cherry picked from commit e14ab180)
      
      * Cherry-pick from 16813 : change singleton to graph RegistBlock
      test=release/1.4
      
      (cherry picked from commit 4b9fa423)
      
      * Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2
      
      Support ShuffleNet and MobileNet-v2, test=release/1.4
      
      (cherry picked from commit a6fb066f)
      
      * Cherry-pick : anakin subgraph add opt config layout argument #16846
      test=release/1.4
      
      (cherry picked from commit 8121b3ec)
      
      * 1. add shuffle_channel_detect
      
      (cherry picked from commit 6efdea89)
      
      * update shuffle_channel op convert, test=release/1.4
      
      (cherry picked from commit e4726a06)
      
      * Modify symbol export rules
      
      test=develop
      a72dbe9a
  10. 29 3月, 2019 1 次提交
  11. 28 3月, 2019 1 次提交
  12. 20 3月, 2019 4 次提交
  13. 19 3月, 2019 2 次提交
  14. 15 3月, 2019 1 次提交
  15. 13 3月, 2019 1 次提交
  16. 08 3月, 2019 1 次提交
    • N
      cant not pass ci · 2891070c
      nhzlx 提交于
      add if use static engine for trt
      test=develop
      2891070c
  17. 07 3月, 2019 1 次提交
    • N
      cant not pass ci · a9ed4277
      nhzlx 提交于
      add if use static engine for trt
      test=develop
      a9ed4277
  18. 21 2月, 2019 1 次提交
  19. 31 1月, 2019 1 次提交
  20. 29 1月, 2019 1 次提交
  21. 26 1月, 2019 1 次提交
  22. 21 1月, 2019 1 次提交
  23. 16 1月, 2019 1 次提交
  24. 09 1月, 2019 1 次提交
  25. 08 1月, 2019 1 次提交
  26. 07 1月, 2019 1 次提交
  27. 26 12月, 2018 1 次提交
  28. 08 12月, 2018 1 次提交
  29. 06 12月, 2018 2 次提交
  30. 23 11月, 2018 1 次提交
  31. 15 11月, 2018 1 次提交
    • Y
      Refine tester of TensorRT engine (#14390) · 9e6b1c5f
      Yiqun Liu 提交于
      * Refine the tester for MixedRTPredictor.
      test=develop
      
      * Enable the profiler in TensorRT engine.
      
      * Support the use of combined inference model in TensorRT unittest, and print the shape of feed targets.
      9e6b1c5f
  32. 14 11月, 2018 1 次提交