1. 31 Jul, 2019 1 commit
    • Trt fp16 support (#18860) · 61238d31
      Zhaolong Xing committed
      * Fix Mask rcnn predictor
          1. refine memory optim algorithm to support the model with the block op.
          2. output diff : modify the affine channel fuse
          3. add condition_block_infer op
      add interface for setting trt calib table dir
      test=develop
      
      * add the missing files.
      test=develop
      
      * 1 add trt fp16 support
      test=develop
  2. 11 Jul, 2019 1 commit
  3. 08 Jul, 2019 1 commit
  4. 06 Jun, 2019 2 commits
    • ae576f3c
    • INT8 MKL-DNN v2 integrate to slim (#17634) · 993c703b
      翟飞跃 committed
      * refactor PR 16865
      
      * delete mergetool files
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * create dir for int8 model before calling SaveOptimModel
      
      * test=develop
      
      * mkldnn int8 only supports linux; test=develop
      
      * refine code; test=develop
      
      * remove comment; test=develop
      
      * refine code; test=develop
      
      * fix bug; test=develop
      
      * add exception for mkldnn_post_training_strategy
      
      * reuse int8v2 CAPI dataset; test=develop
      
      * fix accuracy check bug; test=develop
      
      * remove tab
      
      * convert files to unix format
      
      * test=develop
      
      * reduce CI time;test=develop
      
      * reduce CI time and refine code;test=develop
      
      * refine comment; test=develop
      
      * add cmake FLAGS;test=develop
      
      * remove predict_num;test=develop
  5. 29 May, 2019 1 commit
  6. 25 May, 2019 1 commit
    • TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc
      Zhaolong Xing committed
      * fluid int8 train and trt int8 predict align.
      trt int8 predict init
      op converter
      
      * 2. align fluid int8 train and trt int8 inference.
      enhance quant dequant fuse pass
      enhance op converter, trt engine, trt engine op, trt subgraph pass.
      
      * 3. add delete_quant_dequant_pass for trt
      
      test=develop
      
      * 4. add the missing file
      test=develop
      
      * 5. I modified the C++ interface but forgot to update the pybind code
      fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
      test=develop
  7. 07 May, 2019 1 commit
    • Cherry-pick benchmark related changes from release/1.4 (#17156) · a72dbe9a
      石晓伟 committed
      * cherry-pick commit from 88770542
      
      * cherry-pick commit from 3f0b97df
      
      * cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn
      
      (cherry picked from commit 8643dbc2)
      
      * Cherry-Pick from 16662 : Anakin subgraph cpu support
      
      (cherry picked from commit 7ad182e1)
      
      * Cherry-pick from 1662, 16797.. : add anakin int8 support
      
      (cherry picked from commit e14ab180)
      
      * Cherry-pick from 16813 : change singleton to graph RegistBlock
      test=release/1.4
      
      (cherry picked from commit 4b9fa423)
      
      * Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2
      
      Support ShuffleNet and MobileNet-v2, test=release/1.4
      
      (cherry picked from commit a6fb066f)
      
      * Cherry-pick : anakin subgraph add opt config layout argument #16846
      test=release/1.4
      
      (cherry picked from commit 8121b3ec)
      
      * 1. add shuffle_channel_detect
      
      (cherry picked from commit 6efdea89)
      
      * update shuffle_channel op convert, test=release/1.4
      
      (cherry picked from commit e4726a06)
      
      * Modify symbol export rules
      
      test=develop
  8. 29 Mar, 2019 1 commit
  9. 28 Mar, 2019 1 commit
  10. 20 Mar, 2019 4 commits
  11. 19 Mar, 2019 2 commits
  12. 15 Mar, 2019 1 commit
  13. 13 Mar, 2019 1 commit
  14. 08 Mar, 2019 1 commit
    • cannot pass CI · 2891070c
      nhzlx committed
      add if use static engine for trt
      test=develop
  15. 07 Mar, 2019 1 commit
    • cannot pass CI · a9ed4277
      nhzlx committed
      add if use static engine for trt
      test=develop
  16. 21 Feb, 2019 1 commit
  17. 31 Jan, 2019 1 commit
  18. 29 Jan, 2019 1 commit
  19. 26 Jan, 2019 1 commit
  20. 21 Jan, 2019 1 commit
  21. 16 Jan, 2019 1 commit
  22. 09 Jan, 2019 1 commit
  23. 08 Jan, 2019 1 commit
  24. 07 Jan, 2019 1 commit
  25. 26 Dec, 2018 1 commit
  26. 08 Dec, 2018 1 commit
  27. 06 Dec, 2018 2 commits
  28. 23 Nov, 2018 1 commit
  29. 15 Nov, 2018 1 commit
    • Refine tester of TensorRT engine (#14390) · 9e6b1c5f
      Yiqun Liu committed
      * Refine the tester for MixedRTPredictor.
      test=develop
      
      * Enable the profiler in TensorRT engine.
      
      * Support the use of combined inference model in TensorRT unittest, and print the shape of feed targets.
  30. 14 Nov, 2018 1 commit