1. 24 7月, 2019 1 次提交
    • Z
      Update trt5 for paddle-trt (#18645) · 26ae6d49
      Zhaolong Xing 提交于
      * update paddle-trt for:
          1. fix bug: when batch > 2, core in split plugin.
          2. add leaky_relu trt5.0 support (yolov3 from 65ms to 42ms.)
          3. add new attr to dropout.
          4. shuffle channel, swish, relu6 support
          test=develop
      
      * 1. fix ci
      test=develop
      26ae6d49
  2. 17 7月, 2019 2 次提交
    • G
      remove async executor and add data_feed.proto to the deps of train demo (#18659) · d714bf03
      guru4elephant 提交于
      * remove async executor and add data_feed.proto to the deps of train demo
      d714bf03
    • Fix Bitmain Predictor::Clone() (#18599) · 25d80791
      石晓伟 提交于
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
      
      * load model from buffer with length
      
      test=develop
      
      * modify the access level of class
      
      test=develop
      
      * support anakin for bitmain arch
      
      test=develop
      
      * remove files
      
      * checkout cmakelists
      
      test=develop
      
      * modify interfaces
      
      test=develop
      
      * add cmake dependments
      
      test=develop
      
      * enforce the outputs of net
      
      test=develop
      25d80791
  3. 11 7月, 2019 1 次提交
  4. 09 7月, 2019 1 次提交
  5. 08 7月, 2019 4 次提交
    • Z
      Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532) · 88b52a27
      Zhaolong Xing 提交于
      * Fix Mask rcnn predictor
          1. refine memory optim algorithm to support the model with the block op.
          2. output diff : modify the affine channel fuse
          3. add condition_block_infer op
      add interface for setting trt calib table dir
      test=develop
      
      * add the missing files.
      test=develop
      88b52a27
    • Support Bitmain Anakin (#18542) · 15291548
      石晓伟 提交于
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
      
      * load model from buffer with length
      
      test=develop
      
      * modify the access level of class
      
      test=develop
      
      * support anakin for bitmain arch
      
      test=develop
      
      * remove files
      
      * checkout cmakelists
      
      test=develop
      15291548
    • L
    • T
      add mkldnn shapeblob cache clear strategy (#18513) · fe32879d
      Tao Luo 提交于
      * add mkldnn shapeblob cache clear strategy
      
      test=develop
      
      * refine with comments
      
      test=develop
      
      * make cache clear strategy more safey
      
      test=develop
      
      * add lock for GetShapeBlobSize
      
      test=develop
      fe32879d
  6. 05 7月, 2019 1 次提交
  7. 03 7月, 2019 2 次提交
  8. 02 7月, 2019 1 次提交
  9. 01 7月, 2019 1 次提交
    • M
      Fix Pooling output scale (#18186) · 7023a86c
      Michał Gallus 提交于
      * Int8: Fix Pooling output scale
      
      test=develop
      
      * Update scales quantization for certain operators
      
      These include: concat, transpose, pool and reshape. test=develop
      
      * Move concat minimum scale finding to quantizer
      
      test=develop
      7023a86c
  10. 27 6月, 2019 3 次提交
  11. 21 6月, 2019 1 次提交
  12. 19 6月, 2019 3 次提交
  13. 16 6月, 2019 2 次提交
  14. 14 6月, 2019 1 次提交
    • L
      add Mobilienet ssd int8 analyzer tester (#18075) · 46625415
      lidanqing 提交于
      * add pascalvoc preprocess script and mobilenet-ssd analyzer_tester, wait 17737
      
      * change converting local dataset to downloading and converting tarfile
      test=develop
      
      * change the test data_path
      test=develop
      
      * change copyright (c) 2016 to copyright (c) 2019
      test=develop
      46625415
  15. 13 6月, 2019 2 次提交
  16. 12 6月, 2019 1 次提交
  17. 11 6月, 2019 2 次提交
    • Update the Anakin interfaces for content-dnn and MLU (#17890) · bce259e5
      石晓伟 提交于
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
      bce259e5
    • Z
      Light mem reuse strategy for inference. (#17925) · 4e8d5a03
      Zhaolong Xing 提交于
      * fix: when use the load model from memory mode, the RAM occupy is high
      
      test=develop
      
      * ligth mem reuse
      test=develop
      
      * fix cpplint
      test=develop
      4e8d5a03
  18. 06 6月, 2019 4 次提交
  19. 03 6月, 2019 1 次提交
  20. 29 5月, 2019 3 次提交
  21. 28 5月, 2019 2 次提交
    • L
      Improve mobilenetv2 INT8 performance by using INT8 relu as post-op (#17570) · 04b6c29e
      lidanqing 提交于
      * add INT8 conv+relu6 fuse and enbale mobilentv2 INT8 test
      test=develop
      
      * change fasle and 0.0 to fuse_brelu and brelu_threshold
      test=develop
      
      change the "fuse_relu||fuse_brelu" to "unsigned_output"
      test=develop
      
      * Use relu instead of brelu as INT8 post-op because INT8 brelu is not enabled in mkldnn v0.18
      test=develop
      
      * continuous-integration fix
      test=develop
      04b6c29e
    • J
      [MKL-DNN] conv_transpose mkldnn bias pass (#17644) · 6d8075ec
      Jacek Czaja 提交于
      * - changes to graph detector
      
      - Changes to pass
      
      - Added ut for new pass
      
      - use_pass
      
      - Added pass to mkldnn passes
      
      - fix to registration
      
      - improved verbose messaging for conv bias passes
      
      - Lint fixes
      
      test=develop
      
      * - Lint fixes
      
      test=develop
      6d8075ec
  22. 27 5月, 2019 1 次提交
    • S
      add Concat quantization (#17448) · 96845d21
      Sylwester Fraczek 提交于
      * add Concat quantization
      add unit test for quantizing concat
      fix for wrong value when the input is not in map of calculated scales
      add use_quantizer to concat_op.cc
      add scale_algo rules for concat
      
      test=develop
      
      * missing fix for multiple inputs quantize-squash
      
      * wojtuss review fix: adding comment
      
      test=develop
      96845d21