1. 11 10月, 2019 1 次提交
  2. 10 10月, 2019 2 次提交
  3. 30 9月, 2019 1 次提交
  4. 28 9月, 2019 2 次提交
  5. 26 9月, 2019 1 次提交
  6. 25 9月, 2019 1 次提交
    • W
      Add support for new QAT models (#18970) · 4286a627
      Wojciech Uss 提交于
      * Add support for new QAT models
      
      test=develop
      Co-Authored-By: NMichał Gallus <michal.gallus@intel.com>
      Co-Authored-By: NWojciech Uss <wojciech.uss@intel.com>
      
      * fixed fps results
      
      test=develop
      
      * fix top5 accuracy drop problem
      
      * updated for new QAT models
      
      * skip quantizing average pooling - dirty but working
      
      * add missing pass
      
      * added missing conv+brelu fuse pass
      
      * removed a call to non-existent pass
      
      test=develop
      
      * renamed pass
      
      test=develop
      
      * Adjust finding pooling scale to newest QAT models
      
      * Remove unnecessary code from quantization_mkldnn_pass
      
      * Copy Pooling input scale to output scale in QAT
      
      * Refactor & remove unused code in QAT
      
      * Incorporate fp32 FC into QAT
      
      test=develop
      
      * Enable graph drawing with debug flag
      
      test=develop
      
      * Add tests for QATv2
      
      * Fix paths for QATv2 models
      
      test=develop
      
      * Add option to save transformed int8 qat model
      
      test=develop
      
      * Remove redundant lines from qat mkldnn pass
      
      test=develop
      
      * Delegate disablement of avg pooling to qat
      
      test=develop
      
      * fix CI bug, test=develop
      
      * Follow Wangzhen's Review, test=develop
      
      * Update API.spec
      
      test=develop
      
      * Name False in (is_unsigned, TensorScale) tuple
      
      test=develop
      4286a627
  7. 24 9月, 2019 2 次提交
    • J
      add optimizer:dpsgd,test=develop (#19915) · 766bd529
      jhjiangcs 提交于
      766bd529
    • W
      [PaddleSlim] Enhence compressor api in PaddleSlim (#19894) · bdb3e376
      whs 提交于
      
      1. Support customize eval function instead of eval program.
      2. Fix loading checkpoint in quantization strategy.
      3. Support saving eval model when saving a checkpoint.
      4. Fix decoder of loading context in PaddleSlim.
      5. Fix restoring from the checkpoint of uniform prune strategy.
      6. Support saving eval model and infer model during training.
      7. Add ‘unitest’ for saving eval model, saving infer model and uniform pruning restoring from the checkpoint.
      8. Fix pruning of depthwise_conv_grad op by updating the groups.
      bdb3e376
  8. 23 9月, 2019 2 次提交
  9. 21 9月, 2019 1 次提交
  10. 19 9月, 2019 1 次提交
  11. 18 9月, 2019 1 次提交
  12. 11 9月, 2019 1 次提交
  13. 10 9月, 2019 1 次提交
  14. 06 9月, 2019 2 次提交
  15. 05 9月, 2019 1 次提交
  16. 03 9月, 2019 2 次提交
  17. 31 8月, 2019 1 次提交
  18. 29 8月, 2019 1 次提交
  19. 26 8月, 2019 2 次提交
  20. 23 8月, 2019 2 次提交
    • J
      Update Light-NAS to support latency-aware search (#19050) · 1799c257
      Jiancheng Li 提交于
      * update light_nas_strategy: add latency constraint
      
      test=develop
      
      * update light_nas_strategy: update get_model_latency
      
      test=develop
      
      * update light_nas_strategy: add more check
      
      test=develop
      
      * update light_nas test
      
      test=develop
      
      * update light_nas test
      
          test=develop
      
      * minor update light_nas test
      
          test=develop
      
      * minor update light_nas test
      
      test=develop
      
      * update light_nas test
      
      test=develop
      
      * update _constrain_func of light_nas_strategy
      
      test=develop
      
      * update _constrain_func of light_nas_strategy
      
      test=develop
      
      * remove unused code
      
      test=develop
      1799c257
    • Z
      Add the max-pool2d quantization support and the partial quantization support. (#19310) · 0fe72469
      Zhen Wang 提交于
      * add pool2d quantization support, only for max-pooling.
      
      * add the partial quantization support.
      0fe72469
  21. 16 8月, 2019 1 次提交
  22. 11 8月, 2019 1 次提交
    • Y
      add save cache model api in fleet& add slots shuffle in dataset module & add... · 9150cf50
      yaoxuefeng 提交于
      add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
      
      * add ctr related metric layer test=develop
      
      * add save cache and slots shuffle test=develop
      
      * add save cache and slots shuffle test=develop
      
      * fix error
      
      * fix error
      
      * fix style for ci
      
      * fix for comments
      
      * change SlotsShuffle input to std::strinf for generality
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix stylr
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * change non-const reference to pointer
      
      * fix style
      
      * fix style
      
      * fix style test=develop
      
      * fix style  test=develop
      
      * add return ins num in ctr metric op
      
      * change dtype to float in metric_op.py
      
      * fix error test=develop
      
      * fix style test=develop
      
      * fix API spec
      
      * fix API spec
      
      * fix API spec test=develop
      
      * add UT test=develop
      9150cf50
  23. 05 8月, 2019 1 次提交
  24. 01 8月, 2019 1 次提交
  25. 22 7月, 2019 1 次提交
  26. 18 7月, 2019 1 次提交
  27. 09 7月, 2019 1 次提交
  28. 28 6月, 2019 2 次提交
  29. 27 6月, 2019 1 次提交
  30. 26 6月, 2019 1 次提交
  31. 25 6月, 2019 1 次提交
    • H
      Sequence mask support tensor (#18249) · df2eee71
      Hongyu Liu 提交于
      * sequnce mask support max length tensor input; test=develop
      
      * add rnn_impl.py; test=develop
      
      * add basic gru lstm unittest; test=develop
      
      * fix api spec; test=develop
      
      * fix sequence_mask op bug;
      test=develop
      test=document_preview
      
      * change +-*x to elmentwise_op; test=develop
      
      * add mkl flag; test=develop
      
      * fix rnn impl bug; test=develop
      
      * update api spec; test=develop
      
      * fix doc bug; test=develop
      
      * fix lstm bugs; test=develop
      df2eee71