1. 29 10月, 2018 2 次提交
    • W
      [1.1] [project] train imagenet using large batch size (#13766) test=release/1.1 · cb274159
      Wu Yi 提交于
      * fix nccl2 lars dist support
      
      * put lars in momentum op
      
      * add tests lars
      
      * fix ci
      
      * fix cpu kernel
      
      * soft warning
      
      * remove lars in test_recognize_digits.py
      
      * move to another op
      
      * add file
      
      * update api.spec test=develop
      
      * update test=develop
      
      * fix api.spec test=develop
      
      * wip
      
      * wip, finish grad merge ops
      
      * wip, finish graph build
      
      * wip test running
      
      * work on 1 gpu
      
      * workable version
      
      * update
      
      * fix tests
      
      * fuse broadcast op
      
      * fix compile failed
      
      * refine
      
      * add batch merge test mnist
      
      * fix CI test=develop
      
      * fix build
      
      * use independent bn params for batch merge test=develop
      
      * update api.spec
      
      * follow comments and for test
      
      * wip
      
      * refine tests test=develop
      
      * follow comments test=develop
      
      * remove startup bn modify test=develop
      
      * follow comments test=develop
      
      * fix merge test=develop
      cb274159
    • X
      Merge pull request #14103 from jacquesqiao/cpu-for-1.1-merge-with-shape · 415f5006
      Xin Pan 提交于
      test=release/1.1
      
      [1.1] Cpu for 1.1 merge with shape
      415f5006
  2. 28 10月, 2018 1 次提交
    • W
      [1.1] fix graph num hang (#14072) · 9da9b192
      Wu Yi 提交于
      * fix graph num hang test=develop
      
      * re-enable tests test=develop
      
      * re-enable graph num check test=develop
      
      * fix multi device pass role check test=develop
      9da9b192
  3. 25 10月, 2018 1 次提交
  4. 23 10月, 2018 1 次提交
  5. 21 10月, 2018 1 次提交
  6. 15 10月, 2018 2 次提交
  7. 30 9月, 2018 1 次提交
  8. 29 9月, 2018 1 次提交
  9. 27 9月, 2018 1 次提交
    • C
      Add GraphChecker (#13580) · 5175b3cb
      chengduo 提交于
      * add GraphNum
      
      test=develop
      
      * add graph number check in parallelExecutor
      
      test=develop
      
      * fix transformer_model bug
      
      test=develop
      
      * fix graph num
      5175b3cb
  10. 25 9月, 2018 1 次提交
  11. 20 9月, 2018 1 次提交
    • C
      Feature/op_fuse_pass (#12440) · d402234b
      chengduo 提交于
      * Add Preface
      
      * Add demo code
      
      * Save file
      
      * Refine code
      
      * seems can work
      
      * use elementwise strategy
      
      * Use ElementwiseComputeEx
      
      * Add comments
      
      * extract functions from operator
      
      * Refine code
      
      * Follow comment
      
      * code refine
      
      * add op_fuse  pass
      
      * add backward
      
      * code refine
      
      * use TopologySortOperations
      
      * follow comments
      
      * refine IsFusible
      
      * code enhance
      
      * fix op_fusion_pass
      
      * refine code
      
      * refine fuse_elemwise_act_op
      
      * adjust the input and output
      
      * refine logic
      
      * add intermediate_edge
      
      * disable inplace
      
      * follow comments
      
      * refine logic
      
      * follow comments
      
      * Remove the removable IntermediateOut
      
      * change strategy
      
      * code refine
      
      * enable fuse backward
      
      * code refine
      
      * code refine
      
      * rename unit test
      
      * follow comments
      d402234b
  12. 17 9月, 2018 2 次提交
  13. 15 9月, 2018 1 次提交
  14. 10 9月, 2018 2 次提交
  15. 14 8月, 2018 1 次提交
  16. 09 8月, 2018 1 次提交
  17. 27 7月, 2018 1 次提交
  18. 26 7月, 2018 5 次提交
  19. 22 7月, 2018 1 次提交
  20. 18 7月, 2018 5 次提交
  21. 15 7月, 2018 1 次提交
  22. 13 7月, 2018 1 次提交
    • C
      Refine multi thread cpu parallel exe (#11406) · 86b0a725
      chengduo 提交于
      * refine multi-thread CPU Parallel exe
      
      * refine multi thread CPU Parallel exe
      
      * Refine CPU version for ParallelExecutor
      
      * add share_parameter_between_cards_
      
      * Fix ParallelExecutor bug
      
      * Fix unit test
      
      * Fix parameter opt balance
      
      * Fix with opti (param->grad)
      
      * Add grad to op var
      
      * Remove shard_param_between_cards
      86b0a725
  23. 12 7月, 2018 2 次提交
  24. 29 6月, 2018 1 次提交
  25. 28 6月, 2018 1 次提交
  26. 26 6月, 2018 2 次提交