1. 20 11月, 2018 1 次提交
  2. 08 11月, 2018 1 次提交
  3. 07 11月, 2018 1 次提交
  4. 06 11月, 2018 1 次提交
  5. 31 10月, 2018 1 次提交
  6. 29 10月, 2018 1 次提交
    • W
      [1.1] [project] train imagenet using large batch size (#13766) · 26200f2e
      Wu Yi 提交于
      * fix nccl2 lars dist support
      
      * put lars in momentum op
      
      * add tests lars
      
      * fix ci
      
      * fix cpu kernel
      
      * soft warning
      
      * remove lars in test_recognize_digits.py
      
      * move to another op
      
      * add file
      
      * update api.spec test=develop
      
      * update test=develop
      
      * fix api.spec test=develop
      
      * wip
      
      * wip, finish grad merge ops
      
      * wip, finish graph build
      
      * wip test running
      
      * work on 1 gpu
      
      * workable version
      
      * update
      
      * fix tests
      
      * fuse broadcast op
      
      * fix compile failed
      
      * refine
      
      * add batch merge test mnist
      
      * fix CI test=develop
      
      * fix build
      
      * use independent bn params for batch merge test=develop
      
      * update api.spec
      
      * follow comments and for test
      
      * wip
      
      * refine tests test=develop
      
      * follow comments test=develop
      
      * remove startup bn modify test=develop
      
      * follow comments test=develop
      
      * fix merge test=develop
      26200f2e
  7. 28 10月, 2018 1 次提交
    • W
      [1.1] fix graph num hang (#14072) · 9da9b192
      Wu Yi 提交于
      * fix graph num hang test=develop
      
      * re-enable tests test=develop
      
      * re-enable graph num check test=develop
      
      * fix multi device pass role check test=develop
      9da9b192
  8. 27 10月, 2018 1 次提交
  9. 25 10月, 2018 2 次提交
  10. 23 10月, 2018 1 次提交
  11. 21 10月, 2018 1 次提交
  12. 15 10月, 2018 2 次提交
  13. 30 9月, 2018 1 次提交
  14. 29 9月, 2018 1 次提交
  15. 27 9月, 2018 1 次提交
    • C
      Add GraphChecker (#13580) · 5175b3cb
      chengduo 提交于
      * add GraphNum
      
      test=develop
      
      * add graph number check in parallelExecutor
      
      test=develop
      
      * fix transformer_model bug
      
      test=develop
      
      * fix graph num
      5175b3cb
  16. 25 9月, 2018 1 次提交
  17. 20 9月, 2018 1 次提交
    • C
      Feature/op_fuse_pass (#12440) · d402234b
      chengduo 提交于
      * Add Preface
      
      * Add demo code
      
      * Save file
      
      * Refine code
      
      * seems can work
      
      * use elementwise strategy
      
      * Use ElementwiseComputeEx
      
      * Add comments
      
      * extract functions from operator
      
      * Refine code
      
      * Follow comment
      
      * code refine
      
      * add op_fuse  pass
      
      * add backward
      
      * code refine
      
      * use TopologySortOperations
      
      * follow comments
      
      * refine IsFusible
      
      * code enhance
      
      * fix op_fusion_pass
      
      * refine code
      
      * refine fuse_elemwise_act_op
      
      * adjust the input and output
      
      * refine logic
      
      * add intermediate_edge
      
      * disable inplace
      
      * follow comments
      
      * refine logic
      
      * follow comments
      
      * Remove the removable IntermediateOut
      
      * change strategy
      
      * code refine
      
      * enable fuse backward
      
      * code refine
      
      * code refine
      
      * rename unit test
      
      * follow comments
      d402234b
  18. 17 9月, 2018 2 次提交
  19. 15 9月, 2018 1 次提交
  20. 10 9月, 2018 2 次提交
  21. 14 8月, 2018 1 次提交
  22. 09 8月, 2018 1 次提交
  23. 27 7月, 2018 1 次提交
  24. 26 7月, 2018 5 次提交
  25. 22 7月, 2018 1 次提交
  26. 18 7月, 2018 5 次提交
  27. 15 7月, 2018 1 次提交
  28. 13 7月, 2018 1 次提交
    • C
      Refine multi thread cpu parallel exe (#11406) · 86b0a725
      chengduo 提交于
      * refine multi-thread CPU Parallel exe
      
      * refine multi thread CPU Parallel exe
      
      * Refine CPU version for ParallelExecutor
      
      * add share_parameter_between_cards_
      
      * Fix ParallelExecutor bug
      
      * Fix unit test
      
      * Fix parameter opt balance
      
      * Fix with opti (param->grad)
      
      * Add grad to op var
      
      * Remove shard_param_between_cards
      86b0a725