1. 29 Oct, 2018 4 commits
    • S
      test=develop · f2eed667
      sneaxiy committed
      f2eed667
    • W
      [1.1] [project] train imagenet using large batch size (#13766) · 26200f2e
      Wu Yi committed
      * fix nccl2 lars dist support
      
      * put lars in momentum op
      
      * add tests lars
      
      * fix ci
      
      * fix cpu kernel
      
      * soft warning
      
      * remove lars in test_recognize_digits.py
      
      * move to another op
      
      * add file
      
      * update api.spec test=develop
      
      * update test=develop
      
      * fix api.spec test=develop
      
      * wip
      
      * wip, finish grad merge ops
      
      * wip, finish graph build
      
      * wip test running
      
      * work on 1 gpu
      
      * workable version
      
      * update
      
      * fix tests
      
      * fuse broadcast op
      
      * fix compile failed
      
      * refine
      
      * add batch merge test mnist
      
      * fix CI test=develop
      
      * fix build
      
      * use independent bn params for batch merge test=develop
      
      * update api.spec
      
      * follow comments and for test
      
      * wip
      
      * refine tests test=develop
      
      * follow comments test=develop
      
      * remove startup bn modify test=develop
      
      * follow comments test=develop
      
      * fix merge test=develop
      26200f2e
    • S
      test=develop · 2414f92f
      sneaxiy committed
      2414f92f
    • S
      move to pass · 45559d04
      sneaxiy committed
      test=develop
      45559d04
  2. 25 Oct, 2018 1 commit
  3. 21 Oct, 2018 1 commit
  4. 15 Oct, 2018 1 commit
  5. 12 Oct, 2018 1 commit
  6. 30 Sep, 2018 1 commit
  7. 25 Sep, 2018 5 commits
  8. 21 Sep, 2018 2 commits
  9. 20 Sep, 2018 4 commits
    • S
      enhance eager deletion · 0a36ef3c
      sneaxiy committed
      0a36ef3c
    • Y
      Revert "Revert "Merge pull request #13431 from chengduoZH/refine_lod"" · 6d2c6f96
      Yu Yang committed
      This reverts commit a6c8d6b9.
      6d2c6f96
    • Y
      Revert "Merge pull request #13431 from chengduoZH/refine_lod" · a6c8d6b9
      Yu Yang committed
      This reverts commit bd79e046, reversing
      changes made to 6b4d290c.
      a6c8d6b9
    • C
      Feature/op_fuse_pass (#12440) · d402234b
      chengduo committed
      * Add Preface
      
      * Add demo code
      
      * Save file
      
      * Refine code
      
      * seems can work
      
      * use elementwise strategy
      
      * Use ElementwiseComputeEx
      
      * Add comments
      
      * extract functions from operator
      
      * Refine code
      
      * Follow comment
      
      * code refine
      
      * add op_fuse  pass
      
      * add backward
      
      * code refine
      
      * use TopologySortOperations
      
      * follow comments
      
      * refine IsFusible
      
      * code enhance
      
      * fix op_fusion_pass
      
      * refine code
      
      * refine fuse_elemwise_act_op
      
      * adjust the input and output
      
      * refine logic
      
      * add intermediate_edge
      
      * disable inplace
      
      * follow comments
      
      * refine logic
      
      * follow comments
      
      * Remove the removable IntermediateOut
      
      * change strategy
      
      * code refine
      
      * enable fuse backward
      
      * code refine
      
      * code refine
      
      * rename unit test
      
      * follow comments
      d402234b
  10. 19 Sep, 2018 1 commit
  11. 17 Sep, 2018 3 commits
  12. 15 Sep, 2018 1 commit
  13. 14 Sep, 2018 1 commit
  14. 13 Sep, 2018 4 commits
  15. 12 Sep, 2018 1 commit
  16. 11 Sep, 2018 1 commit
  17. 10 Sep, 2018 2 commits
  18. 04 Sep, 2018 1 commit
  19. 31 Aug, 2018 3 commits
  20. 28 Aug, 2018 1 commit
    • W
      Refine dist rpc deps (#12899) · 0ee6fed0
      Wu Yi committed
      * refine dist train RPC deps
      
      * clean up
      
      * clean up
      
      * fix ut
      
      * remove input for fetch_barrier
      
      * follow comments
      0ee6fed0
  21. 23 Aug, 2018 1 commit
    • W
      Resovle multi gpu async deps (#12828) · b8da70c3
      Wu Yi committed
      * dist transpiler add control dependency var between send and recv
      
      * fix async deps
      
      * follow comments and refine
      
      * fix deps connect for rpc ops
      b8da70c3