1. 26 3月, 2019 1 次提交
  2. 22 3月, 2019 2 次提交
    • C
      [Speed]Refine ParallelExecutor (#16190) · a6a3b2fb
      chengduo 提交于
      * refine parallelExecutor
      test=develop
      
      * Polish op_handle
      test=develop
      
      * Remove unnecessary op_handle
      test=develop
      
      * Fix Travis CI
      test=develop
      
      * Fix fetch bug
      test=develop
      
      * Remove WaitInputVarGenerated
      
      * Fix OpHandleBase::Run
      test=develop
      
      * debug
      test=develop
      
      * use origin fetch_op_handle
      test=develop
      
      * Revert op_handle_base.cc
      test=develop
      
      * Polish code
      test=develop
      
      * Fix OpHandleBase::Run
      test=develop
      
      * code refine
      
      * test CI and CE
      test=develop
      
      * fix OpHandle::Run
      test=develop
      
      * refine AllReduceOpHandle
      test=develop
      
      * Polish code
      test=develop
      a6a3b2fb
    • C
      Add unit test for fuse all reduce (#16354) · 33965527
      chengduo 提交于
      * refine fused_all_reduce_op
      
      * add unit test in test_parallel_executor_seresnext
      test=develop
      33965527
  3. 20 3月, 2019 2 次提交
    • C
      Fuse AllReduce (#15921) · f26ba5bd
      chengduo 提交于
      * fuse all_reduce
      test=develop
      
      * add fuse_parameter_groups_size
      test=develop
      
      * Polish code
      test=develop
      
      * Fix travis-ci
      test=develop
      
      * Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize
      test=develop
      
      * Add SetGroupAccordingToMemorySize
      test=develop
      
      * fix multi_devices_graph
      test=develop
      
      * reset params_grads
      test=develop
      
      * Polish code
      test=develop
      f26ba5bd
    • W
      Collective ops (#15572) · 6382b62f
      Wu Yi 提交于
      * wip allreduce in op
      
      * wip
      
      * wip
      
      * wip
      
      * wip adding test
      
      * wip for conflict with mp mode
      
      * fix tests test=develop
      
      * fix cpu build test=develop
      
      * fix travis clang format test=develop
      
      * fix cpu build test=develop
      
      * update api.spec test=develop
      
      * delete comment test=develop
      
      * fix cpplint test=develop
      
      * fix test=develop
      
      * follow comment test=develop
      
      * add file test=develop
      
      * fix build test=develop
      
      * update test=develop
      
      * to be compatible with sync_bn, and fix mp mode in develop test=develop
      6382b62f
  4. 19 3月, 2019 1 次提交
  5. 18 3月, 2019 2 次提交
  6. 15 3月, 2019 3 次提交
  7. 14 3月, 2019 1 次提交
  8. 12 3月, 2019 1 次提交
  9. 07 3月, 2019 2 次提交
  10. 06 3月, 2019 4 次提交
  11. 05 3月, 2019 2 次提交
    • S
      enhance gc · 597dc65e
      sneaxiy 提交于
      test=develop
      597dc65e
    • L
      add IfElse test case for ir memory optimize (#15998) · caadd058
      liuwei1031 提交于
      * add ir memory optimize test case for IfElse op, test=develop
      
      * fix some unitttest failure by force using the python memory_optimize, test=develop
      
      * tweak comments, test=develop
      
      * fix unittest, test=develop
      
      * fix unittest, test=develop
      caadd058
  12. 04 3月, 2019 1 次提交
  13. 01 3月, 2019 1 次提交
  14. 25 2月, 2019 1 次提交
  15. 23 2月, 2019 1 次提交
  16. 22 2月, 2019 7 次提交
  17. 21 2月, 2019 3 次提交
    • X
      allow compiler to use graph · 26e32e09
      Xin Pan 提交于
      test=develop
      26e32e09
    • Q
      fix use gpu test=develop · 62f1248f
      Qiao Longfei 提交于
      62f1248f
    • D
      Profiler refine and add CUDA runtime api tracer (#15301) · a83e4704
      Dun 提交于
      * refine profiler && add runtime tracer
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix bug && test=develop
      
      * add thread id map && test=develop
      
      * test=develop
      
      * testing
      
      * bug fix
      
      * remove cuda event && refine code && test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix windows temp file && test=develop
      
      * test=develop
      
      * fix windows bug && test=develop
      
      * fix start up issue && test=develop
      
      * code polish &&  test=develop
      
      * remove unused code && test=develop
      
      * add some cupti cbid && test=develop
      
      * add FLAGS_multiple_of_cupti_buffer_size && test=develop
      
      * fix compile error && test=develop
      
      * add keyword && test=develop
      
      * fix && test=develop
      
      * code polish && test=develop
      a83e4704
  18. 19 2月, 2019 5 次提交