1. 22 3月, 2019 1 次提交
    • C
      [Speed]Refine ParallelExecutor (#16190) · a6a3b2fb
      chengduo 提交于
      * refine parallelExecutor
      test=develop
      
      * Polish op_handle
      test=develop
      
      * Remove unnecessary op_handle
      test=develop
      
      * Fix Travis CI
      test=develop
      
      * Fix fetch bug
      test=develop
      
      * Remove WaitInputVarGenerated
      
      * Fix OpHandleBase::Run
      test=develop
      
      * debug
      test=develop
      
      * use origin fetch_op_handle
      test=develop
      
      * Revert op_handle_base.cc
      test=develop
      
      * Polish code
      test=develop
      
      * Fix OpHandleBase::Run
      test=develop
      
      * code refine
      
      * test CI and CE
      test=develop
      
      * fix OpHandle::Run
      test=develop
      
      * refine AllReduceOpHandle
      test=develop
      
      * Polish code
      test=develop
      a6a3b2fb
  2. 20 3月, 2019 2 次提交
    • C
      Fuse AllReduce (#15921) · f26ba5bd
      chengduo 提交于
      * fuse all_reduce
      test=develop
      
      * add fuse_parameter_groups_size
      test=develop
      
      * Polish code
      test=develop
      
      * Fix travis-ci
      test=develop
      
      * Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize
      test=develop
      
      * Add SetGroupAccordingToMemorySize
      test=develop
      
      * fix multi_devices_graph
      test=develop
      
      * reset params_grads
      test=develop
      
      * Polish code
      test=develop
      f26ba5bd
    • W
      Collective ops (#15572) · 6382b62f
      Wu Yi 提交于
      * wip allreduce in op
      
      * wip
      
      * wip
      
      * wip
      
      * wip adding test
      
      * wip for conflict with mp mode
      
      * fix tests test=develop
      
      * fix cpu build test=develop
      
      * fix travis clang format test=develop
      
      * fix cpu build test=develop
      
      * update api.spec test=develop
      
      * delete comment test=develop
      
      * fix cpplint test=develop
      
      * fix test=develop
      
      * follow comment test=develop
      
      * add file test=develop
      
      * fix build test=develop
      
      * update test=develop
      
      * to be compatible with sync_bn, and fix mp mode in develop test=develop
      6382b62f
  3. 22 2月, 2019 3 次提交
  4. 21 2月, 2019 1 次提交
  5. 19 2月, 2019 2 次提交
  6. 18 2月, 2019 1 次提交
  7. 14 2月, 2019 3 次提交
  8. 12 2月, 2019 2 次提交
  9. 07 2月, 2019 1 次提交
  10. 10 1月, 2019 1 次提交
  11. 07 1月, 2019 1 次提交
    • C
      Refactor MultiDevSSAGraphBuilder (#15090) · eabb2105
      chengduo 提交于
      * Refactor ParallelExecutor
      test=develop
      
      * extract Reduce and AllReduce mode from MultiDevSSAGraphBuilder
      test=develop
      
      * Refactor MultiDevSSAGraphBuilder
      test=developt
      
      * Remove enable_data_balance
      test=develop
      
      * code refine
      test=develop
      
      * remove data balance
      test=develop
      
      * refine ScaleLossGradOp
      test=develop
      
      * remove uncessary file
      test=develop
      
      * code refine
      test=develop
      
      * modify  function name
      test=develop
      
      * follow comments
      test=develop
      
      * add is_distribution field
      test=develop
      
      * set is_distribution
      test=develop
      
      * fix DistSSAGraphBuilder
      test=develop
      eabb2105
  12. 28 12月, 2018 1 次提交
  13. 27 12月, 2018 1 次提交
    • C
      [WIP] Refine MultiDevSSAGraph (#15040) · fe8495a7
      chengduo 提交于
      * refine parallel_exe
      test=develop
      
      * rename shared_var_device
      
      * code refine
      
      * add test_weight_decay
      
      * remove Sort
      test=develop
      
      * Add SortForReduce
      test=develop
      
      * code refine
      test=develop
      
      * follow comment
      test=develop
      fe8495a7
  14. 26 12月, 2018 2 次提交
    • Y
      cleanup code · 845bfd58
      Yancey1989 提交于
      845bfd58
    • W
      Fp16 training (#14992) · 856f0da0
      Wu Yi 提交于
      * wip
      
      * wip
      
      * wip
      
      * wip for test
      
      * add fp16 tests test=develop
      
      * fix cpu build test=develop
      
      * fix test=develop
      
      * fix py3 tests test=develop
      
      * fix lr_scheduler dtype test=develop
      
      * fix test=dvelop
      
      * test fix ci compile test=develop
      
      * fix build and merge test=develop
      
      * fallback momentumop change to general test=develop
      
      * make fp16 lr schedule simple test=develop
      
      * fix ut test=develop
      
      * fix tests test=develop
      
      * remove fp16 learning rate cast test=develop
      856f0da0
  15. 20 12月, 2018 3 次提交
  16. 17 12月, 2018 2 次提交
  17. 14 12月, 2018 1 次提交
  18. 13 12月, 2018 1 次提交
  19. 12 12月, 2018 1 次提交
  20. 07 12月, 2018 1 次提交
  21. 06 12月, 2018 1 次提交
  22. 04 12月, 2018 1 次提交
  23. 29 11月, 2018 1 次提交
  24. 26 11月, 2018 2 次提交
  25. 22 11月, 2018 1 次提交
  26. 08 11月, 2018 3 次提交
    • P
      merge from develop · dcfab111
      peizhilin 提交于
      dcfab111
    • C
      Fix input<tensor> (#14208) · c5b6573a
      chengduo 提交于
      * fix input<tensor>
      test=develop
      
      * fix split_ids
      test=develop
      
      * ElementwiseMul should not support SelectedRows
      
      * fix scale op
      test=develop
      
      * change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar()
      
      * fix operator
      
      * refine MultiOutput
      
      * fix MultiOutput
      test=develop
      
      * disable test_dist_save_load
      test=develop
      
      * fix elementwise_op
      test=develop
      
      * add get_sparse_as_op
      test=develop
      
      * add info for check
      test=develop
      
      * rename get_sparse_as_op with extract_rows_as_op.
      test=develop
      
      * elementwise doesn't support selected_rows
      
      * fix regularizer
      
      * remove extract_rows_as
      test=develop
      
      * fix ci
      test=develop
      
      * add test for sum_op
      
      * fix regularizer
      test=develop
      
      *  test=develop
      
      * fix pserver weight decay multi inputs test=develop
      c5b6573a
    • M
      Change the origin VLOG level to 10 times · 0c3227a5
      minqiyang 提交于
      Fix code to support cpplint syntax check
      
      test=develop
      0c3227a5