1. 05 2月, 2020 1 次提交
  2. 13 9月, 2019 1 次提交
    • C
      Open fuse all reduce option (#19765) · 056fdedd
      chengduo 提交于
      * Open fuse all reduce op
      test=develop
      
      * Add Fuse optimization op log
      
      * Add log in fuse_optimizer op pass and fuse all_reduce op pass
      
      * replace with boost::optional<bool>
      test=develop
      
      * Polish code
      test=develop
      
      * fix code coverage
      test=develop
      056fdedd
  3. 11 9月, 2019 1 次提交
  4. 28 8月, 2019 1 次提交
    • T
      Fix the correctness of async mode at distributed training (#18863) · 65c73684
      tangwei12 提交于
      * fix correctness of the communicator
      
      * fix a bug in send thread when sending var context is empty, test=develop
      
      * add lookup_table_prefetch_op and prefetch optimize, test=develop
      
      * remove remote prefetch GPU supported
      
      * word2vec force with CPU, test=develop
      
      * test dist remote lookup table force with CPU, test=develop
      65c73684
  5. 14 6月, 2019 1 次提交
  6. 27 5月, 2019 1 次提交
  7. 23 5月, 2019 1 次提交
  8. 08 5月, 2019 1 次提交
  9. 18 4月, 2019 1 次提交
  10. 01 4月, 2019 1 次提交
  11. 31 3月, 2019 2 次提交
  12. 28 3月, 2019 3 次提交
  13. 27 3月, 2019 1 次提交
  14. 20 3月, 2019 2 次提交
    • C
      Fuse AllReduce (#15921) · f26ba5bd
      chengduo 提交于
      * fuse all_reduce
      test=develop
      
      * add fuse_parameter_groups_size
      test=develop
      
      * Polish code
      test=develop
      
      * Fix travis-ci
      test=develop
      
      * Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize
      test=develop
      
      * Add SetGroupAccordingToMemorySize
      test=develop
      
      * fix multi_devices_graph
      test=develop
      
      * reset params_grads
      test=develop
      
      * Polish code
      test=develop
      f26ba5bd
    • W
      Collective ops (#15572) · 6382b62f
      Wu Yi 提交于
      * wip allreduce in op
      
      * wip
      
      * wip
      
      * wip
      
      * wip adding test
      
      * wip for conflict with mp mode
      
      * fix tests test=develop
      
      * fix cpu build test=develop
      
      * fix travis clang format test=develop
      
      * fix cpu build test=develop
      
      * update api.spec test=develop
      
      * delete comment test=develop
      
      * fix cpplint test=develop
      
      * fix test=develop
      
      * follow comment test=develop
      
      * add file test=develop
      
      * fix build test=develop
      
      * update test=develop
      
      * to be compatible with sync_bn, and fix mp mode in develop test=develop
      6382b62f
  15. 08 3月, 2019 1 次提交
  16. 05 3月, 2019 1 次提交
  17. 21 2月, 2019 1 次提交
  18. 14 2月, 2019 3 次提交
  19. 12 2月, 2019 2 次提交
  20. 10 2月, 2019 1 次提交
  21. 17 1月, 2019 1 次提交
  22. 07 1月, 2019 1 次提交
    • C
      Refactor MultiDevSSAGraphBuilder (#15090) · eabb2105
      chengduo 提交于
      * Refactor ParallelExecutor
      test=develop
      
      * extract Reduce and AllReduce mode from MultiDevSSAGraphBuilder
      test=develop
      
      * Refactor MultiDevSSAGraphBuilder
      test=developt
      
      * Remove enable_data_balance
      test=develop
      
      * code refine
      test=develop
      
      * remove data balance
      test=develop
      
      * refine ScaleLossGradOp
      test=develop
      
      * remove uncessary file
      test=develop
      
      * code refine
      test=develop
      
      * modify  function name
      test=develop
      
      * follow comments
      test=develop
      
      * add is_distribution field
      test=develop
      
      * set is_distribution
      test=develop
      
      * fix DistSSAGraphBuilder
      test=develop
      eabb2105
  23. 27 12月, 2018 1 次提交
    • C
      [WIP] Refine MultiDevSSAGraph (#15040) · fe8495a7
      chengduo 提交于
      * refine parallel_exe
      test=develop
      
      * rename shared_var_device
      
      * code refine
      
      * add test_weight_decay
      
      * remove Sort
      test=develop
      
      * Add SortForReduce
      test=develop
      
      * code refine
      test=develop
      
      * follow comment
      test=develop
      fe8495a7
  24. 26 12月, 2018 1 次提交
    • W
      Fp16 training (#14992) · 856f0da0
      Wu Yi 提交于
      * wip
      
      * wip
      
      * wip
      
      * wip for test
      
      * add fp16 tests test=develop
      
      * fix cpu build test=develop
      
      * fix test=develop
      
      * fix py3 tests test=develop
      
      * fix lr_scheduler dtype test=develop
      
      * fix test=dvelop
      
      * test fix ci compile test=develop
      
      * fix build and merge test=develop
      
      * fallback momentumop change to general test=develop
      
      * make fp16 lr schedule simple test=develop
      
      * fix ut test=develop
      
      * fix tests test=develop
      
      * remove fp16 learning rate cast test=develop
      856f0da0
  25. 20 12月, 2018 3 次提交
  26. 22 11月, 2018 1 次提交
  27. 06 11月, 2018 1 次提交
  28. 29 10月, 2018 2 次提交
    • W
      [1.1] [project] train imagenet using large batch size (#13766) · 26200f2e
      Wu Yi 提交于
      * fix nccl2 lars dist support
      
      * put lars in momentum op
      
      * add tests lars
      
      * fix ci
      
      * fix cpu kernel
      
      * soft warning
      
      * remove lars in test_recognize_digits.py
      
      * move to another op
      
      * add file
      
      * update api.spec test=develop
      
      * update test=develop
      
      * fix api.spec test=develop
      
      * wip
      
      * wip, finish grad merge ops
      
      * wip, finish graph build
      
      * wip test running
      
      * work on 1 gpu
      
      * workable version
      
      * update
      
      * fix tests
      
      * fuse broadcast op
      
      * fix compile failed
      
      * refine
      
      * add batch merge test mnist
      
      * fix CI test=develop
      
      * fix build
      
      * use independent bn params for batch merge test=develop
      
      * update api.spec
      
      * follow comments and for test
      
      * wip
      
      * refine tests test=develop
      
      * follow comments test=develop
      
      * remove startup bn modify test=develop
      
      * follow comments test=develop
      
      * fix merge test=develop
      26200f2e
    • S
      move to pass · 45559d04
      sneaxiy 提交于
      test=develop
      45559d04
  29. 12 10月, 2018 1 次提交
  30. 21 9月, 2018 1 次提交