1. 20 Mar, 2019 1 commit
    • Collective ops (#15572) · 6382b62f
      Wu Yi committed
      * wip allreduce in op
      
      * wip
      
      * wip
      
      * wip
      
      * wip adding test
      
      * wip for conflict with mp mode
      
      * fix tests test=develop
      
      * fix cpu build test=develop
      
      * fix travis clang format test=develop
      
      * fix cpu build test=develop
      
      * update api.spec test=develop
      
      * delete comment test=develop
      
      * fix cpplint test=develop
      
      * fix test=develop
      
      * follow comment test=develop
      
      * add file test=develop
      
      * fix build test=develop
      
      * update test=develop
      
      * to be compatible with sync_bn, and fix mp mode in develop test=develop
  2. 15 Mar, 2019 1 commit
    • Support sync batch norm. (#16121) · 8ad672a2
      qingqing01 committed
      * Support Sync Batch Norm.
      * Note: do not enable it on a single device.
      
      Usage:
      
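      import paddle.fluid as fluid

      # tp is the training program and loss_mean its loss variable, both defined elsewhere.
      build_strategy = fluid.BuildStrategy()
      build_strategy.sync_batch_norm = True  # synchronize batch norm statistics across devices
      binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
              loss_name=loss_mean.name,
              build_strategy=build_strategy)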
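
For readers unfamiliar with the term, the general idea behind synchronized batch normalization can be sketched as follows. This is not the Paddle implementation: devices are simulated with plain Python lists, and every helper name here (`local_stats`, `all_reduce_sum`, `sync_batch_stats`, `normalize`) is hypothetical. Each device computes partial sums over its own mini-batch, the partial sums are combined across devices, and every device then normalizes with the same global mean and variance.

```python
# Illustrative sketch only: devices are simulated with Python lists.
# None of these helper names come from Paddle.

def local_stats(batch):
    """Per-device partial statistics: (count, sum, sum of squares)."""
    return len(batch), sum(batch), sum(x * x for x in batch)


def all_reduce_sum(per_device):
    """Stand-in for a collective all-reduce: element-wise sum across devices."""
    return tuple(sum(vals) for vals in zip(*per_device))


def sync_batch_stats(device_batches):
    """Global mean and biased variance over the samples of all devices."""
    count, total, total_sq = all_reduce_sum(
        [local_stats(b) for b in device_batches])
    mean = total / count
    var = total_sq / count - mean * mean
    return mean, var


def normalize(batch, mean, var, eps=1e-5):
    """Normalize one device's batch with the shared statistics."""
    return [(x - mean) / (var + eps) ** 0.5 for x in batch]


# Two simulated devices with very different local statistics.
device_batches = [[1.0, 2.0, 3.0], [10.0, 11.0, 12.0]]
mean, var = sync_batch_stats(device_batches)
normalized = [normalize(b, mean, var) for b in device_batches]
```

In this sketch, a single device would reduce only its own partial sums, so the result would equal ordinary per-device batch normalization.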
  3. 28 Jan, 2019 1 commit
  4. 26 Dec, 2018 1 commit
    • Fp16 training (#14992) · 856f0da0
      Wu Yi committed
      * wip
      
      * wip
      
      * wip
      
      * wip for test
      
      * add fp16 tests test=develop
      
      * fix cpu build test=develop
      
      * fix test=develop
      
      * fix py3 tests test=develop
      
      * fix lr_scheduler dtype test=develop
      
      * fix test=develop
      
      * test fix ci compile test=develop
      
      * fix build and merge test=develop
      
      * fallback momentumop change to general test=develop
      
      * make fp16 lr schedule simple test=develop
      
      * fix ut test=develop
      
      * fix tests test=develop
      
      * remove fp16 learning rate cast test=develop
  5. 20 Dec, 2018 2 commits
  6. 14 Dec, 2018 1 commit
  7. 12 Dec, 2018 1 commit
  8. 07 Dec, 2018 1 commit
  9. 06 Dec, 2018 1 commit
  10. 04 Dec, 2018 1 commit
  11. 26 Nov, 2018 1 commit
  12. 12 Nov, 2018 1 commit
  13. 08 Nov, 2018 1 commit
  14. 05 Nov, 2018 1 commit
  15. 08 Sep, 2018 1 commit
    • Benchmark tool for imgnet (#12305) · f90c7865
      Wu Yi committed
      * support test using executor without reader
      
      * run imgnet
      
      * update fluid benchmark
      
      * wip
      
      * update
      
      * update all models
      
      * support pyreader
      
      * update
      
      * clean up
      
      * make profile batches controllable
      
      * update API.spec
      
      * update scripts
      
      * clean dockerfile
      
      * update
      
      * clean comments
      
      * add scope argument docstring
      
      * use num_trainers to determine nccl init comms
  16. 14 Jun, 2018 1 commit
    • Fix NCCLBcast hang up bug in Parallel Executor (#11377) · 046bb5c8
      Qiyang Min committed
      * 1. Create the buddy allocator on each place before broadcasting the variables with ncclBcast
        2. Check the memory usage of all GPUs rather than only the first one

      * 1. Make NCCLGroupGuard guard only the ncclBcast part, which prevents ncclGroupEnd from blocking while an exception is being thrown
        2. Note the usage of NCCLGroupGuard
      
      * Remove the memory usage check of gpus
      
      * Fix code style
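
The narrowing of the guard described in this commit can be illustrated with a small Python analogy. The real code is C++ and is not shown on this page, so everything below is an assumption made for illustration: `group_guard`, `prepare`, and `broadcast` are hypothetical names, and the "calls" are just strings appended to a list. The point of the sketch is that fallible setup runs before the guarded region opens, so an exception raised during setup never has to pass through the group-end step.

```python
from contextlib import contextmanager

events = []  # records the order of the simulated calls


@contextmanager
def group_guard():
    """Hypothetical stand-in for a group start/end pair."""
    events.append("group_start")
    try:
        yield
    finally:
        # Runs even if the body raises, like a scope guard's destructor.
        events.append("group_end")


def prepare(places, fail=False):
    """Fallible setup (e.g. allocator creation), kept outside the guard."""
    if fail:
        raise RuntimeError("setup failed")
    events.append("prepare")


def broadcast(places, fail_in_setup=False):
    prepare(places, fail=fail_in_setup)  # may raise; no group is open yet
    with group_guard():                  # guard covers only the broadcasts
        for p in places:
            events.append(f"bcast:{p}")


# Normal path: setup, then a tightly scoped group around the broadcasts.
broadcast([0, 1])
ok_events = list(events)

# Failure path: the exception propagates without any group being opened.
events.clear()
try:
    broadcast([0, 1], fail_in_setup=True)
except RuntimeError:
    pass
failed_events = list(events)
```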
  17. 01 Jun, 2018 1 commit
  18. 14 May, 2018 2 commits
  19. 11 May, 2018 1 commit
  20. 07 May, 2018 1 commit
  21. 05 May, 2018 1 commit
  22. 04 May, 2018 1 commit
  23. 16 Apr, 2018 1 commit
  24. 11 Apr, 2018 2 commits
  25. 27 Mar, 2018 2 commits
  26. 21 Mar, 2018 4 commits
  27. 08 Mar, 2018 1 commit
  28. 07 Mar, 2018 2 commits
  29. 06 Mar, 2018 2 commits
  30. 15 Feb, 2018 1 commit
    • Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang committed
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
  31. 10 Feb, 2018 1 commit