1. 15 3月, 2019 1 次提交
    • Q
      Support sync batch norm. (#16121) · 8ad672a2
      qingqing01 提交于
      * Support Sync Batch Norm.
      * Note, do not enable it in one device.
      
      Usage:
      
      build_strategy = fluid.BuildStrategy()
      build_strategy.sync_batch_norm = True
      binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
              loss_name=loss_mean.name,
              build_strategy=build_strategy)
      8ad672a2
  2. 28 1月, 2019 1 次提交
  3. 26 12月, 2018 1 次提交
    • W
      Fp16 training (#14992) · 856f0da0
      Wu Yi 提交于
      * wip
      
      * wip
      
      * wip
      
      * wip for test
      
      * add fp16 tests test=develop
      
      * fix cpu build test=develop
      
      * fix test=develop
      
      * fix py3 tests test=develop
      
      * fix lr_scheduler dtype test=develop
      
      * fix test=dvelop
      
      * test fix ci compile test=develop
      
      * fix build and merge test=develop
      
      * fallback momentumop change to general test=develop
      
      * make fp16 lr schedule simple test=develop
      
      * fix ut test=develop
      
      * fix tests test=develop
      
      * remove fp16 learning rate cast test=develop
      856f0da0
  4. 20 12月, 2018 2 次提交
  5. 14 12月, 2018 1 次提交
  6. 12 12月, 2018 1 次提交
  7. 07 12月, 2018 1 次提交
  8. 06 12月, 2018 1 次提交
  9. 04 12月, 2018 1 次提交
  10. 26 11月, 2018 1 次提交
  11. 12 11月, 2018 1 次提交
  12. 08 11月, 2018 1 次提交
  13. 05 11月, 2018 1 次提交
  14. 08 9月, 2018 1 次提交
    • W
      Benchmark tool for imgnet (#12305) · f90c7865
      Wu Yi 提交于
      * support test using executor without reader
      
      * run imgnet
      
      * update fluid benchmark
      
      * wip
      
      * update
      
      * update all models
      
      * support pyreader
      
      * update
      
      * clean up
      
      * make profile batches contollable
      
      * update API.spec
      
      * update scripts
      
      * clean dockerfile
      
      * update
      
      * clean comments
      
      * add scope argument docstring
      
      * use num_trainers to determine nccl init comms
      f90c7865
  15. 14 6月, 2018 1 次提交
    • Q
      Fix NCCLBcast hang up bug in Parallel Executor (#11377) · 046bb5c8
      Qiyang Min 提交于
      * 1. Create buddy allocator in each places before NcclBcast the variables
      2. Check the memory usage of ALL gpus rather than the first one
      
      * 1. Make NCCLGroupGuard guards only the ncclBcast part, which avoid ncclGroupEnd blocking the exception throwing
      2. NOTE the usage of NCCLGroupGuard
      
      * Remove the memory usage check of gpus
      
      * Fix code style
      046bb5c8
  16. 01 6月, 2018 1 次提交
  17. 14 5月, 2018 2 次提交
  18. 11 5月, 2018 1 次提交
  19. 07 5月, 2018 1 次提交
  20. 05 5月, 2018 1 次提交
  21. 04 5月, 2018 1 次提交
  22. 16 4月, 2018 1 次提交
  23. 11 4月, 2018 2 次提交
  24. 27 3月, 2018 2 次提交
  25. 21 3月, 2018 4 次提交
  26. 08 3月, 2018 1 次提交
  27. 07 3月, 2018 2 次提交
  28. 06 3月, 2018 2 次提交
  29. 15 2月, 2018 1 次提交
    • Y
      Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang 提交于
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
      cfffb1a3
  30. 10 2月, 2018 2 次提交