1. 27 Apr, 2021 1 commit
  2. 26 Apr, 2021 1 commit
  3. 15 Apr, 2021 1 commit
    • heterps support pscore (#32093) · 9f8c8f96
      Committed by Thunderbrook
      * pscore support heterps
      
      * fleet cmake
      
      * fleet wrapper
      
      * macro
      
      * solve conflict
      
      * solve conflict
      
      * add unit test
      
      * paddle enforce
      
      * unit test
      
      * unit test
      
      * unit test
  4. 18 Mar, 2021 1 commit
  5. 24 Feb, 2021 1 commit
    • fix entry (#31079) · ebbdf525
      Committed by tangwei12
      * fix entry
      
      * fix distributed lookup table fuse case
      
      * fix entry bug at first time
      
      * move entry from paddle.fluid -> paddle.distributed
      
      * fix ut with paddle.enable_static()
      Co-authored-by: malin10 <malin10@baidu.com>
  6. 08 Jan, 2021 1 commit
  7. 24 Dec, 2020 1 commit
  8. 02 Dec, 2020 1 commit
    • Add pure fp16 training with master weights. (#27712) · be3777a5
      Committed by Zhen Wang
      * add the weight decay func for the momentum op
      
      * Add the multi_precision function in Momentum Optimizer.
      
      * Make sure that the initial values of the master weights are the same as the fp16 weights.
      
      * add static loss scaling.
      
      * add the rescale_grad function in the pure fp16 training.
      
      * use the original momentum updating method.
      
      * Polish some code, such as variable names.
      
      * add docstring for apis.
      
      * update the var creation details of _create_master_weight.
      
      * do not modify the code for imperative momentum updating.
      
      * Fix the error of test_dist_sparse_tensor_load_momentum UT.
      
      * add unit test for multi precision fp16 training.
      
      * add more unit tests for CI.
      
      * Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.
      
      * For CI Coverage Checking.
  9. 30 Nov, 2020 1 commit
  10. 28 Oct, 2020 1 commit
  11. 15 Oct, 2020 1 commit
  12. 14 Oct, 2020 1 commit
  13. 13 Oct, 2020 1 commit
    • 【paddle.fleet】Update fleetrun & ps-heter (#27472) · c5f2802d
      Committed by Chengmo
      * refine fleetrun.ps_launch
      
      * update fleet run for multi device support
      
      * ps_graph support ps-gpu
      
      * fix heter save
      
      * add heter save unittest
      
      * fix unittest & simple code
      
      * update fleetrun
      
      * fix fleetrun
      
      * fix launch barrier
      
      * fix role maker
      
      * add paddlecloud rolemaker unittest
      
      * rename heter_worker_device_guard
  14. 29 Sep, 2020 1 commit
  15. 23 Sep, 2020 1 commit
    • large scale kv speedup (#26510) · bc5f0246
      Committed by tangwei12
      * rename communicator meet->BatchesCounter
      
      * fix param recv for sparse
      
      * geo sparse init from pserver
      
      * optimize init from pserver
      
      * add large scale optimizer fuse(SGD/ADAM)
      
      * rectify init_worker and exe.run startup program
  16. 20 Sep, 2020 1 commit
    • 【paddle.fleet】Fix/role maker api fix (#27326) · d6b54de4
      Committed by tangwei12
      * fix fleet util and gloo
      
      * fix worker endpoints
      
      * fix
      
      * fix UT
      
      * fix gloo
      
      * fix gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * fix gloo wrapper for hdfs
      
      * add file gloo and UT
      
      * fix UT
      
      * fix UT
      
      * fix UT
      
      * hide public method of RoleMaker
      
      * fix UT
      
      * GPU fleetrun support gloo
      
      * parameterserver fleetrun support gloo
      
      * add UT
      
      * add UT
      
      * fix UT
      
      * fix get server endpoint
      
      * fix get server endpoint
      
      * fix UT
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
      
      * Update test_fleet_rolemaker_new.py
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
  17. 08 Sep, 2020 1 commit
  18. 04 Sep, 2020 1 commit
  19. 02 Sep, 2020 1 commit
  20. 30 Aug, 2020 1 commit
  21. 21 Aug, 2020 1 commit
  22. 07 Aug, 2020 1 commit
  23. 30 Jul, 2020 1 commit