1. 01 3月, 2021 1 次提交
    • T
      [Cherry pick] cherry-pick #31102 #30750 #30626 (#31336) · ff4612a3
      Thunderbrook 提交于
      * solve build gpu task core (#30626)
      
      * build gpu task core
      
      * format
      
      * dump to cpu (#30750)
      
      * dump to cpu
      
      * format
      
      * format
      
      * format
      
      * support multi node in heterps (#31102)
      
      * push multi node
      
      * multi node
      
      * MultiThread
      
      * remove log
      
      * solve bug in 30829
      
      * optimizer
      ff4612a3
  2. 25 2月, 2021 1 次提交
  3. 12 1月, 2021 1 次提交
  4. 05 1月, 2021 1 次提交
  5. 29 12月, 2020 1 次提交
  6. 25 12月, 2020 1 次提交
    • T
      2 0 ps core 2 (#29894) · f781ab08
      tangwei12 提交于
      * add ps table (#29463)
      
      * add ps table
      
      Change-Id: I468a04bd071d21ff52654926fcf4d5f3da19e178
      
      * add service (#29560)
      
      * add service, remove ut on mac
      
      * fix heter_profiler & add heter stop method
      
      * fix code style
      
      * merge pscore
      
      Change-Id: Ie7f60d1cdde6755a0c29db26863c6283e9843d57
      
      * fix cmake
      
      Change-Id: I6773509a7b4ca79139ecc40b7bf3eb318ceff8bb
      
      * fix conflit
      
      Change-Id: I35575be0c96a8520f9d756ea7f1ff0b904a165ba
      
      * fix conflit
      
      Change-Id: Ic926ea0b0d67803226d51241397ba3b510226bfa
      f781ab08
  7. 03 12月, 2020 1 次提交
    • Z
      [Cherry-pick] Add pure fp16 training with master weights. (#29301) · d8ea8a06
      Zhen Wang 提交于
      * Add pure fp16 training with master weights. (#27712)
      
      * add the weight decay func for the momentum op
      
      * Add the multi_precision function in Momentum Optimizer.
      
      * Make sure that the initial value of master weights are same with the fp16 weights.
      
      * add static loss scaling.
      
      * add the rescale_grad function in the pure fp16 training.
      
      * use the original momentum updating method.
      
      * Polish some codes, such as variable names.
      
      * add docstring for apis.
      
      * update the var creation details of _create_master_weight.
      
      * not modify codes about imperative momentum updating.
      
      * Fix the error of test_dist_sparse_tensor_load_momentum UT.
      
      * add unit test for multi precision fp16 training.
      
      * add more unit tests for CI.
      
      * Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.
      d8ea8a06
  8. 30 11月, 2020 2 次提交
  9. 27 11月, 2020 1 次提交
  10. 24 11月, 2020 1 次提交
    • L
      Upgrade string literals to raw string (#28989) · 3815d7aa
      Leo Chen 提交于
      * upgrade comment string to raw string
      
      * fix string in
      
      * fix string with ' '
      
      * revert update on comments
      
      * upgrade only necessary
      
      * fix sample code checker
      
      * fix comments with '''
      3815d7aa
  11. 23 11月, 2020 1 次提交
    • T
      support ps-gpu (#28752) · 0073f9bd
      Thunderbrook 提交于
      * ps gpu transpile
      
      * ps gpu
      
      * remove op
      
      * gps trainer
      
      * local ps
      
      * add macro
      
      * HeterBox
      
      * def cuda
      
      * tab
      
      * code style
      
      * style
      
      Co-authored-by: Thunderbrook <a754913769#163.com>
      0073f9bd
  12. 28 10月, 2020 1 次提交
  13. 19 10月, 2020 1 次提交
  14. 15 10月, 2020 1 次提交
  15. 14 10月, 2020 2 次提交
    • C
      【paddle.fleet】fix sparse load (#27680) · 328cb289
      Chengmo 提交于
      * add sparse tensor load method
      328cb289
    • Z
      Multi task (#26002) · 5a83496c
      zhang wenhui 提交于
      * add multitask
      
      * add multitask, test=develop
      
      * fix code style, test=develop
      
      * add partail push dense, test=develop
      
      * fix has_kay in py3, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      5a83496c
  16. 13 10月, 2020 1 次提交
    • C
      【paddle.fleet】Update fleetrun & ps-heter (#27472) · c5f2802d
      Chengmo 提交于
      * refine fleetrun.ps_launch
      
      * update fleet run for multi device support
      
      * ps_graph support ps-gpu
      
      * fix heter save
      
      * add heter save unittest
      
      * fix unittest & simple code
      
      * update fleetrun
      
      * fix fleetrun
      
      * fix launch barrier
      
      * fix role maker
      
      * add paddlecloud rolemaker unittest
      
      * rename heter_worker_device_guard
      c5f2802d
  17. 29 9月, 2020 2 次提交
  18. 28 9月, 2020 4 次提交
  19. 23 9月, 2020 1 次提交
    • T
      large scale kv speedup (#26510) · bc5f0246
      tangwei12 提交于
      * rename communicator meet->BatchesCounter
      
      * fix parame recv for sparse
      
      * geo sparse init from pserver
      
      * optimize init from pserver
      
      * add large scale optimizer fuse(SGD/ADAM)
      
      * rectification init_worker and exe.run startup program
      bc5f0246
  20. 20 9月, 2020 1 次提交
    • T
      【paddle.fleet】Fix/role maker api fix (#27326) · d6b54de4
      tangwei12 提交于
      * fix fleet util and gloo
      
      * fix worker endpoints
      
      * fix
      
      * fix UT
      
      * fix gloo
      
      * fix gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * fix gloo wrapper for hdfs
      
      * add file gloo and UT
      
      * fix UT
      
      * fix UT
      
      * fix UT
      
      * hide public method of RoleMaker
      
      * fix UT
      
      * GPU fleetrun support gloo
      
      * parameterserver fleetrun support gloo
      
      * add UT
      
      * add UT
      
      * fix UT
      
      * fix get server endpoint
      
      * fix get server endpoint
      
      * fix UT
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
      
      * Update test_fleet_rolemaker_new.py
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
      d6b54de4
  21. 17 9月, 2020 1 次提交
  22. 16 9月, 2020 1 次提交
  23. 08 9月, 2020 1 次提交
  24. 04 9月, 2020 1 次提交
  25. 02 9月, 2020 1 次提交
  26. 31 8月, 2020 1 次提交
  27. 30 8月, 2020 1 次提交
  28. 21 8月, 2020 1 次提交
  29. 19 8月, 2020 1 次提交
  30. 13 8月, 2020 1 次提交
  31. 10 8月, 2020 1 次提交
  32. 08 8月, 2020 1 次提交
  33. 07 8月, 2020 2 次提交