1. 25 Dec, 2020 1 commit
    • 2 0 ps core 2 (#29894) · f781ab08
      tangwei12 committed
      * add ps table (#29463)
      
      * add ps table
      
      Change-Id: I468a04bd071d21ff52654926fcf4d5f3da19e178
      
      * add service (#29560)
      
      * add service, remove ut on mac
      
      * fix heter_profiler & add heter stop method
      
      * fix code style
      
      * merge pscore
      
      Change-Id: Ie7f60d1cdde6755a0c29db26863c6283e9843d57
      
      * fix cmake
      
      Change-Id: I6773509a7b4ca79139ecc40b7bf3eb318ceff8bb
      
      * fix conflict
      
      Change-Id: I35575be0c96a8520f9d756ea7f1ff0b904a165ba
      
      * fix conflict
      
      Change-Id: Ic926ea0b0d67803226d51241397ba3b510226bfa
  2. 22 Dec, 2020 1 commit
  3. 17 Dec, 2020 1 commit
    • [cherry-pick]fix matmulv2 bug & add rebuild group & fix bug of download (#29726) · df0430dc
      ShenLiang committed
      * Fix the download bug in the case of multiple machines (#29551)
      
      * fix the download bug
      * add sort for ips
      
      * Fix bug of matmul_v2 for broadcast case (#29599)
      
      * fix bug of matmul_v2 for broadcast
      
      * Rebuild group automatically in dynamic graph distributed (#29255)
      
      * add tensor_indices in AssignGroupBySize
      
      * add rebuild group in reducer
      
      * fix error message of gather nd (#29521)
  4. 08 Dec, 2020 1 commit
  5. 04 Dec, 2020 1 commit
  6. 03 Dec, 2020 2 commits
  7. 01 Dec, 2020 1 commit
  8. 27 Nov, 2020 2 commits
  9. 26 Nov, 2020 2 commits
    • [sharding] doc, api, bug fixed (#28983) · 0dadacc4
      JZ-LIANG committed
      * add lars to fleet meta optimizer
      
      * add lamb to proto
      
      * add lamb to fleet meta optimizer
      
      * fixed syntax bug
      
      * fixed syntax bug
      
      * fixed syntax error in lamb, add config setter of lamb in distributed_strategy
      
      * trigger unittest to rerun
      
      * add new unittest func for lamb
      
      * revise unittest for lars and lamb
      
      * revise dgc meta unittest
      
      * revise lars document in distribute_strategy
      
      * revise lars lamb document in distributed_strategy.py
      
      * revise lars lamb document in distributed_strategy.py
      
      * add weight decay exclude logic to lars
      
      * restore optimizer.py
      
      * restore optimizer.py as develop except lars
      
      * add epsilon and exclude fn to distributed_strategy
      
      * add lars epsilon
      
      * revise unittest for fleet lars and lamb
      
      * revise lars lamb unittest for CI coverage
      
      * revise lars argument api
      
      * revise lars argument api
      
      * revise lars argument api
      
      * revise api doc of lars
      
      * fix op role
      
      * add sharding save and add_sync_comm_for_test function
      
      * add comm_analyse to utils
      
      * revise sharding_utils
      
      * add sharding saving unittest
      
      * revise sharding utils for unittest
      
      * revise sharding en doc
      
      * update sharding utils api
      
      * add doc for sharding
      
      * fixed bug in sharding var size count
      
      * update varsize count in sharding
      
      * fix sharding num_nccl_comm
      
      * Revert "fix sharding num_nccl_comm"
      
      This reverts commit d51587c15e9323acf226ddd36154275f0d1daf76.
    • fix the bug in gloo (#29112) · 2a864c70
      lilong12 committed
      * update, test=develop
  10. 24 Nov, 2020 1 commit
    • Upgrade string literals to raw string (#28989) · 3815d7aa
      Leo Chen committed
      * upgrade comment string to raw string
      
      * fix string in
      
      * fix string with ' '
      
      * revert update on comments
      
      * upgrade only necessary
      
      * fix sample code checker
      
      * fix comments with '''
  11. 26 Oct, 2020 1 commit
  12. 22 Oct, 2020 1 commit
  13. 19 Oct, 2020 1 commit
  14. 16 Oct, 2020 1 commit
  15. 15 Oct, 2020 3 commits
  16. 14 Oct, 2020 2 commits
  17. 13 Oct, 2020 2 commits
  18. 12 Oct, 2020 1 commit
  19. 30 Sep, 2020 2 commits
  20. 29 Sep, 2020 2 commits
  21. 28 Sep, 2020 4 commits
  22. 27 Sep, 2020 1 commit
  23. 25 Sep, 2020 1 commit
  24. 24 Sep, 2020 1 commit
  25. 23 Sep, 2020 1 commit
  26. 20 Sep, 2020 1 commit
    • 【paddle.fleet】Fix/role maker api fix (#27326) · d6b54de4
      tangwei12 committed
      * fix fleet util and gloo
      
      * fix worker endpoints
      
      * fix
      
      * fix UT
      
      * fix gloo
      
      * fix gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * fix gloo wrapper for hdfs
      
      * add file gloo and UT
      
      * fix UT
      
      * fix UT
      
      * fix UT
      
      * hide public method of RoleMaker
      
      * fix UT
      
      * GPU fleetrun support gloo
      
      * parameterserver fleetrun support gloo
      
      * add UT
      
      * add UT
      
      * fix UT
      
      * fix get server endpoint
      
      * fix get server endpoint
      
      * fix UT
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
      
      * Update test_fleet_rolemaker_new.py
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
  27. 18 Sep, 2020 1 commit
  28. 17 Sep, 2020 1 commit