1. 12 1月, 2023 1 次提交
  2. 10 1月, 2023 2 次提交
  3. 09 1月, 2023 1 次提交
  4. 07 1月, 2023 1 次提交
    • R
      Enable standalone executor for fleet training (#49293) · 67fc8e93
      Ruibiao Chen 提交于
      * Enable standalone executor for fleet training
      
      * Update code
      
      * Replace use_standalone_executor utils in auto parallel
      
      * Update code
      
      * Diable standalone executor for test_pass_sharding
      
      * Update code
      
      * Set sequential run for auto parallel
      
      * Fix dist_attr bug
      
      * Set sequential run for auto parallel
      67fc8e93
  5. 06 1月, 2023 2 次提交
    • G
    • Y
      [Auto Parallel] Merge dist attrs from python into c++ (#49214) · c7899074
      Yulong Ao 提交于
      * [Auto Parallel] Rename methods of ProcessMesh
      
      * [Auto Parallel] Impl the python process_mesh by the c++ one
      
      * [Auto Parallel] Add some minor modifications
      
      * [Auto Parallel] Rename some methods
      
      * [Auto Parallel] Remove unnecessary codes
      
      * [Auto Parallel] Add back some removed files
      
      * [Auto Parallel] Fix bugs
      
      * [Auto Parallel] Fix a bug
      
      * Update process_mesh.cc
      
      * [Auto Parallel] Merge dist attrs of Python into C++
      
      * [Auto Parallel] Add back deleted importing
      
      * [Auto Parallel] Add back removed unittest
      
      * [Auto Parallel] Remove type qualifiers of return types
      
      * [Auto Parallel] Fix some bugs
      
      * [Auto Parallel] Fix a bug of the quant pass
      
      * [Auto Parallel] Fix the code style
      c7899074
  6. 04 1月, 2023 1 次提交
    • J
      [Auto Parallel-Performance] Sharding Comm Optimization (#48604) · 5592f8ad
      JZ-LIANG 提交于
      * remove deps and prior comm
      
      * grad comm fuse
      
      * add deps for amp&global norm
      
      * stage2 broadcast prior deps
      
      * stage2 grad overlap
      
      * stream_analyzer bugfix
      
      * overlap enable
      
      * dep op namescope
      
      * depend support multiple inputs
      
      * check finite deps
      
      * stage2 param comm overlap
      
      * Set kD2HStream
      
      * grad comm hierarchical
      
      * grad comm hierarchical
      
      * new unitest
      Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
      5592f8ad
  7. 30 12月, 2022 2 次提交
  8. 29 12月, 2022 1 次提交
  9. 28 12月, 2022 1 次提交
  10. 27 12月, 2022 1 次提交
  11. 26 12月, 2022 1 次提交
    • Y
      [Auto Parallel] Merge the python and c++ impls of ProcessMesh (#47503) · 1c0afa79
      Yulong Ao 提交于
      * [Auto Parallel] Rename methods of ProcessMesh
      
      * [Auto Parallel] Impl the python process_mesh by the c++ one
      
      * [Auto Parallel] Add some minor modifications
      
      * [Auto Parallel] Rename some methods
      
      * [Auto Parallel] Remove unnecessary codes
      
      * [Auto Parallel] Add back some removed files
      
      * [Auto Parallel] Fix bugs
      
      * [Auto Parallel] Fix a bug
      
      * Update process_mesh.cc
      
      * [Auto Parallel] Fix a bug
      1c0afa79
  12. 23 12月, 2022 1 次提交
  13. 21 12月, 2022 1 次提交
  14. 14 12月, 2022 1 次提交
    • Z
      [AutoParallel] recompute tuning (#48608) · 170a31f9
      zhaoyingli 提交于
      * [AutoParallel] recompute tuning
      
      * fix conflict
      
      * update comment
      
      * bug fix
      
      * update rc algo
      
      * tiny fix
      
      * fix clear process_group
      
      * remove comment
      
      * update segment print
      
      * fix import OpRole
      
      * adapt amp pass and grad_clip pass for opt_tuner
      
      * update tuning config
      
      * fix import
      
      * annotate recompute info on ops and upgrade recompute pass
      
      * add op_namescope for seed op
      
      * record reserved vars
      
      * fix recompute var's dist_attr
      
      * fix strategy unittest
      
      * adapt for fp16
      
      * update unittest
      
      * revert copy opt
      
      * update unittest
      
      * rename set_recompute_segments
      
      * fix unittest
      170a31f9
  15. 12 12月, 2022 1 次提交
  16. 09 12月, 2022 1 次提交
  17. 02 12月, 2022 1 次提交
    • S
      Fix fuse_gemm_epilogue (#47805) · 6efc2888
      Shijie 提交于
      * Fix fuse_gemm_epilogue
      
      * update tests
      
      * Update CMakeLists.txt
      
      * Update CMakeLists.txt
      
      * Update CMakeLists.txt
      
      * fix random seed
      
      * use assert_allclose
      
      * Update test_dist_fuse_gemm_epilogue_pass.py
      
      * Update cpp_pass.py
      
      * Update test_dist_fuse_gemm_epilogue_pass.py
      
      * fix codestyle
      
      * update seed and atol
      6efc2888
  18. 01 12月, 2022 1 次提交
  19. 29 11月, 2022 2 次提交
  20. 28 11月, 2022 1 次提交
  21. 25 11月, 2022 1 次提交
  22. 22 11月, 2022 1 次提交
  23. 18 11月, 2022 2 次提交
  24. 14 11月, 2022 2 次提交
  25. 10 11月, 2022 1 次提交
  26. 09 11月, 2022 1 次提交
  27. 08 11月, 2022 2 次提交
  28. 07 11月, 2022 2 次提交
  29. 03 11月, 2022 1 次提交
  30. 01 11月, 2022 1 次提交
  31. 23 10月, 2022 1 次提交
  32. 20 10月, 2022 1 次提交