1. 30 8月, 2023 1 次提交
    • G
      [Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5
      Ghost Screaming 提交于
      * for verify
      
      fluid operator support new comm library
      
      * u
      
      * u
      
      * u
      
      * compatiable new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.
      
      * Remove useless comments in process_group.py
      
      * Polish code style.
      
      * Fix some problems.
      
      * Remove use fluid api in phi comm_context_manager.
      
      * Add PPADDLE_WITH_CUDA and PADDLE_WITH_NCCL micro judgement.
      
      * Fix bug of HIP architecture.
      
      * Fix some problems.
      1. remove useless loggings.
      2. Fix conditional compilation for HIP.
      3. Fix problems of test_pass_generation_pipeline.py. It calls paddle.distributed.init_parallel_env() at first,
      then auto.Engine calls _init_comm(), which will calls process_group.instantiate(). However, init_parallel_env() will call
      paddle.distributed.barrier(), it will call CreateNCCLEnvCache and create corresponding NCCLCommContext. But dev_id is not
      set, as a result, NCCLCommContext's dev_ctx is not initialized.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code.
      
      * Revert compatiable upgrade for communication operators. Their upgrades
      will be submitted in another PR.
      
      * Remove StaticTCPStore.
      
      * Remove useless modification.
      
      * Remove useless set_cuda_device_id.
      
      * Polish code.
      
      * Remove fluid header files in phi files.
      
      * Remove useless comments.
      
      * Fix problems of hip arch.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code style.
      
      ---------
      Co-authored-by: TaoTao Li's avatarhitywt <yuwentao126@126.com>
      ade51aa5
  2. 25 8月, 2023 1 次提交
  3. 07 8月, 2023 1 次提交
  4. 01 8月, 2023 1 次提交
  5. 24 5月, 2023 1 次提交
  6. 16 5月, 2023 1 次提交
  7. 14 2月, 2023 1 次提交
  8. 10 1月, 2023 1 次提交
  9. 09 1月, 2023 1 次提交
  10. 24 6月, 2022 1 次提交
    • G
      Fix hang bug of TCPStore (#43724) · 4c9330d6
      gongweibao 提交于
      * tmp fix
      
      * init
      
      * compile ok
      
      * compile ok
      
      * add vlogs
      
      * add test
      
      * fix termination error
      
      * add testfile
      
      * add
      
      * fix window compile
      
      * fix window compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix kunlun compile
      
      * fix compilation
      
      * fix compilation
      
      * fix compilation
      
      * tmp fix
      
      * add windows
      
      * add windows
      
      * add more logs
      
      * change timeout to protected
      
      * SB
      
      * add
      
      * add
      
      * fix timeout
      
      * add
      
      * fix test
      
      * fix test
      
      * fix test
      
      * fix ut
      
      * fix ut
      
      * fix ut
      4c9330d6
  11. 05 6月, 2022 1 次提交
  12. 13 5月, 2022 1 次提交
  13. 07 3月, 2022 1 次提交
  14. 03 3月, 2022 1 次提交
  15. 22 2月, 2022 1 次提交
  16. 30 7月, 2020 1 次提交
  17. 30 9月, 2019 1 次提交
  18. 23 5月, 2019 1 次提交
  19. 07 4月, 2018 1 次提交
  20. 12 2月, 2018 1 次提交
  21. 10 2月, 2018 2 次提交
  22. 26 12月, 2017 1 次提交
  23. 28 9月, 2017 1 次提交
  24. 27 9月, 2017 1 次提交
  25. 25 7月, 2017 1 次提交
  26. 17 7月, 2017 2 次提交
  27. 11 7月, 2017 2 次提交
  28. 06 7月, 2017 2 次提交
  29. 05 7月, 2017 1 次提交
  30. 04 7月, 2017 4 次提交
  31. 03 7月, 2017 1 次提交
  32. 28 6月, 2017 2 次提交