1. 17 4月, 2023 1 次提交
  2. 03 4月, 2023 1 次提交
  3. 30 3月, 2023 1 次提交
  4. 08 12月, 2022 1 次提交
    • G
      Clean fluid APIs in distributed and fleet files (#48851) · 911d6bb1
      Ghost Screaming 提交于
      * Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
      is wrong.
      
      * Remove climits.
      
      * Clean fluid API in paddle/distributed and paddle/fleetx folders.
      Include following files:
      python/paddle/distributed/__init__.py
      python/paddle/distributed/collective.py
      python/paddle/distributed/fleet/utils/fs.py
      python/paddle/distributed/fleet/utils/hybrid_parallel_inference.py
      python/paddle/distributed/fleet/utils/hybrid_parallel_util.py
      python/paddle/distributed/fleet/utils/internal_storage.py
      python/paddle/distributed/launch/context/device.py
      python/paddle/distributed/parallel.py
      python/paddle/distributed/parallel_with_gloo.py
      python/paddle/distributed/spawn.py
      python/paddle/framework/__init__.py
      To be mentioned, 'paddle.fluid.dygraph.parallel.ParallelEnv'
       and 'fluid.framework.core' keeps unchanged in those files.
      ParallelEnv is used by paddle.fluid.dygraph.parallel.DataParallel.
      However, APIs in paddle.fluid.dygraph.parallel can't be
      migrated to paddle.distributed, as there exists cyclic import
      dependencies in modules like paddle.static, paddle.tensor. And
      'fluid.framework.core' will be changed to import framework.core
      after fluid.core is transmitted.
      
      * Change TODO authors.
      911d6bb1
  5. 29 11月, 2022 2 次提交
  6. 22 11月, 2022 1 次提交
  7. 08 11月, 2022 1 次提交
  8. 23 10月, 2022 1 次提交
  9. 19 10月, 2022 1 次提交
  10. 18 10月, 2022 1 次提交
  11. 12 10月, 2022 1 次提交
    • N
      [CodeStyle][F401] remove unused imports in python/paddle/distributed (#46758) · fe716a0b
      Nyakku Shigure 提交于
      * [CodeStyle][F401] remove unused import in python/paddle/distributed
      
      * remove pass
      
      * empty commit
      
      * Fix ValueError: list.remove(x): x not in list for meta_optimizer_names.
      
      Fix ValueError: list.remove(x): x not in list for meta_optimizer_names.
      
      * Fix split import.
      
      Fix split import.
      
      * add noqa after meta_optimizers in factory
      
      * restort collective ops
      
      * expand `import *`
      
      * add noqa after required imports
      
      * try to fix APIs without core.ops
      
      * Revert "try to fix APIs without core.ops"
      
      This reverts commit 6172beaf601e84bf61f2490c12c4739f0edaa5eb.
      
      * fix an increment
      
      * empty commit
      
      * add noqa after required imports
      
      * expand `import *`, fix ci error
      Co-authored-by: NShuangchi He <34329208+Yulv-git@users.noreply.github.com>
      fe716a0b
  12. 27 9月, 2022 1 次提交
  13. 20 9月, 2022 1 次提交
    • R
      logger manager (#45909) · 264ad205
      Roc 提交于
      uniform logger manager in FleetAPI.
      hidde API under distributed/utils which users don't need.
      264ad205
  14. 05 6月, 2022 1 次提交
    • S
      【code format check upgrade】 step2:yapf (#42944) · a072fca8
      Sing_chan 提交于
      * use yapf to format all python file
      
      * yapf exclude two unittests file for they rely on writing and reading file, and format will break them
      
      * disable diff_py_file because too many diff files cause command following failed
      a072fca8
  15. 20 5月, 2022 1 次提交
  16. 06 4月, 2022 1 次提交
  17. 21 10月, 2021 1 次提交
  18. 11 6月, 2021 1 次提交
  19. 01 6月, 2021 1 次提交
  20. 06 5月, 2021 1 次提交
  21. 27 4月, 2021 1 次提交
  22. 01 4月, 2021 1 次提交
  23. 05 3月, 2021 1 次提交
  24. 07 1月, 2021 1 次提交
  25. 05 1月, 2021 1 次提交
  26. 26 11月, 2020 1 次提交
  27. 24 11月, 2020 1 次提交
  28. 14 10月, 2020 1 次提交
  29. 29 9月, 2020 1 次提交
  30. 28 9月, 2020 2 次提交
  31. 31 8月, 2020 1 次提交
  32. 28 8月, 2020 1 次提交
    • C
      Add interface to launch parallel dygraph by multiprocessing (#26044) · 31f422ae
      Chen Weihang 提交于
      * add dygraph parallel run interface
      
      * polish implement & unified env property name
      
      * add print config arg
      
      * refactor init_parallel_env function
      
      * Compatible with multiprocessing and launch modes
      
      * set default trainer start port
      
      * support run in python 2
      
      * polish python2 support code
      
      * remove python2 support
      
      * refine launch import
      
      * polish dome design details
      
      * refactor api implemention & path
      
      * use new method _set_expected_place
      
      * add spawn unittest framework & mnist test
      
      * add more unittests & doc
      
      * fix unittest failed
      
      * polish english doc
      
      * self review and polish details
      
      * refactor code by reviewer's comments
      
      * fix unittest failed
      
      * fix parallel_env unittest
      
      * fix several typos
      
      * fix error introduced when fixing typos
      
      * add unpublic note for start_processes
      
      * polish details by xiaoguang's comment
      
      * verify correctly when spawn nprocs=-1
      
      * refactor spawn & init_parallel_env design
      
      * polish doc details
      
      * open spawn unittests
      
      * try to fix doc compile error
      
      * try to fix unknown doc format error
      
      * add skip unittest when not gpu
      31f422ae