1. 11 1月, 2023 1 次提交
    • Y
      add FusedLinear pass (#49606) · 0f08a432
      yuehuayingxueluo 提交于
      * add FusedLinear pass
      
      * add fused_op_list and renname PASSES to OP_FUSION
      
      * add fused_passes_list to constants.py
      
      * add test_passes.py
      
      * fix test_fused_passes.py
      
      * fix add if float(paddle.version.cuda()) >= 11.6:
      
      * renamed test_fused_passes.py
      
      * fix CMakeList.txt
      0f08a432
  2. 10 1月, 2023 1 次提交
  3. 07 1月, 2023 1 次提交
    • R
      Enable standalone executor for fleet training (#49293) · 67fc8e93
      Ruibiao Chen 提交于
      * Enable standalone executor for fleet training
      
      * Update code
      
      * Replace use_standalone_executor utils in auto parallel
      
      * Update code
      
      * Diable standalone executor for test_pass_sharding
      
      * Update code
      
      * Set sequential run for auto parallel
      
      * Fix dist_attr bug
      
      * Set sequential run for auto parallel
      67fc8e93
  4. 06 1月, 2023 1 次提交
    • Y
      [Auto Parallel] Merge dist attrs from python into c++ (#49214) · c7899074
      Yulong Ao 提交于
      * [Auto Parallel] Rename methods of ProcessMesh
      
      * [Auto Parallel] Impl the python process_mesh by the c++ one
      
      * [Auto Parallel] Add some minor modifications
      
      * [Auto Parallel] Rename some methods
      
      * [Auto Parallel] Remove unnecessary codes
      
      * [Auto Parallel] Add back some removed files
      
      * [Auto Parallel] Fix bugs
      
      * [Auto Parallel] Fix a bug
      
      * Update process_mesh.cc
      
      * [Auto Parallel] Merge dist attrs of Python into C++
      
      * [Auto Parallel] Add back deleted importing
      
      * [Auto Parallel] Add back removed unittest
      
      * [Auto Parallel] Remove type qualifiers of return types
      
      * [Auto Parallel] Fix some bugs
      
      * [Auto Parallel] Fix a bug of the quant pass
      
      * [Auto Parallel] Fix the code style
      c7899074
  5. 30 12月, 2022 2 次提交
  6. 27 12月, 2022 1 次提交
  7. 25 12月, 2022 1 次提交
  8. 21 12月, 2022 1 次提交
  9. 14 12月, 2022 1 次提交
    • Z
      [AutoParallel] recompute tuning (#48608) · 170a31f9
      zhaoyingli 提交于
      * [AutoParallel] recompute tuning
      
      * fix conflict
      
      * update comment
      
      * bug fix
      
      * update rc algo
      
      * tiny fix
      
      * fix clear process_group
      
      * remove comment
      
      * update segment print
      
      * fix import OpRole
      
      * adapt amp pass and grad_clip pass for opt_tuner
      
      * update tuning config
      
      * fix import
      
      * annotate recompute info on ops and upgrade recompute pass
      
      * add op_namescope for seed op
      
      * record reserved vars
      
      * fix recompute var's dist_attr
      
      * fix strategy unittest
      
      * adapt for fp16
      
      * update unittest
      
      * revert copy opt
      
      * update unittest
      
      * rename set_recompute_segments
      
      * fix unittest
      170a31f9
  10. 29 11月, 2022 1 次提交
  11. 18 11月, 2022 1 次提交
  12. 09 11月, 2022 1 次提交
  13. 07 11月, 2022 1 次提交
  14. 28 10月, 2022 1 次提交
  15. 23 10月, 2022 1 次提交
  16. 21 10月, 2022 1 次提交
  17. 20 10月, 2022 1 次提交
  18. 18 10月, 2022 2 次提交
  19. 14 10月, 2022 2 次提交
  20. 12 10月, 2022 2 次提交
    • Y
      [Auto Parallel] Improve the fine-grained APIs (#46552) · 686fa07a
      Yulong Ao 提交于
      * [Auto Parallel] Suppport different dataloaders
      
      * [Auto Parallel] Add num_shards config for dataset
      
      * [Auto Parallel] Unify the logger and outputs of Engine API
      
      * [Auto Parallel] Fix the bugs of to_static
      
      * [Auto Parallel] Adjust the test_to_static.py
      
      * [Auto Parallel] Add the prepare API and replace __call__ with run
      
      * [Auto Parallel] Improve the private implementations of Engine
      
      * [Auto Parallel] Set capacity of dataloader for opt tuning
      
      * [Auto Parallel] [WIP] Change the fine-grained API
      
      * [Auto Parallel] Improve APIs to support different user cases
      
      * [Auto Parallel] Add removed config
      
      * [Auto Parallel] Add imports
      
      * [Auto Parallel] Fix bugs for to_static
      
      * [Auto Parallel] Remove unnecessary imports
      686fa07a
    • N
      [CodeStyle][F401] remove unused imports in python/paddle/distributed (#46758) · fe716a0b
      Nyakku Shigure 提交于
      * [CodeStyle][F401] remove unused import in python/paddle/distributed
      
      * remove pass
      
      * empty commit
      
      * Fix ValueError: list.remove(x): x not in list for meta_optimizer_names.
      
      Fix ValueError: list.remove(x): x not in list for meta_optimizer_names.
      
      * Fix split import.
      
      Fix split import.
      
      * add noqa after meta_optimizers in factory
      
      * restort collective ops
      
      * expand `import *`
      
      * add noqa after required imports
      
      * try to fix APIs without core.ops
      
      * Revert "try to fix APIs without core.ops"
      
      This reverts commit 6172beaf601e84bf61f2490c12c4739f0edaa5eb.
      
      * fix an increment
      
      * empty commit
      
      * add noqa after required imports
      
      * expand `import *`, fix ci error
      Co-authored-by: NShuangchi He <34329208+Yulv-git@users.noreply.github.com>
      fe716a0b
  21. 10 10月, 2022 1 次提交
  22. 28 9月, 2022 1 次提交
  23. 27 9月, 2022 1 次提交
  24. 20 9月, 2022 1 次提交
  25. 19 9月, 2022 1 次提交
  26. 17 9月, 2022 1 次提交
  27. 16 9月, 2022 1 次提交
  28. 15 9月, 2022 1 次提交
    • Y
      [Auto Parallel] Improve the APIs (#45776) · b042a3b1
      Yulong Ao 提交于
      * [Auto Parallel] Use c++ dist attr in the completion process
      
      * [Auto Parallel] Add minor changes
      
      * [Auto Parallel] Use c++ dist attr in the completion process
      
      * [Auto Parallel] Add minor changes
      
      * [Auto Parallel] Add the serialization process for dist attrs
      
      * [Auto Parallel] Remove unnecessary comments
      
      * [Auto Parallel] Fix some bugs
      
      * [Auto Parallel] Fix the code style
      
      * [Auto Parallel] Remove unnecessary impls
      
      * [Auto Parallel] Fix the importing error
      
      * [Auto Parallel] Fix the copy from bugs of op dist attr
      
      * [Auto Parallel] Replace the use of constexpr if
      
      * [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh
      
      * [Auto Parallel] Change API of the completion unittest
      
      * [Auto Parallel] Fix the bug when set_attr an int
      
      * [Auto Parallel] Add the unittest for the serialization
      
      * [Auto Parallel] Add some unit tests
      
      * [Auto Paralle] Unify the strategy
      
      * [Auto Parallel] Improve the engine api
      
      * [Auto Parallel] Reset the changes made to the framework
      
      * [Auto Parallel] Change the engine unittest
      
      * [Auto Parallel] Update API of the completion and partitioner
      
      * [Auto Parallel] Update unit tests using engine api
      
      * update shard annotation
      
      * [Auto Parallel] Remove the modifications of other modules
      
      * [Auto Parallel] Add docs for APIs
      
      * add new strategy
      
      * [Auto Parallel] Replace the logger
      
      * [Auto Parallel] Restore the test_program.py
      
      * [Auto Parallel] Change the import rules
      
      * [Auto Parallel] Add the examples for Engine
      
      * [Auto Parallel] Do some minor changes
      
      * [Auto Parallel] Remove yaml dependency
      
      * [Auto Parallel] Fix the unittests
      
      * add valid after train
      
      * bug fix
      Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com>
      Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
      Co-authored-by: Ncaozhou <48191911+Caozhou1995@users.noreply.github.com>
      b042a3b1
  29. 14 9月, 2022 1 次提交
  30. 09 9月, 2022 2 次提交
  31. 07 9月, 2022 1 次提交
  32. 23 8月, 2022 2 次提交
  33. 18 8月, 2022 1 次提交
  34. 15 8月, 2022 1 次提交