    [Cherry-pick][Auto Parallel] Improve the APIs (#46164) · c5cc4278
    Yulong Ao committed
    * [AutoParallel] adapt gradient merge pass (#45915)
    
    * adapt gradient merge
    
    * fix op_role
    
    * fix strategy
    
    * [Auto Parallel] Gradient Fuse Allreduce (#45643)
    
    * bugfix (#45332)
    
    * dist embedding support lookup table v1
    
    * add unittest
    
    * customize wait_comm
    
    * group gradients
    
    * bugfix
    
    * update program
    
    * [Auto Parallel] Improve the APIs (#45776)
    
    * [Auto Parallel] Use c++ dist attr in the completion process
    
    * [Auto Parallel] Add minor changes
    
    * [Auto Parallel] Use c++ dist attr in the completion process
    
    * [Auto Parallel] Add minor changes
    
    * [Auto Parallel] Add the serialization process for dist attrs
    
    * [Auto Parallel] Remove unnecessary comments
    
    * [Auto Parallel] Fix some bugs
    
    * [Auto Parallel] Fix the code style
    
    * [Auto Parallel] Remove unnecessary impls
    
    * [Auto Parallel] Fix the importing error
    
    * [Auto Parallel] Fix the copy from bugs of op dist attr
    
    * [Auto Parallel] Replace the use of constexpr if
    
    * [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh
    
    * [Auto Parallel] Change API of the completion unittest
    
    * [Auto Parallel] Fix the bug when set_attr an int
    
    * [Auto Parallel] Add the unittest for the serialization
    
    * [Auto Parallel] Add some unit tests
    
    * [Auto Parallel] Unify the strategy
    
    * [Auto Parallel] Improve the engine api
    
    * [Auto Parallel] Reset the changes made to the framework
    
    * [Auto Parallel] Change the engine unittest
    
    * [Auto Parallel] Update API of the completion and partitioner
    
    * [Auto Parallel] Update unit tests using engine api
    
    * update shard annotation
    
    * [Auto Parallel] Remove the modifications of other modules
    
    * [Auto Parallel] Add docs for APIs
    
    * add new strategy
    
    * [Auto Parallel] Replace the logger
    
    * [Auto Parallel] Restore the test_program.py
    
    * [Auto Parallel] Change the import rules
    
    * [Auto Parallel] Add the examples for Engine
    
    * [Auto Parallel] Do some minor changes
    
    * [Auto Parallel] Remove yaml dependency
    
    * [Auto Parallel] Fix the unittests
    
    * add valid after train
    
    * bug fix
    Co-authored-by: zhaoyingli <zhaoyingli@baidu.com>
    Co-authored-by: caozhou <caozhou@radi.ac.cn>
    Co-authored-by: caozhou <48191911+Caozhou1995@users.noreply.github.com>
    
    * [Auto Parallel] Bugfix allreduce fuse for MP (#46086)
    
    * bugfix
    
    * bugfix
    
    * typos fixed
    
    * update strategy (#46138)
    Co-authored-by: zhaoyingli <86812880+zhaoyinglia@users.noreply.github.com>
    Co-authored-by: JZ-LIANG <jianzhongliang10@gmail.com>
    Co-authored-by: zhaoyingli <zhaoyingli@baidu.com>
    Co-authored-by: caozhou <caozhou@radi.ac.cn>
    Co-authored-by: caozhou <48191911+Caozhou1995@users.noreply.github.com>
test_auto_parallel_reshard.py 13.4 KB