• Y
    Add auto completion module for auto parallel (#34813) · 93d862b0
    Yulong Ao 提交于
    * add auto_parallel dir
    
    * mv to paddle.distributed
    
    * add shard_xx api
    
    * add distributed attrs for var
    
    * add ut, test=develop
    
    * add dist
    
    * update
    
    * update
    
    * update
    
    * update
    
    * update
    
    * update, test=develop
    
    * update, test=develop
    
    * update, test=develop
    
    * update, test=develop
    
    * update, test=develop
    
    * update, test=develop
    
    * update, test=develop
    
    * update
    
    * update
    
    * update
    
    * update
    
    * update
    
    * update, test=develop
    
    * update, test=develop
    
    * update
    
    * update
    
    * delete unused proto
    
    * resotre op_desc
    
    * restore type_defs
    
    * update var_desc
    
    * remove dimss_mapping for proto_pybind
    
    * update interface.py
    
    * update framework.py
    
    * update
    
    * update
    
    * add auto_parallel dir
    
    * mv to paddle.distributed
    
    * add shard_xx api
    
    * add distributed attrs for var
    
    * add ut, test=develop
    
    * [WIP] Add the auto completion feature and related codes
    
    * [WIP] Improve the auto completion and related codes
    
    * [WIP] Make the auto completion to support data-parallel
    
    * [WIP] Make the completion support mp and dp+mp
    
    * [WIP] Refactor auto completion unit test for MLP
    
    * [WIP] Refactor the implementation of DistributedOperatorImpl
    
    * [WIP] Improve dims_mapping update rule and fix a bug
    
    * [WIP] Support auto completion for one transformer decoder layer
    
    * [WIP] Add a minor change
    
    * [WIP] Fix a bug within the uint test
    
    * Shard XShape tensor, add embedding completion and refactor code
    
    * Add the distributed_operators dir to setup.py.in
    
    * Improve the completion process and add the unittest for gpt
    
    * fix process_mesh ut
    
    * fix process_mesh ut
    
    * update
    
    * update, test=develop
    
    * Add support for automatically completing distributed attrs of special ops
    
    * update
    
    * update
    
    * update
    
    * fix doc sample codes, test=develop
    
    * improve coverage, test=develop
    
    * add static_mode check, test=develop
    
    * Model the cluster for cost model and physical mapping
    
    * update, test=develop
    
    * add set_placement, test=develop
    
    * Add the check to make sure the candidate tensors' size is great than zero
    
    * update doc, test=develop
    
    * update doc, test=develop
    
    * update doc, test=develop
    
    * update doc, test=develop
    
    * update, test=develop
    
    * Auto mark dist attrs annotated by user
    
    * update ndarray to nested list, test=develop
    
    * update, test=develop
    
    * Add auto-completion module for auto-parallel (based on PR#33804)
    
    * Remove unnecessary files
    
    * Remove unrelated files for the auto completion pr
    
    * Update the unit test to improve the coverage
    
    * Modify codes based on reviews
    
    * Minor changes for CI
    
    * Improve some codes based on new comments
    
    * Fix bugs caused by shallow copy in attributes.py
    * Imporve amend_distributed_attr_for_program in context.py
    * Other changes for weihang's comments
    Co-authored-by: Nsandyhouse <lilong12@baidu.com>
    93d862b0
op_desc.h 5.4 KB