• H
    support offload in sharding stage2 (#37904) · dfed4a63
    Haohongxiang 提交于
    * merge latest develop branch
    
    * fix bugs
    
    * update
    
    * fix bugs for unittest
    
    * modify for less use of gpu mem
    
    * fix bugs of using _reset_grad_inplace_version
    
    * update
    
    * update
    
    * modify for CI-Coverage
    
    * retrick all CIs
    dfed4a63
sharding_utils.py 7.2 KB