support offload in sharding stage2 (#37904)
* merge latest develop branch * fix bugs * update * fix bugs for unittest * modify for less use of gpu mem * fix bugs of using _reset_grad_inplace_version * update * update * modify for CI-Coverage * retrick all CIs
Showing
想要评论请 注册 或 登录