exclude lr scheduler's state from accumulators_holder (#33984)
* exclude lr scheduler's state from accumulators_holder
* fix when there is no learning rate scheduler
* make a copy of the loaded state dict to avoid modifying it
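
The three changes above can be sketched in plain Python roughly as follows. This is a minimal illustration, not the actual patch: the helper name `split_optimizer_state`, the `"LR_Scheduler"` key, and the sample accumulator name are assumptions made for the example.

```python
def split_optimizer_state(loaded_state, lr_key="LR_Scheduler"):
    """Separate the LR scheduler entry from the accumulator entries.

    `lr_key` is an assumed key name for the scheduler's state.
    """
    # Shallow copy so the caller's loaded state dict is not modified.
    state = dict(loaded_state)
    # pop() with a default covers checkpoints saved without a scheduler.
    lr_state = state.pop(lr_key, None)
    return state, lr_state


# Checkpoint that contains a scheduler entry (hypothetical contents).
loaded = {
    "linear_0.w_0_moment1_0": [0.1, 0.2],
    "LR_Scheduler": {"last_epoch": 3, "last_lr": 0.01},
}
accumulators, lr_state = split_optimizer_state(loaded)

# Checkpoint saved with a plain float learning rate, so no scheduler entry.
plain_accumulators, plain_lr_state = split_optimizer_state(
    {"linear_0.w_0_moment1_0": [0.1]}
)
```

Only `accumulators` would then be kept as the holder for pending accumulator values, while `lr_state`, when it is not `None`, would be handed to the scheduler separately.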