to avoid cause issues for unset no_weight_decay models.
there seems be a diff for optimizer about using [] and [{"params":}, {"params":}] params
Showing
想要评论请 注册 或 登录
there seems be a diff for optimizer about using [] and [{"params":}, {"params":}] params