add fp32 grad plus fp16 param in adamw (#51141)
* add fp32 grad plus fp16 param in adamw * add python UT * fix test case * in test_adamw_op py file, force the moment2 value LE 0 * add a compare option * remove bf16 fused adam kernel case
Showing
想要评论请 注册 或 登录