[NPU] refine update_loss_scaling npu kernel (#32580)
* refine update_loss_scaling npu kernel
* add mutable_data
* change Zerolike op to MemcpyAsync
* delete useless code
* add found_inf_vec
* add memcpy if not finite
* fix unittest