[bf16] add bf16 kernel: layer_norm p_norm reduce_sum (#39843)
* add layer norm * add p norm * add reduce sum * refine layer norm register bf16 for cudnn811 * add bf16 cast for hip * add unittest * refine rocm * refine layer_norm unittest * refine reduce op * refine unittest * enhance atol for reduce unittest
Showing
想要评论请 注册 或 登录