“6784688a8c97c0142cd85f4349bbf9880e9c94b0”上不存在“doc_cn/design/cluster_train/remote_parameter_updater.html”
[bf16] add bf16 kernel: layer_norm p_norm reduce_sum (#39843)
* add layer norm * add p norm * add reduce sum * refine layer norm register bf16 for cudnn811 * add bf16 cast for hip * add unittest * refine rocm * refine layer_norm unittest * refine reduce op * refine unittest * enhance atol for reduce unittest
Showing
想要评论请 注册 或 登录