[NPU] Add sync_batch_norm and sync_batch_norm_grad NPU Kernel (#36320)
* add sync_batch_norm (support train, infer, and fp32, fp16, and NCHW, NHWC) * [NPU] Delete debug codes * [NPU] Remove FP16
Showing
此差异已折叠。
想要评论请 注册 或 登录
* add sync_batch_norm (support train, infer, and fp32, fp16, and NCHW, NHWC) * [NPU] Delete debug codes * [NPU] Remove FP16