Added matmul_v2 BF16/FP32 FWD kernel (#33750)
* added matmul_v2 bf16/fp32 FWD kernel added matmul_v2 bf16/fp32 FWD kernel * added formatting * removed some tests due to timeout in CI * refactored tests * merged tests classes into one file * minor change * removed test guard for CUDA * remove skipIf * changes after review * formated one file * minor change * added skipping UT in CUDA place
Showing
想要评论请 注册 或 登录