mma qk tensor_core (#48087)
* use mma for QK dot computing in fused_multi_transformer.
* Update fused_multi_transformer_op.cu.h