Support Linear operation in cuBLASLt and plug into attn_gemm and fusedLinear backward op (#52028)
* first commit
* restructure C++ interface to divide linear from matmulwithcublaslt
* finish building in cublaslt impl
* fix code bugs
* fix host cost
* add some changes