Support Linear operation in cuBlaslt and plug into attn_gemm and fusedLinear forward op (#51124)
* optimization for fused linear op * fix code format * optimization for linear fused forward * merge with develop * fix bugs for gemm_ephilog * package of cublaslt ephilogue type with enmu * final fix before code reviewing * fix missed fusedType typo * fix code according to review suggestions * fix windows ci error * change location of MatmulPlanner * add some changes for compiler error fix ---------
Showing
想要评论请 注册 或 登录