• L
    Support Linear operation in cuBlaslt and plug into attn_gemm and fusedLinear forward op (#51124) · 2dfc3fa8
    limingshu 提交于
    * optimization for fused linear op
    
    * fix code format
    
    * optimization for linear fused forward
    
    * merge with develop
    
    * fix bugs for gemm_ephilog
    
    * package of cublaslt ephilogue type with enmu
    
    * final fix before code reviewing
    
    * fix missed fusedType typo
    
    * fix code according to review suggestions
    
    * fix windows ci error
    
    * change location of MatmulPlanner
    
    * add some changes for compiler error fix
    
    ---------
    2dfc3fa8
fused_gemm_epilogue_op.cc 14.2 KB