    Extend Matmul to support matrix multiplication with multiple heads (#18570) · 220eef60
    Bob Zhu committed
    * extend matmul op to support multiple head multiplication
    
    With multi-head support, the multiplication of two large matrices is split
    into head_number multiplications of smaller matrices. For example, if Mat A
    is [3, 24] and Mat B is [24, 4], multiplying A and B with head_number set to
    4 splits Mat A into 4 matrices of [3, 6] and Mat B into 4 matrices of
    [6, 4]. The result is 4 matrices of [3, 4], which together form a final
    matrix of [3, 16].
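
    The shape arithmetic in this message can be checked with a small
    pure-Python sketch. It is not the operator's actual kernel: the helper
    names (matmul, multi_head_matmul) are hypothetical, and it assumes that A
    is split along its columns, B along its rows, and that the per-head
    results are concatenated along the column axis, which is the reading that
    matches the [3, 16] result above.

```python
def matmul(a, b):
    """Plain matrix product of two row-major lists of lists."""
    inner = len(b)
    assert all(len(row) == inner for row in a), "inner dimensions must match"
    cols = len(b[0])
    return [
        [sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for row in a
    ]


def multi_head_matmul(a, b, head_number):
    """Hypothetical multi-head product: A is [m, k], B is [k, n].

    Assumption: A is cut into head_number column blocks of width
    k / head_number, B into head_number row blocks of the same height,
    and the per-head [m, n] products are concatenated column-wise,
    giving an [m, head_number * n] result.
    """
    m, k = len(a), len(a[0])
    assert len(b) == k, "inner dimensions must match"
    assert k % head_number == 0, "k must be divisible by head_number"
    sub_k = k // head_number

    out = [[] for _ in range(m)]
    for h in range(head_number):
        lo, hi = h * sub_k, (h + 1) * sub_k
        a_h = [row[lo:hi] for row in a]  # [m, sub_k] column block of A
        b_h = b[lo:hi]                   # [sub_k, n] row block of B
        c_h = matmul(a_h, b_h)           # [m, n] product for this head
        for i in range(m):
            out[i].extend(c_h[i])        # append this head's columns
    return out


if __name__ == "__main__":
    # Shapes from the commit message: A is [3, 24], B is [24, 4], 4 heads.
    a = [[1] * 24 for _ in range(3)]
    b = [[1] * 4 for _ in range(24)]
    c = multi_head_matmul(a, b, head_number=4)
    print(len(c), len(c[0]))  # rows and columns of the combined result
```

    With head_number set to 1 the sketch reduces to an ordinary matrix
    product, so existing callers of a single-head multiplication would see no
    change in behavior under these assumptions.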
CMakeLists.txt 12.7 KB