“b7efc45506419e9984414d7009d5aae60b9cbb53”上不存在“projects/547291213/imports.yml”
  • M
    MatMul operator (#4856) · 16489827
    Markus Kliegl 提交于
    * initial matmul operator
    
    Similar to np.matmul, but also has transpose_X and transpose_Y flags,
    and only supports tensors from rank 1 to 3 inclusive.
    
    For GPU, uses cublas?gemmStridedBatched. For CPU, uses
    cblas_?gemm_batch if available via MKL; otherwise a simple serial
    implementation that loops over the batch dimension is employed for now.
    16489827
math_function.cc 9.5 KB