    MatMul operator (#4856) · 16489827
    Markus Kliegl committed
    * initial matmul operator
    
    Similar to np.matmul, but also has transpose_X and transpose_Y flags,
    and only supports tensors from rank 1 to 3 inclusive.
    
    For GPU, uses cublas?gemmStridedBatched. For CPU, uses
    cblas_?gemm_batch if available via MKL; otherwise a simple serial
    implementation that loops over the batch dimension is employed for now.
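
The serial CPU fallback described above can be sketched in plain Python. This is a minimal illustration, not the operator's actual code: the helper names `transpose`, `gemm`, and `batched_matmul` are hypothetical, the keyword arguments `transpose_x` and `transpose_y` only mirror the transpose_X and transpose_Y flags named in the commit message, and only the rank-3 (batched) case is covered. Rank-1 and rank-2 inputs are left out.

```python
def transpose(m):
    """Return the transpose of a 2-D matrix given as a list of rows."""
    return [list(row) for row in zip(*m)]


def gemm(a, b):
    """Plain 2-D matrix product of a (m x k) and b (k x n)."""
    assert len(a[0]) == len(b), "inner dimensions must match"
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def batched_matmul(x, y, transpose_x=False, transpose_y=False):
    """Multiply two rank-3 inputs one batch entry at a time.

    x and y are lists of 2-D matrices with the same batch size.
    Each flag transposes the last two dimensions of its operand
    before the product (an assumption about the flag semantics).
    """
    assert len(x) == len(y), "batch sizes must match"
    out = []
    for a, b in zip(x, y):  # serial loop over the batch dimension
        if transpose_x:
            a = transpose(a)
        if transpose_y:
            b = transpose(b)
        out.append(gemm(a, b))
    return out


x = [[[1, 2], [3, 4]], [[1, 0], [0, 1]]]
y = [[[5, 6], [7, 8]], [[2, 3], [4, 5]]]
plain = batched_matmul(x, y)
with_ty = batched_matmul(x, y, transpose_y=True)
```

Looping over the batch keeps each step an ordinary 2-D product, which is the same per-batch unit of work that a batched GEMM routine performs in a single call.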
test_matmul_op.py 3.7 KB