paddle/operators/matmul_op.cc · 4d112b7d0440fbcc1cf33bbb4a78d7ea473014fb · PaddlePaddle / Paddle

Committed by Markus Kliegl on Oct 17, 2017

* initial matmul operator

Similar to np.matmul, but also has transpose_X and transpose_Y flags,
and only supports tensors from rank 1 to 3 inclusive.

For GPU, uses cublas?gemmStridedBatched. For CPU, uses
cblas_?gemm_batch if available via MKL; otherwise a simple serial
implementation that loops over the batch dimension is employed for now.
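The serial CPU fallback mentioned above can be pictured roughly as follows. This is a minimal sketch, not the code from matmul_op.cc: `NaiveGemm` and `BatchedMatMul` are hypothetical names, the data is assumed to be row-major float with both operands already rank 3 and sharing one batch size, and a naive triple loop stands in for the BLAS GEMM call. Rank-1 and rank-2 handling is left out.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not Paddle's API): computes C = op(A) * op(B) for one
// pair of row-major matrices, where op(A) is m x k and op(B) is k x n.
// When trans_a is true, A is stored as k x m; when trans_b is true, B is
// stored as n x k.
void NaiveGemm(const float* a, const float* b, float* c, std::size_t m,
               std::size_t n, std::size_t k, bool trans_a, bool trans_b) {
  for (std::size_t i = 0; i < m; ++i) {
    for (std::size_t j = 0; j < n; ++j) {
      float sum = 0.0f;
      for (std::size_t p = 0; p < k; ++p) {
        const float av = trans_a ? a[p * m + i] : a[i * k + p];
        const float bv = trans_b ? b[j * k + p] : b[p * n + j];
        sum += av * bv;
      }
      c[i * n + j] = sum;
    }
  }
}

// Hypothetical serial fallback: loops over the batch dimension and multiplies
// one matrix pair per iteration, using fixed strides between batch items.
std::vector<float> BatchedMatMul(const std::vector<float>& x,
                                 const std::vector<float>& y,
                                 std::size_t batch, std::size_t m,
                                 std::size_t n, std::size_t k,
                                 bool transpose_x, bool transpose_y) {
  // Each batch item of X holds m*k values and each item of Y holds k*n,
  // regardless of the transpose flags (only the layout differs).
  assert(x.size() == batch * m * k);
  assert(y.size() == batch * k * n);
  std::vector<float> out(batch * m * n);
  for (std::size_t b = 0; b < batch; ++b) {
    NaiveGemm(x.data() + b * m * k, y.data() + b * k * n,
              out.data() + b * m * n, m, n, k, transpose_x, transpose_y);
  }
  return out;
}
```

The fixed per-item strides (`m * k`, `k * n`, `m * n`) mirror the strided-batch layout that the GPU and MKL paths are described as consuming, so in a design like this the fallback would differ only in running the batch items one after another.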

16489827

matmul_op.cc 6.9 KB

PaddlePaddle / Paddle: synced successfully more than 1 year ago
