paddle/operators/matmul_op.h · 164898277c3f274e6d57c4baceb00fe62a8b769c · Crayon鑫 / Paddle

由 Markus Kliegl 提交于 10月 17, 2017

* initial matmul operator

Similar to np.matmul, but also has transpose_X and transpose_Y flags,
and only supports tensors from rank 1 to 3 inclusive.

For GPU, uses cublas?gemmStridedBatched. For CPU, uses
cblas_?gemm_batch if available via MKL; otherwise a simple serial
implementation that loops over the batch dimension is employed for now.

16489827

matmul_op.h 7.9 KB

Crayon鑫 / Paddle 与 Fork 源项目一致

Replace matmul_op.h

Crayon鑫 / Paddle
与 Fork 源项目一致