paddle/operators/math/matmul.h · 164898277c3f274e6d57c4baceb00fe62a8b769c · 溜达小灰鼠 / Paddle

由 Markus Kliegl 提交于 10月 17, 2017

* initial matmul operator

Similar to np.matmul, but also has transpose_X and transpose_Y flags,
and only supports tensors from rank 1 to 3 inclusive.

For GPU, uses cublas?gemmStridedBatched. For CPU, uses
cblas_?gemm_batch if available via MKL; otherwise a simple serial
implementation that loops over the batch dimension is employed for now.

16489827

matmul.h 4.2 KB

溜达小灰鼠 / Paddle 与 Fork 源项目一致

Replace matmul.h

溜达小灰鼠 / Paddle
与 Fork 源项目一致