paddle/operators/math/matmul.h · b325213150e2ded487a93b1a552113267f13cfb6 · PaddlePaddle / Paddle


Committed by Markus Kliegl on Oct 17, 2017

* initial matmul operator

Similar to np.matmul, but also has transpose_X and transpose_Y flags,
and only supports tensors from rank 1 to 3 inclusive.

For GPU, uses cublas?gemmStridedBatched. For CPU, uses
cblas_?gemm_batch if available via MKL; otherwise a simple serial
implementation that loops over the batch dimension is employed for now.
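The serial CPU fallback described in the commit message can be pictured with a minimal sketch. This is not the code in matmul.h: the function name `BatchedMatMulSerial`, its parameter list, and the flat row-major `std::vector<float>` layout are all assumptions made for illustration. It loops over the batch dimension and applies the two transpose flags by changing how the stored elements are indexed. Rank-1 and rank-2 inputs would presumably be handled by treating them as a batch of size 1, which is also an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not Paddle's API): serial batched matrix multiply.
// x holds `batch` row-major matrices that are M x K after the optional
// transpose (so stored as K x M when trans_x is true); y likewise holds
// matrices that are K x N after the optional transpose.
// Returns `batch` row-major M x N matrices, stored back to back.
std::vector<float> BatchedMatMulSerial(const std::vector<float>& x, bool trans_x,
                                       const std::vector<float>& y, bool trans_y,
                                       std::size_t batch, std::size_t m,
                                       std::size_t n, std::size_t k) {
  assert(x.size() == batch * m * k);
  assert(y.size() == batch * k * n);
  std::vector<float> out(batch * m * n, 0.0f);

  // Plain loop over the batch dimension: one independent GEMM per step.
  for (std::size_t b = 0; b < batch; ++b) {
    const float* xb = x.data() + b * m * k;
    const float* yb = y.data() + b * k * n;
    float* ob = out.data() + b * m * n;
    for (std::size_t i = 0; i < m; ++i) {
      for (std::size_t j = 0; j < n; ++j) {
        float acc = 0.0f;
        for (std::size_t p = 0; p < k; ++p) {
          // The transpose flags only change how stored elements are indexed.
          const float xv = trans_x ? xb[p * m + i] : xb[i * k + p];
          const float yv = trans_y ? yb[j * k + p] : yb[p * n + j];
          acc += xv * yv;
        }
        ob[i * n + j] = acc;
      }
    }
  }
  return out;
}
```

A batched BLAS routine replaces the outer loop with a single library call, which is why the commit prefers cblas_?gemm_batch or cublas?gemmStridedBatched when they are available.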

16489827

matmul.h 4.2 KB

PaddlePaddle / Paddle: synced successfully about 2 years ago
