1. 22 11月, 2018 1 次提交
    • C
      Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929) · 00b9e9a1
      chengduo 提交于
      * refine cublase
      test=develop
      
      * code refine
      
      * refine cublas
      
      * add GEMME_EX
      
      * add enable_cublas_tensor_op_math doc and add cublasCall
      test=develop
      
      * fix CublasCall for cuda version
      test=develop
      
      * fix error
      test=develop
      
      * fix GEMM_EX to be compatible with gcc 4.8
      test=develop
      
      * add GEMM_EX
      test=develop
      
      * to compatiable with gcc4.8
      test=develop
      00b9e9a1
  2. 28 9月, 2018 1 次提交
  3. 15 9月, 2018 1 次提交
  4. 21 8月, 2018 1 次提交
  5. 17 8月, 2018 1 次提交
  6. 01 6月, 2018 1 次提交
  7. 25 4月, 2018 1 次提交
  8. 11 4月, 2018 1 次提交
  9. 08 4月, 2018 1 次提交
  10. 07 4月, 2018 1 次提交
  11. 09 3月, 2018 1 次提交
    • K
      Add float16 GEMM math function on GPU (#8695) · 90215b78
      kexinzhao 提交于
      * test cpu float16 data transform
      
      * add isnan etc
      
      * small fix
      
      * fix containsNAN test error
      
      * add data_type transform GPU test
      
      * add float16 GPU example
      
      * fix error
      
      * fix GPU test error
      
      * initial commit
      
      * fix error
      
      * small fix
      
      * add more gemm fp16 tests
      
      * fix error
      
      * add utility function
      90215b78
  12. 12 2月, 2018 1 次提交
  13. 10 2月, 2018 2 次提交
  14. 11 11月, 2017 1 次提交
  15. 18 10月, 2017 1 次提交
    • M
      MatMul operator (#4856) · 16489827
      Markus Kliegl 提交于
      * initial matmul operator
      
      Similar to np.matmul, but also has transpose_X and transpose_Y flags,
      and only supports tensors from rank 1 to 3 inclusive.
      
      For GPU, uses cublas?gemmStridedBatched. For CPU, uses
      cblas_?gemm_batch if available via MKL; otherwise a simple serial
      implementation that loops over the batch dimension is employed for now.
      16489827
  16. 10 8月, 2017 3 次提交
  17. 13 7月, 2017 1 次提交
  18. 11 7月, 2017 2 次提交
  19. 04 7月, 2017 3 次提交
  20. 03 7月, 2017 2 次提交