1. 24 7月, 2019 1 次提交
    • B
      Extend Matmul to support matrix multiplication with multiple heads (#18570) · 220eef60
      Bob Zhu 提交于
      * extend matmul op to support multiple head multiplication
      
      With the support of multiple head, the multiplication of two big matrixes is
      split into multiplication of several (head_number) small matrixes. e.g. if
      Mat A is [3, 24] and Mat B is [24, 4], when multiple A and B with head_number
      as 4, Mat A will be split as 4 matrix of [3, 6] and Mat B will be 4 matrix of
      [6, 4]. The result of final matrix will be 4 matrix of [3, 4], i.e. [3, 16].
      220eef60
  2. 04 3月, 2019 1 次提交
  3. 26 2月, 2019 1 次提交
  4. 22 2月, 2019 2 次提交
    • T
      Revert 15770 develop a6910f90 gelu mkl opt (#15872) · ee2321de
      tensor-tang 提交于
      * Revert "Optimze Gelu with MKL Erf function (#15770)"
      
      This reverts commit 676995c8.
      
      * test=develop
      ee2321de
    • Y
      Optimze Gelu with MKL Erf function (#15770) · 676995c8
      Yihua Xu 提交于
      * Optimize for gelu operator
      
      * Set up the low accuracy mode of MKL ERF function.
      
      test=develop
      
      * Only enable MKLML ERF when OS is linux
      
      * Use the speical mklml version included vmsErf function to verify gelu mkl kernel.
      
      test=develop
      
      * Add the CUDA macro to avoid NVCC's compile issue.
      
      test=develop
      
      * Add the TODO comments for mklml library modification.
      
      test=develop
      
      * Clean Code
      
      test=develop
      
      * Add the comment of marco for NVCC compiler.
      
      test=develop
      676995c8
  5. 13 12月, 2018 1 次提交
  6. 27 11月, 2018 1 次提交
  7. 16 11月, 2018 1 次提交
  8. 13 11月, 2018 1 次提交
  9. 22 8月, 2018 5 次提交
  10. 16 8月, 2018 1 次提交
  11. 06 8月, 2018 1 次提交
  12. 03 8月, 2018 2 次提交
  13. 05 7月, 2018 2 次提交
  14. 27 6月, 2018 1 次提交
  15. 20 6月, 2018 1 次提交
  16. 24 5月, 2018 1 次提交
  17. 21 5月, 2018 1 次提交
  18. 08 5月, 2018 2 次提交
  19. 07 5月, 2018 1 次提交
  20. 04 5月, 2018 1 次提交