1. 25 2月, 2019 1 次提交
  2. 22 2月, 2019 2 次提交
    • T
      Revert 15770 develop a6910f90 gelu mkl opt (#15872) · ee2321de
      tensor-tang 提交于
      * Revert "Optimze Gelu with MKL Erf function (#15770)"
      
      This reverts commit 676995c8.
      
      * test=develop
      ee2321de
    • Y
      Optimze Gelu with MKL Erf function (#15770) · 676995c8
      Yihua Xu 提交于
      * Optimize for gelu operator
      
      * Set up the low accuracy mode of MKL ERF function.
      
      test=develop
      
      * Only enable MKLML ERF when OS is linux
      
      * Use the speical mklml version included vmsErf function to verify gelu mkl kernel.
      
      test=develop
      
      * Add the CUDA macro to avoid NVCC's compile issue.
      
      test=develop
      
      * Add the TODO comments for mklml library modification.
      
      test=develop
      
      * Clean Code
      
      test=develop
      
      * Add the comment of marco for NVCC compiler.
      
      test=develop
      676995c8
  3. 20 2月, 2019 3 次提交
  4. 19 2月, 2019 7 次提交
  5. 15 2月, 2019 1 次提交
  6. 14 2月, 2019 1 次提交
  7. 12 2月, 2019 1 次提交
  8. 11 2月, 2019 1 次提交
  9. 01 2月, 2019 3 次提交
  10. 29 1月, 2019 1 次提交
  11. 25 1月, 2019 2 次提交
  12. 21 1月, 2019 8 次提交
  13. 18 1月, 2019 1 次提交
  14. 16 1月, 2019 1 次提交
    • Y
      Optimize while_op for test (#14764) · 568cc2ff
      Yiqun Liu 提交于
      * Simplify the compare op for CPU.
      
      * Use asynchronous tensor copy in reshape_op's kernel.
      
      * Optimize while_op for test, avoiding creating variables every time.
      test=develop
      
      * Enable the cache of kernel type and kernel function.
      test=develop
      
      * Enable profiling with gperftools.
      
      * Remove flags for testing, and fix the linking error.
      test=develop
      
      * Delete the codes of ChooseKernel.
      test=develop
      
      * Fix bug when preparing ExecutorPrepareContext for while_op.
      
      * Fix missing depending on grpc libraries.
      
      * Remove the redundant print.
      test=develop
      
      * Follow comments.
      
      * Remove the codes related to prepare the ExecutorPrepareContext for while_op.
      test=develop
      568cc2ff
  15. 10 1月, 2019 2 次提交
  16. 09 1月, 2019 2 次提交
  17. 08 1月, 2019 3 次提交