• Y
    Optimze Gelu with MKL Erf function (#15770) · 676995c8
    Yihua Xu 提交于
    * Optimize for gelu operator
    
    * Set up the low accuracy mode of MKL ERF function.
    
    test=develop
    
    * Only enable MKLML ERF when OS is linux
    
    * Use the speical mklml version included vmsErf function to verify gelu mkl kernel.
    
    test=develop
    
    * Add the CUDA macro to avoid NVCC's compile issue.
    
    test=develop
    
    * Add the TODO comments for mklml library modification.
    
    test=develop
    
    * Clean Code
    
    test=develop
    
    * Add the comment of marco for NVCC compiler.
    
    test=develop
    676995c8
blas_impl.h 18.7 KB