    Optimize Gelu with MKL Erf function (#15770) · 676995c8
    Yihua Xu committed
    * Optimize for gelu operator
    
    * Set up the low accuracy mode of the MKL ERF function.
    
    test=develop
    
    * Only enable MKLML ERF when the OS is Linux
    
    * Use the special mklml version that includes the vmsErf function to verify the gelu mkl kernel.
    
    test=develop
    
    * Add the CUDA macro to avoid NVCC's compile issue.
    
    test=develop
    
    * Add the TODO comments for mklml library modification.
    
    test=develop
    
    * Clean Code
    
    test=develop
    
    * Add a comment on the macro for the NVCC compiler.
    
    test=develop
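
    The commit message above describes computing GELU through an erf routine. As a rough illustration only, the sketch below uses the standard erf-based definition gelu(x) = 0.5 * x * (1 + erf(x / sqrt(2))) and splits it into an erf pass and a scaling pass. The helper name GeluErf is hypothetical and this is not the code from the commit; it uses std::erf in place of the MKL vmsErf call the message mentions, and it assumes nothing about how blas_impl.h wires that call in.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper, not the Paddle kernel.
// Computes gelu(x) = 0.5 * x * (1 + erf(x / sqrt(2))) element-wise.
std::vector<float> GeluErf(const std::vector<float>& x) {
  const float kInvSqrt2 = 0.70710678f;  // 1 / sqrt(2)
  std::vector<float> out(x.size());

  // Pass 1: out[i] = erf(x[i] / sqrt(2)).
  // Assumption: a vectorized erf routine could replace this loop
  // by operating on a pre-scaled buffer.
  for (std::size_t i = 0; i < x.size(); ++i) {
    out[i] = std::erf(x[i] * kInvSqrt2);
  }

  // Pass 2: out[i] = 0.5 * x[i] * (1 + erf term).
  for (std::size_t i = 0; i < x.size(); ++i) {
    out[i] = 0.5f * x[i] * (1.0f + out[i]);
  }
  return out;
}
```

    Keeping the erf evaluation in its own pass is what would let a batched library routine be swapped in behind a platform or compiler guard, which appears to be the reason the commit message mentions the Linux-only and NVCC macros.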
blas_impl.h 18.7 KB