"paddle/fluid/operators/expand_op.cu" does not exist on "0560733c2e4492db5ae0af2553e7fd7b6d883007"
    Optimize the layer_norm operator with AVX intrinsic function (#14417) · f4c869d8
    Committed by Yihua Xu
    * Optimize layer_norm operator with AVX intrinsic functions
    
    * Revert the wrong modifications
    
    * Implement the jit kernel for layer_norm operator
    
    * Add math header file to fix the compile issue (test=develop)
    
    * Add math header file to fix the compile issue (test=develop)
    
    * Fixed the intrinsic header file issue (test=develop)
    
    * Fix the conflicts (test=develop)
    
    * Revert for CUDA compiler (test=develop)
    
    * Fixed the CUDA dependency (test=develop)
    
    * Fix the macro issues (test=develop)
layer_norm_op.h 12.0 KB