“c12e970e274541e817aeac3d19024a09b1a46559”上不存在“develop/doc/api/v1/trainer_config_helpers/layers.html”
Optimize the layer_norm operator with AVX intrinsic function (#14417)
* Optimize layer_norm operator with AVX intrinsic functions * Revert the wrong modifications * Implement the jit kernel for layer_norm operator * Add math headfile to fix the compile issue (test=develop) * Add math headfile to fix the compile issue (test=develop) * Fixed the intrinsic headfile issue (test=develop) * Fix the conflicts (test=develop) * Revert for CUDA compiler (test=develop) * Fixed the cuda depency (test=develop) * Fix the marco issues (test=develop)
Showing
想要评论请 注册 或 登录