paddle/fluid/operators/math/jit_kernel.h · f4c869d872a62d99cfbbd3e3c5c5d0cf2db4d863 · PaddlePaddle / Paddle

Optimize the layer_norm operator with AVX intrinsic function (#14417) · f4c869d8

由 Yihua Xu 提交于 11月 19, 2018

* Optimize layer_norm operator with AVX intrinsic functions

* Revert the wrong modifications

* Implement the jit kernel for layer_norm operator

* Add math headfile to fix the compile issue (test=develop)

* Add math headfile to fix the compile issue (test=develop)

* Fixed the intrinsic headfile issue (test=develop)

* Fix the conflicts (test=develop)

* Revert for CUDA compiler (test=develop)

* Fixed the cuda depency (test=develop)

* Fix the marco issues (test=develop)

f4c869d8

jit_kernel.h 4.1 KB

PaddlePaddle / Paddle 大约 1 年 前同步成功

Replace jit_kernel.h

PaddlePaddle / Paddle
大约 1 年前同步成功