paddle/fluid/operators/layer_norm_op.h · 2a84054372fbc8310cf98b0dedaaad849f78f9c7 · PaddlePaddle / Paddle

Optimize the layer_norm operator with AVX intrinsic function (#14417) · f4c869d8

由 Yihua Xu 提交于 11月 19, 2018

* Optimize layer_norm operator with AVX intrinsic functions

* Revert the wrong modifications

* Implement the jit kernel for layer_norm operator

* Add math headfile to fix the compile issue (test=develop)

* Add math headfile to fix the compile issue (test=develop)

* Fixed the intrinsic headfile issue (test=develop)

* Fix the conflicts (test=develop)

* Revert for CUDA compiler (test=develop)

* Fixed the cuda depency (test=develop)

* Fix the marco issues (test=develop)

f4c869d8

layer_norm_op.h 12.0 KB

PaddlePaddle / Paddle 大约 1 年 前同步成功

Replace layer_norm_op.h

PaddlePaddle / Paddle
大约 1 年前同步成功