Created by: LeoZhao-Intel
The native gelu_grad uses Eigen for the computation, but it does not take advantage of vector acceleration. This PR uses MKL vector math (VML) to accelerate it, as is already done in gelu.
A remaining issue is that the temporary buffers used to store intermediate values are allocated and deallocated on every iteration; any good idea on how to avoid this allocation?
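For reference, below is a minimal sketch of the kind of VML-based backward pass described above, not the PR's actual kernel. It assumes the erf-based GeLU formulation and only standard MKL VML calls (vdErf, vdSqr, vdExp, vdAdd, vdMul); the function name and buffer layout are hypothetical.

```cpp
#include <mkl_vml_functions.h>
#include <vector>

// Sketch of GeLU backward using MKL VML (erf formulation):
//   dy/dx = 0.5 * (1 + erf(x / sqrt(2))) + x * exp(-x^2 / 2) / sqrt(2*pi)
//   dx    = dout * dy/dx
// The std::vector temporaries below are the per-call intermediate buffers
// whose allocation/deallocation is the open issue mentioned above.
void gelu_grad_mkl(int n, const double* x, const double* dout, double* dx) {
  std::vector<double> first(n), second(n);

  // first = 0.5 * (1 + erf(x / sqrt(2)))
  for (int i = 0; i < n; ++i) first[i] = x[i] * 0.7071067811865475;  // x / sqrt(2)
  vdErf(n, first.data(), first.data());
  for (int i = 0; i < n; ++i) first[i] = 0.5 * (1.0 + first[i]);

  // second = x * exp(-x^2 / 2) / sqrt(2*pi)
  vdSqr(n, x, second.data());                                 // x^2
  for (int i = 0; i < n; ++i) second[i] = -0.5 * second[i];   // -x^2 / 2
  vdExp(n, second.data(), second.data());
  for (int i = 0; i < n; ++i)
    second[i] *= x[i] * 0.3989422804014327;                   // * x / sqrt(2*pi)

  // dx = dout * (first + second)
  vdAdd(n, first.data(), second.data(), first.data());
  vdMul(n, dout, first.data(), dx);
}
```

One possible direction for the allocation question is to reuse a cached scratch buffer (resized only when the input shape changes) instead of allocating the temporaries on every call, though that is just a suggestion and not part of this PR.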
test=develop