Introducing MKL to softmax for inference (!14437) · 合并请求 · PaddlePaddle / Paddle

Introducing MKL to softmax for inference !14437

Created by: jczaja

This PR is introducing MKL based execution of softmax operator.

Capi DAM test's profiling shows ~2 times improvement in softmax op execution with this optimization. Num threads: 1 Batch: 1,8,32,128 Platform: Intel(R) Xeon(R) Platinum 8180 CPU @ 2.50GHz

Notes:

Optimization is enabled when Paddle is configured with: ON_INFER = ON flag
To have unit test for it , just build with ON_INFER=ON and run test_softmax_op.py

PaddlePaddle / Paddle 大约 2 年 前同步成功

Introducing MKL to softmax for inference !14437

PaddlePaddle / Paddle
大约 2 年前同步成功