Improve mobilenetv2 INT8 performance by using INT8 relu as post-op (!17570) · 合并请求 · PaddlePaddle / Paddle

Improve mobilenetv2 INT8 performance by using INT8 relu as post-op !17570

Created by: lidanqing-intel

This PR improved mobilenetv2 INT8 performance with good accuracy.

We have to use relu instead of brelu as the post-op in INT8 conv2d kernel , because INT8 brelu as a post-op is not enabled in mkldnn v01.8. I add TODO and comments of what will be changed when v0.20 is enabled.

The performance of mobilenetv2 with this PR is follows:

INT8/FP32	Top1 Accuracy	Performance
FP32	71.90%	X
INT8	71.43%	1.92 X

test machine: Intel(R) Core(TM) i9-7940X CPU @ 3.10GHz, 14 Cores paddle_num_threads 14

Performance of all the int8 models on CLX and SKX will be delivered by this Friday.

PaddlePaddle / Paddle 1 年多 前同步成功

Improve mobilenetv2 INT8 performance by using INT8 relu as post-op !17570

PaddlePaddle / Paddle
1 年多前同步成功