Created by: lidanqing-intel
This PR improves mobilenetv2 INT8 performance while keeping good accuracy.
We have to use relu instead of brelu as the post-op in the INT8 conv2d kernel, because INT8 brelu as a post-op is not enabled in mkldnn v0.18. I added a TODO and comments describing what will change once v0.20 is enabled.
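
For reviewers less familiar with the two activations, here is a minimal sketch in plain Python (not the mkldnn kernel code) of where relu and brelu differ. The bounds `t_min=0.0` and `t_max=6.0` are an assumption chosen to match the ReLU6 form commonly used in MobileNetV2, and the function names and sample values are illustrative only.

```python
def relu(x):
    # Plain ReLU: clamp negatives to zero, no upper bound.
    return max(0.0, x)


def brelu(x, t_min=0.0, t_max=6.0):
    # Bounded ReLU: clamp into [t_min, t_max].
    # The default bounds are an assumption (ReLU6-style), not taken from this PR.
    return min(max(x, t_min), t_max)


# Illustrative sample activations, not real model data.
samples = [-2.0, 0.0, 3.5, 6.0, 8.25]

relu_out = [relu(v) for v in samples]
brelu_out = [brelu(v) for v in samples]

# Inputs on which the two activations disagree.
differs = [v for v, r, b in zip(samples, relu_out, brelu_out) if r != b]

print(relu_out)
print(brelu_out)
print(differs)
```

The two functions agree for every input at or below the upper bound and differ only above it, so the relu post-op behaves differently from brelu only on activations that exceed that bound.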
The performance of mobilenetv2 with this PR is as follows:
INT8/FP32 | Top1 Accuracy | Performance |
---|---|---|
FP32 | 71.90% | X |
INT8 | 71.43% | 1.92 X |

Test machine: Intel(R) Core(TM) i9-7940X CPU @ 3.10GHz, 14 cores, paddle_num_threads = 14
Performance of all the int8 models on CLX and SKX will be delivered by this Friday.