提交 06452f1a 编写于 作者: B bingyanghuang 提交者: Tao Luo

update int8 doc (#20738)

上级 419188d7
...@@ -42,25 +42,25 @@ We provide the results of accuracy and performance measured on Intel(R) Xeon(R) ...@@ -42,25 +42,25 @@ We provide the results of accuracy and performance measured on Intel(R) Xeon(R)
| Model | FP32 Accuracy | INT8 Accuracy | Accuracy Diff(FP32-INT8) | | Model | FP32 Accuracy | INT8 Accuracy | Accuracy Diff(FP32-INT8) |
| :----------: | :-------------: | :------------: | :--------------: | | :----------: | :-------------: | :------------: | :--------------: |
| GoogleNet | 70.50% | 69.81% | 0.69% | | GoogleNet | 70.50% | 70.08% | 0.42% |
| MobileNet-V1 | 70.78% | 70.42% | 0.36% | | MobileNet-V1 | 70.78% | 70.41% | 0.37% |
| MobileNet-V2 | 71.90% | 71.35% | 0.55% | | MobileNet-V2 | 71.90% | 71.34% | 0.56% |
| ResNet-101 | 77.50% | 77.42% | 0.08% | | ResNet-101 | 77.50% | 77.43% | 0.07% |
| ResNet-50 | 76.63% | 76.52% | 0.11% | | ResNet-50 | 76.63% | 76.57% | 0.06% |
| VGG16 | 72.08% | 72.03% | 0.05% | | VGG16 | 72.08% | 72.05% | 0.03% |
| VGG19 | 72.57% | 72.55% | 0.02% | | VGG19 | 72.57% | 72.57% | 0.00% |
>**II. Throughput on Intel(R) Xeon(R) Gold 6271 (batch size 1 on single core)** >**II. Throughput on Intel(R) Xeon(R) Gold 6271 (batch size 1 on single core)**
| Model | FP32 Throughput(images/s) | INT8 Throughput(images/s) | Ratio(INT8/FP32)| | Model | FP32 Throughput(images/s) | INT8 Throughput(images/s) | Ratio(INT8/FP32)|
| :-----------:| :------------: | :------------: | :------------: | | :-----------:| :------------: | :------------: | :------------: |
| GoogleNet | 34.06 | 72.79 | 2.14 | | GoogleNet | 32.76 | 67.43 | 2.06 |
| MobileNet-V1 | 80.02 | 230.65 | 2.88 | | MobileNet-V1 | 73.96 | 218.82 | 2.96 |
| MobileNet-V2 | 99.38 | 206.92 | 2.08 | | MobileNet-V2 | 87.94 | 193.70 | 2.20 |
| ResNet-101 | 7.38 | 27.31 | 3.70 | | ResNet-101 | 7.17 | 26.37 | 3.42 |
| ResNet-50 | 13.71 | 50.55 | 3.69 | | ResNet-50 | 13.26 | 48.72 | 3.67 |
| VGG16 | 3.64 | 10.56 | 2.90 | | VGG16 | 3.47 | 10.10 | 2.91 |
| VGG19 | 2.95 | 9.02 | 3.05 | | VGG19 | 2.82 | 8.68 | 3.07 |
* ## Prepare dataset * ## Prepare dataset
......
...@@ -65,12 +65,12 @@ Notes: ...@@ -65,12 +65,12 @@ Notes:
| Model | Fake QAT Original Throughput(images/s) | INT8 Throughput(images/s) | Ratio(INT8/FP32)| | Model | Fake QAT Original Throughput(images/s) | INT8 Throughput(images/s) | Ratio(INT8/FP32)|
| :-----------:| :-------------------------: | :------------: | :------------: | | :-----------:| :-------------------------: | :------------: | :------------: |
| MobileNet-V1 | 13.66 | 114.98 | 8.42 | | MobileNet-V1 | 12.86 | 118.05 | 9.18 |
| MobileNet-V2 | 10.22 | 79.78 | 7.81 | | MobileNet-V2 | 9.76 | 85.89 | 8.80 |
| ResNet101 | 2.65 | 18.97 | 7.16 | | ResNet101 | 2.55 | 19.40 | 7.61 |
| ResNet50 | 4.58 | 35.09 | 7.66 | | ResNet50 | 4.39 | 35.78 | 8.15 |
| VGG16 | 2.38 | 9.93 | 4.17 | | VGG16 | 2.26 | 9.89 | 4.38 |
| VGG19 | 2.03 | 8.53 | 4.20 | | VGG19 | 1.96 | 8.41 | 4.29 |
## 3. How to reproduce the results ## 3. How to reproduce the results
Three steps to reproduce the above-mentioned accuracy results, and we take ResNet50 benchmark as an example: Three steps to reproduce the above-mentioned accuracy results, and we take ResNet50 benchmark as an example:
...@@ -95,7 +95,7 @@ cd /PATH/TO/DOWNLOAD/MODEL/ ...@@ -95,7 +95,7 @@ cd /PATH/TO/DOWNLOAD/MODEL/
wget http://paddle-inference-dist.bj.bcebos.com/int8/${MODEL_FILE_NAME} wget http://paddle-inference-dist.bj.bcebos.com/int8/${MODEL_FILE_NAME}
``` ```
To download and verify all the 7 models, you need to set `MODEL_NAME` to one of the following values in command line: Unzip the downloaded model to the folder.To verify all the 7 models, you need to set `MODEL_NAME` to one of the following values in command line:
```text ```text
QAT MKL-DNN 1.0 QAT MKL-DNN 1.0
MODEL_NAME=ResNet50, ResNet101, GoogleNet, MobileNetV1, MobileNetV2, VGG16, VGG19 MODEL_NAME=ResNet50, ResNet101, GoogleNet, MobileNetV1, MobileNetV2, VGG16, VGG19
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册