add mobile speed

56c5f7ce · sibo2rr · b3488eb7 · 56c5f7ce · 56c5f7ce
Showing with 42 addition and 25 deletion

docs/zh_CN/algorithm_introduction/ImageNet_models.md docs/zh_CN/algorithm_introduction/ImageNet_models.md +20 -20

docs/zh_CN/models/PP-LCNet.md docs/zh_CN/models/PP-LCNet.md +22 -5

未找到文件。
--- a/docs/zh_CN/algorithm_introduction/ImageNet_models.md
+++ b/docs/zh_CN/algorithm_introduction/ImageNet_models.md
--- a/docs/zh_CN/models/PP-LCNet.md
+++ b/docs/zh_CN/models/PP-LCNet.md
@@ -15,10 +15,12 @@
   - [4.2 目标检测](#4.2)
   - [4.3 语义分割](#4.3)
 - [5. 基于 V100 GPU 的预测速度](#5)
- [6. 总结](#6)
- [7. 引用](#7)
+- [6. 基于 SD855 的预测速度](#6)
+- [7. 总结](#7)
+- [8. 引用](#8)

 <a name="1"></a>
+
 ## 1. 摘要

 在计算机视觉领域中，骨干网络的好坏直接影响到整个视觉任务的结果。在之前的一些工作中，相关的研究者普遍将 FLOPs 或者 Params 作为优化目的，但是在工业界真实落地的场景中，推理速度才是考量模型好坏的重要指标，然而，推理速度和准确性很难兼得。考虑到工业界有很多基于 Intel CPU 的应用，所以我们本次的工作旨在使骨干网络更好的适应 Intel CPU，从而得到一个速度更快、准确率更高的轻量级骨干网络，与此同时，目标检测、语义分割等下游视觉任务的性能也同样得到提升。
@@ -167,13 +169,28 @@ MobileNetV3-large-0.75x | 25.8 | 11.1 |

 <a name="6"></a>

-## 6. 总结
+## 6. 基于 SD855 的预测速度

-PP-LCNet 没有像学术界那样死扣极致的 FLOPs 与 Params，而是着眼于分析如何添加对 Intel CPU 友好的模块来提升模型的性能，这样可以更好的平衡准确率和推理时间，其中的实验结论也很适合其他网络结构设计的研究者，同时也为 NAS 搜索研究者提供了更小的搜索空间和一般结论。最终的 PP-LCNet 在产业界也可以更好的落地和应用。
+| Models        | SD855 time(ms)<br>bs=1, thread=1 | SD855 time(ms)<br/>bs=1, thread=2 | SD855 time(ms)<br/>bs=1, thread=4 |
+| ------------- | -------------------------------- | --------------------------------- | --------------------------------- |
+| PPLCNet_x0_25 | 2.30                             | 1.62                              | 1.32                              |
+| PPLCNet_x0_35 | 3.15                             | 2.11                              | 1.64                              |
+| PPLCNet_x0_5  | 4.27                             | 2.73                              | 1.92                              |
+| PPLCNet_x0_75 | 7.38                             | 4.51                              | 2.91                              |
+| PPLCNet_x1_0  | 10.78                            | 6.49                              | 3.98                              |
+| PPLCNet_x1_5  | 20.55                            | 12.26                             | 7.54                              |
+| PPLCNet_x2_0  | 33.79                            | 20.17                             | 12.10                             |
+| PPLCNet_x2_5  | 49.89                            | 29.60                             | 17.82                             |

 <a name="7"></a>

-## 7. 引用
+## 7. 总结
+
+PP-LCNet 没有像学术界那样死扣极致的 FLOPs 与 Params，而是着眼于分析如何添加对 Intel CPU 友好的模块来提升模型的性能，这样可以更好的平衡准确率和推理时间，其中的实验结论也很适合其他网络结构设计的研究者，同时也为 NAS 搜索研究者提供了更小的搜索空间和一般结论。最终的 PP-LCNet 在产业界也可以更好的落地和应用。
+
+<a name="8"></a>
+
+## 8. 引用

 如果你的论文用到了 PP-LCNet 的方法，请添加如下 cite：
 ```