Update PIPELINE_SERVING.md

536643b1 · TeslaZhao · GitHub · ef457423 · 536643b1

Showing 1 changed file with 6 additions and 6 deletions

doc/PIPELINE_SERVING.md +6 -6

--- a/doc/PIPELINE_SERVING.md
+++ b/doc/PIPELINE_SERVING.md
@@ -2,12 +2,12 @@

 ([简体中文](PIPELINE_SERVING_CN.md)|English)

- [Architecture Design](PIPELINE_SERVING.md#1.Architecture_Design)
- [Detailed Design](PIPELINE_SERVING.md#2.Detailed_Design)
- [Classic Examples](PIPELINE_SERVING.md#3.Classic_Examples)
- [Advanced Usages](PIPELINE_SERVING.md#4.Advanced_Usages)
- [Log Tracing](PIPELINE_SERVING.md#5.Log_Tracing)
- [Performance Analysis And Optimization](PIPELINE_SERVING.md#6.Performance_analysis_and_optimization)
+- [Architecture Design](PIPELINE_SERVING.md#1Architecture_Design)
+- [Detailed Design](PIPELINE_SERVING.md#2Detailed_Design)
+- [Classic Examples](PIPELINE_SERVING.md#3Classic_Examples)
+- [Advanced Usages](PIPELINE_SERVING.md#4Advanced_Usages)
+- [Log Tracing](PIPELINE_SERVING.md#5Log_Tracing)
+- [Performance Analysis And Optimization](PIPELINE_SERVING.md#6Performance_analysis_and_optimization)

 In many deep learning frameworks, Serving is usually used to deploy a single model. However, in industrial AI applications, an end-to-end deep learning model cannot currently solve every problem; in practice, multiple deep learning models are usually needed together. Designing multi-model applications is complicated, so to reduce development and maintenance difficulty and to keep services available, serial or simple parallel methods are usually used. As a result, throughput generally reaches only a barely usable level, and GPU utilization is low.