Merge pull request #1510 from felixhjh/dev_2

update README.md

Merge pull request #1510 from felixhjh/dev_2
update README.md
92a0d410 · TeslaZhao · bjjwwang · a8101704 · 92a0d410 · 92a0d410
438 changed file
--- a/doc/C++_Serving/ABTest_CN.md
+++ b/doc/C++_Serving/ABTest_CN.md
@@ -11,7 +11,7 @@
 ### 下载数据以及模型

 ``` shell
-cd Serving/python/examples/imdb
+cd Serving/examples/C++/imdb
 sh get_data.sh
 ```


--- a/doc/C++_Serving/ABTest_EN.md
+++ b/doc/C++_Serving/ABTest_EN.md
@@ -11,7 +11,7 @@ Note that:  A/B Test is only applicable to RPC mode, not web mode.
 ### Download Data and Models

 ```shell
-cd Serving/python/examples/imdb
+cd Serving/examples/C++/imdb
 sh get_data.sh
 ```

@@ -103,8 +103,8 @@ Due to different network conditions, the results of each prediction may be sligh
 ```

 <!--
-cp ../Serving/python/examples/imdb/get_data.sh .
-cp ../Serving/python/examples/imdb/imdb_reader.py .
+cp ../../examples/C++/imdb/get_data.sh .
+cp ../../examples/C++/imdb/imdb_reader.py .
 pip install -U paddle_serving_server
 pip install -U paddle_serving_client
 pip install -U paddlepaddle

--- a/doc/C++_Serving/Http_Service_CN.md
+++ b/doc/C++_Serving/Http_Service_CN.md
@@ -40,7 +40,7 @@ python3.6 -m paddle_serving_server.serve --model uci_housing_model --thread 10 -

 为了方便用户快速的使用Http方式请求Server端预测服务，我们已经将常用的Http请求的数据体封装、压缩、请求加密等功能封装为一个HttpClient类提供给用户，方便用户使用。

-使用HttpClient最简单只需要四步，1、创建一个HttpClient对象。2、加载Client端的prototxt配置文件（本例中为python/examples/fit_a_line/目录下的uci_housing_client/serving_client_conf.prototxt)。3、调用connect函数。4、调用Predict函数，通过Http方式请求预测服务。
+使用HttpClient最简单只需要四步，1、创建一个HttpClient对象。2、加载Client端的prototxt配置文件（本例中为examples/C++/fit_a_line目录下的uci_housing_client/serving_client_conf.prototxt)。3、调用connect函数。4、调用Predict函数，通过Http方式请求预测服务。

 此外，您可以根据自己的需要配置Server端IP、Port、服务名称（此服务名称需要与[`core/general-server/proto/general_model_service.proto`](../../core/general-server/proto/general_model_service.proto)文件中的Service服务名和rpc方法名对应，即`GeneralModelService`字段和`inference`字段），设置Request数据体压缩，设置Response支持压缩传输，模型加密预测（需要配置Server端使用模型加密）、设置响应超时时间等功能。


--- a/doc/Cube_Local_CN.md
+++ b/doc/Cube_Local_CN.md
@@ -12,7 +12,7 @@


 ## 示例
-在python/example/criteo_ctr_with_cube下执行
+在Serving/examples/C++/PaddleRec/criteo_ctr_with_cube/下执行
 ```
 python local_train.py # 训练模型
 cp ../../../build_server/core/predictor/seq_generator seq_generator #复制Sequence File模型生成工具
@@ -96,7 +96,7 @@ cd cube


 ## 注： 配置文件
-以python/examples/criteo_ctr_with_cube/cube/conf下的cube.conf示例，此文件被上述的cube-cli所使用，单机版用户可以直接使用不用关注此部分，它在分布式部署中更为重要。
+以Serving/examples/C++/PaddleRec/criteo_ctr_with_cube/cube/conf下的cube.conf示例，此文件被上述的cube-cli所使用，单机版用户可以直接使用不用关注此部分，它在分布式部署中更为重要。

 ```
 [{

--- a/doc/Cube_Local_EN.md
+++ b/doc/Cube_Local_EN.md
@@ -12,7 +12,7 @@ The local mode of Cube is different from distributed Cube, which is designed to
 This document uses the original model without any compression algorithm. If there is a need for a quantitative model to go online, please read the [Quantization Storage on Cube Sparse Parameter Indexing](./Cube_Quant_EN.md)

 ## Example
-in directory python/example/criteo_ctr_with_cube, run
+in directory Serving/examples/C++/PaddleRec/criteo_ctr_with_cube, run

 ```
 python local_train.py # train model
@@ -95,7 +95,7 @@ If you see that each key has a corresponding value output, it means that the del


 ## Appendix: Configuration
-the config file is cube.config located in python/examples/criteo_ctr_with_cube/cube/conf, this file is used by cube-cli.the Cube Local Mode users do not need to understand that just use it, it would be quite important in Cube Distributed Mode.
+the config file is cube.config located in Serving/examples/C++/PaddleRec/criteo_ctr_with_cube/cube/conf, this file is used by cube-cli.the Cube Local Mode users do not need to understand that just use it, it would be quite important in Cube Distributed Mode.

 ```
 [{

--- a/doc/Cube_Quant_CN.md
+++ b/doc/Cube_Quant_CN.md
@@ -22,7 +22,7 @@
 在Serving主目录下，到criteo_ctr_with_cube目录下训练出模型

 ```
-cd python/examples/criteo_ctr_with_cube
+cd Serving/examples/C++/PaddleRec/criteo_ctr_with_cube
 python local_train.py # 生成模型
 ```
 接下来可以使用量化和非量化两种方式去生成Sequence File用于Cube稀疏参数索引。
@@ -34,11 +34,11 @@ seq_generator ctr_serving_model/SparseFeatFactors ./cube_model/feature 8 #量化

 ## 用量化模型启动Serving

-在Serving当中，使用general_dist_kv_quant_infer op来进行预测时使用量化模型。具体详见  python/examples/criteo_ctr_with_cube/test_server_quant.py。客户端部分不需要做任何改动。
+在Serving当中，使用general_dist_kv_quant_infer op来进行预测时使用量化模型。具体详见  Serving/examples/C++/PaddleRec/criteo_ctr_with_cube/test_server_quant.py。客户端部分不需要做任何改动。

 为方便用户做demo，我们给出了从0开始启动量化模型Serving。
 ```
-cd python/examples/criteo_ctr_with_cube
+cd Serving/examples/C++/PaddleRec/criteo_ctr_with_cube
 python local_train.py
 cp ../../../build_server/core/predictor/seq_generator seq_generator
 cp ../../../build_server/output/bin/cube* ./cube/

--- a/doc/Cube_Quant_EN.md
+++ b/doc/Cube_Quant_EN.md
@@ -21,7 +21,7 @@ This tool is used to convert the Paddle model into a Sequence File. Here, two mo
 In Serving Directory，train the model in the criteo_ctr_with_cube directory

 ```
-cd python/examples/criteo_ctr_with_cube
+cd Serving/examples/C++/PaddleRec/criteo_ctr_with_cube
 python local_train.py # save model
 ```
 Next, you can use quantization and non-quantization to generate Sequence File for Cube sparse parameter indexing.
@@ -34,11 +34,11 @@ This command will convert the sparse parameter file SparseFeatFactors in the ctr

 ## Launch Serving by Quantized Model

-In Serving, a quantized model is used when using general_dist_kv_quant_infer op to make predictions. See python/examples/criteo_ctr_with_cube/test_server_quant.py for details. No changes are required on the client side.
+In Serving, a quantized model is used when using general_dist_kv_quant_infer op to make predictions. See Serving/examples/C++/PaddleRec/criteo_ctr_with_cube/test_server_quant.py for details. No changes are required on the client side.

 In order to make the demo easier for users, the following script is to train the quantized criteo ctr model and launch serving by it.
 ```
-cd python/examples/criteo_ctr_with_cube
+cd Serving/examples/C++/PaddleRec/criteo_ctr_with_cube
 python local_train.py
 cp ../../../build_server/core/predictor/seq_generator seq_generator
 cp ../../../build_server/output/bin/cube* ./cube/

--- a/doc/Cube_Test_CN.md
+++ b/doc/Cube_Test_CN.md
@@ -2,7 +2,7 @@

 ### 背景知识

-推荐系统需要大规模稀疏参数索引来帮助分布式部署，可在`python/example/criteo_ctr_with_cube`或是[PaddleRec](https://github.com/paddlepaddle/paddlerec)了解推荐模型。
+推荐系统需要大规模稀疏参数索引来帮助分布式部署，可在`Serving/examples/C++/PaddleRec/criteo_ctr_with_cube`或是[PaddleRec](https://github.com/paddlepaddle/paddlerec)了解推荐模型。

 稀疏参数索引的模型格式是SequenceFile，源自Hadoop生态的键值对格式文件。


--- a/doc/Quick_Start_CN.md
+++ b/doc/Quick_Start_CN.md
@@ -82,7 +82,7 @@ print(fetch_map)

 <h3 align="center">HTTP服务</h3>

-用户也可以将数据格式处理逻辑放在服务器端进行，这样就可以直接用curl去访问服务，参考如下案例，在目录`python/examples/fit_a_line`.
+用户也可以将数据格式处理逻辑放在服务器端进行，这样就可以直接用curl去访问服务，参考如下案例，在目录`Serving/examples/C++/fit_a_line`.

 ```
 python3 -m paddle_serving_server.serve --model uci_housing_model --thread 10 --port 9292 --name uci

--- a/doc/Quick_Start_EN.md
+++ b/doc/Quick_Start_EN.md
@@ -8,7 +8,7 @@ This quick start example is mainly for those users who already have a model to d

 get into the Serving git directory, and change dir to `fit_a_line`
 ``` shell
-cd Serving/python/examples/fit_a_line
+cd Serving/examples/C++/fit_a_line
 sh get_data.sh
 ```

@@ -54,7 +54,7 @@ Here, `client.predict` function has two arguments. `feed` is a `python dict` wit


 ### WEB service
-Users can also put the data format processing logic on the server side, so that they can directly use curl to access the service, refer to the following case whose path is `python/examples/fit_a_line`
+Users can also put the data format processing logic on the server side, so that they can directly use curl to access the service, refer to the following case whose path is `Serving/examples/C++/fit_a_line`

 ```
 python3 -m paddle_serving_server.serve --model uci_housing_model --thread 10 --port 9292 --name uci

--- a/doc/Run_On_Kubernetes_CN.md
+++ b/doc/Run_On_Kubernetes_CN.md
@@ -47,12 +47,12 @@ bash tools/generate_runtime_docker.sh --help

 #### Pipeline模式：

-对于pipeline模式，我们需要确保模型和程序文件、配置文件等各种依赖都能够在镜像中运行。因此可以在`/home/project`下存放我们的执行文件时，我们以`Serving/python/example/pipeline/ocr`为例，这是OCR文字识别任务。
+对于pipeline模式，我们需要确保模型和程序文件、配置文件等各种依赖都能够在镜像中运行。因此可以在`/home/project`下存放我们的执行文件时，我们以`Serving/examples/Pipeline/PaddleOCR/ocr`为例，这是OCR文字识别任务。

 ```bash
 #假设您已经拥有Serving运行镜像，假设镜像名为paddle_serving:cuda10.2-py36
 docker run --rm -dit --name pipeline_serving_demo paddle_serving:cuda10.2-py36 bash
-cd Serving/python/example/pipeline/ocr
+cd Serving/examples/Pipeline/PaddleOCR/ocr
 # get models
 python -m paddle_serving_app.package --get_model ocr_rec
 tar -xzvf ocr_rec.tar.gz
@@ -80,12 +80,12 @@ python3.6 web_service.py

 #### WebService模式：

-web service模式本质上和pipeline模式类似，因此我们以`Serving/python/examples/bert`为例
+web service模式本质上和pipeline模式类似，因此我们以`Serving/examples/C++/PaddleNLP/bert`为例

 ```bash
 #假设您已经拥有Serving运行镜像，假设镜像名为registry.baidubce.com/paddlepaddle/serving:0.6.0-cuda10.2-py36
 docker run --rm -dit --name webservice_serving_demo registry.baidubce.com/paddlepaddle/serving:0.6.0-cpu-py36 bash
-cd Serving/python/examples/bert
+cd Serving/examples/C++/PaddleNLP/bert
 ### download model 
 wget https://paddle-serving.bj.bcebos.com/paddle_hub_models/text/SemanticModel/bert_chinese_L-12_H-768_A-12.tar.gz
 tar -xzf bert_chinese_L-12_H-768_A-12.tar.gz

--- a/doc/Run_On_XPU_CN.md
+++ b/doc/Run_On_XPU_CN.md
@@ -104,8 +104,8 @@ print(fetch_map)
 以下提供部分样例，其他模型可参照进行修改。
 | 示例名称   | 示例链接                                                    |
 | :--------- | :---------------------------------------------------------- |
-| fit_a_line | [fit_a_line_xpu](../python/examples/xpu/fit_a_line_xpu)     |
-| resnet     | [resnet_v2_50_xpu](../python/examples/xpu/resnet_v2_50_xpu) |
+| fit_a_line | [fit_a_line_xpu](../examples/C++/xpu/resnet_v2_50_xpu)     |
+| resnet     | [resnet_v2_50_xpu](../examples/C++/xpu/resnet_v2_50_xpu) |

 注：支持昆仑芯片部署模型列表见[链接](https://paddlelite.paddlepaddle.org.cn/introduction/support_model_list.html)。不同模型适配上存在差异，可能存在不支持的情况，部署使用存在问题时，欢迎以[Github issue](https://github.com/PaddlePaddle/Serving/issues)，我们会实时跟进。


--- a/doc/Run_On_XPU_EN.md
+++ b/doc/Run_On_XPU_EN.md
@@ -109,8 +109,8 @@ print(fetch_map)
 Some examples are provided below, and other models can be modifed with reference to these examples.
 | sample name | sample links                                                |
 | :---------- | :---------------------------------------------------------- |
-| fit_a_line  | [fit_a_line_xpu](../python/examples/xpu/fit_a_line_xpu)     |
-| resnet      | [resnet_v2_50_xpu](../python/examples/xpu/resnet_v2_50_xpu) |
+| fit_a_line  | [fit_a_line_xpu](../examples/C++/xpu/fit_a_line_xpu)     |
+| resnet      | [resnet_v2_50_xpu](../examples/C++/xpu/resnet_v2_50_xpu) |

 Note：Supported model lists refer to [doc](https://paddlelite.paddlepaddle.org.cn/introduction/support_model_list.html). There are differences in the adaptation of different models, and there may be some unsupported cases. If you have any problem，please submit [Github issue](https://github.com/PaddlePaddle/Serving/issues), and we will follow up in real time.


--- a/doc/Windows_Tutorial_CN.md
+++ b/doc/Windows_Tutorial_CN.md
@@ -38,7 +38,7 @@ pip install -r python/requirements_win.txt
 **运行OCR示例**：

 ```
-cd Serving/python/example/ocr
+cd Serving/examples/C++/PaddleOCR/ocr/
 python -m paddle_serving_app.package --get_model ocr_rec
 tar -xzvf ocr_rec.tar.gz
 python -m paddle_serving_app.package --get_model ocr_det
@@ -70,7 +70,7 @@ class YourWebService(WebService):
 your_service = YourService(name="XXX")
 your_service.load_model_config("your_model_path")
 your_service.prepare_server(workdir="workdir", port=9292)
-# 如果是GPU用户，可以参照python/examples/ocr下的python示例
+# 如果是GPU用户，可以参照Serving/examples/Pipeline/PaddleOCR/ocr下的python示例
 your_service.run_debugger_service()
 # Windows平台不可以使用 run_rpc_service()接口
 your_service.run_web_service()

--- a/doc/Windows_Tutorial_EN.md
+++ b/doc/Windows_Tutorial_EN.md
@@ -38,7 +38,7 @@ pip install -r python/requirements_win.txt
 **Run OCR example**:

 ```
-cd Serving/python/example/ocr
+cd Serving/examples/C++/PaddleOCR/ocr/
 python -m paddle_serving_app.package --get_model ocr_rec
 tar -xzvf ocr_rec.tar.gz
 python -m paddle_serving_app.package --get_model ocr_det
@@ -70,7 +70,7 @@ class YourWebService(WebService):
 your_service = YourService(name="XXX")
 your_service.load_model_config("your_model_path")
 your_service.prepare_server(workdir="workdir", port=9292)
-# If you are a GPU user, you can refer to the python example under python/examples/ocr
+# If you are a GPU user, you can refer to the python example under Serving/examples/Pipeline/PaddleOCR/ocr
 your_service.run_debugger_service()
 # Windows platform cannot use run_rpc_service() interface
 your_service.run_web_service()

--- a/python/examples/README.md
+++ b/python/examples/README.md
-## Examples
-
-### Support `--use_trt`
-
-the following models support `--use_trt`, which means you can use TensorRT to accelerate inference at Cuda 10.1 or higher.
-
- imagenet ResNet50/ResNet101
- detection faster_rcnn/yolov3/pp-yolo/ttf-net
-
-
-
--- a/python/examples/README_CN.md
+++ b/python/examples/README_CN.md
-## Serving模型示例
-
-### 支持TensorRT的模型列表 `--use_trt`
-
-以下模型支持TensorRT，可以开启 `--use_trt`来加速在线预测，其他模型不能开启。
-
- imagenet ResNet50/ResNet101
- detection faster_rcnn/yolov3/pp-yolo/ttf-net
-
--- a/python/examples/bert/README.md
+++ b/python/examples/bert/README.md
-Http## Bert as service
-
-([简体中文](./README_CN.md)|English)
-
-In the example, a BERT model is used for semantic understanding prediction, and the text is represented as a vector, which can be used for further analysis and prediction.
-If your python version is 3.X, replace the 'pip' field in the following command with 'pip3',replace 'python' with 'python3'.
-
-### Getting Model
-method 1:
-This example use model [BERT Chinese Model](https://www.paddlepaddle.org.cn/hubdetail?name=bert_chinese_L-12_H-768_A-12&en_category=SemanticModel) from [Paddlehub](https://github.com/PaddlePaddle/PaddleHub).
-
-Install paddlehub first
-```
-pip3 install paddlehub
-```
-
-run 
-```
-python3 prepare_model.py 128
-```
-
-**PaddleHub only support Python 3.5+**
-
-the 128 in the command above means max_seq_len in BERT model, which is the length of sample after preprocessing.
-the config file and model file for server side are saved in the folder bert_seq128_model.
-the config file generated for client side is saved in the folder bert_seq128_client.
-
-method 2:
-You can also download the above model from BOS(max_seq_len=128). After decompression, the config file and model file for server side are stored in the bert_chinese_L-12_H-768_A-12_model folder, and the config file generated for client side is stored in the bert_chinese_L-12_H-768_A-12_client folder:
-```shell
-wget https://paddle-serving.bj.bcebos.com/paddle_hub_models/text/SemanticModel/bert_chinese_L-12_H-768_A-12.tar.gz
-tar -xzf bert_chinese_L-12_H-768_A-12.tar.gz
-mv bert_chinese_L-12_H-768_A-12_model bert_seq128_model
-mv bert_chinese_L-12_H-768_A-12_client bert_seq128_client
-```
-if your model is bert_chinese_L-12_H-768_A-12_model, replace the 'bert_seq128_model' field in the following command with 'bert_chinese_L-12_H-768_A-12_model',replace 'bert_seq128_client' with 'bert_chinese_L-12_H-768_A-12_client'.
-
-### Getting Dict and Sample Dataset
-
-```
-sh get_data.sh
-```
-this script will download Chinese Dictionary File vocab.txt and Chinese Sample Data data-c.txt
-
-### Inference Service(Support BRPC-Client、GRPC-Client、Http-Client)
-start cpu inference service,Run
-```
-python3 -m paddle_serving_server.serve --model bert_seq128_model/ --port 9292  #cpu inference service
-```
-Or,start gpu inference service,Run
-```
-python3 -m paddle_serving_server.serve --model bert_seq128_model/ --port 9292 --gpu_ids 0 #launch gpu inference service at GPU 0
-```
-
-### BRPC-Client Inference
-
-before prediction we should install paddle_serving_app. This module provides data preprocessing for BERT model.
-```
-pip3 install paddle_serving_app
-```
-Run
-```
-head data-c.txt | python3 bert_client.py --model bert_seq128_client/serving_client_conf.prototxt
-```
-
-the client reads data from data-c.txt and send prediction request, the prediction is given by word vector. (Due to massive data in the word vector, we do not print it).
-
-#### GRPC-Client/HTTP-Client
-Run
-```
-head data-c.txt | python3 bert_httpclient.py --model bert_seq128_client/serving_client_conf.prototxt
-
-```
-
-
-## Benchmark
-``` shell
-bash benchmark.sh bert_seq128_model bert_seq128_client
-```
-The output log file of benchmark named `profile_log_bert_seq128_model`
--- a/python/examples/bert/README_CN.md
+++ b/python/examples/bert/README_CN.md
-## 语义理解预测服务
-
-(简体中文|[English](./README.md))
-
-示例中采用BERT模型进行语义理解预测，将文本表示为向量的形式，可以用来做进一步的分析和预测。
-
-若使用python的版本为3.X, 将以下命令中的pip 替换为pip3, python替换为python3.
-### 获取模型
-方法1：
-示例中采用[Paddlehub](https://github.com/PaddlePaddle/PaddleHub)中的[BERT中文模型](https://www.paddlepaddle.org.cn/hubdetail?name=bert_chinese_L-12_H-768_A-12&en_category=SemanticModel)。
-请先安装paddlehub
-```
-pip3 install paddlehub
-```
-执行
-```
-python3 prepare_model.py 128
-```
-参数128表示BERT模型中的max_seq_len，即预处理后的样本长度。
-生成server端配置文件与模型文件，存放在bert_seq128_model文件夹。
-生成client端配置文件，存放在bert_seq128_client文件夹。
-
-方法2：
-您也可以从bos上直接下载上述模型（max_seq_len=128），解压后server端配置文件与模型文件存放在bert_chinese_L-12_H-768_A-12_model文件夹，client端配置文件存放在bert_chinese_L-12_H-768_A-12_client文件夹：
-```shell
-wget https://paddle-serving.bj.bcebos.com/paddle_hub_models/text/SemanticModel/bert_chinese_L-12_H-768_A-12.tar.gz
-tar -xzf bert_chinese_L-12_H-768_A-12.tar.gz
-mv bert_chinese_L-12_H-768_A-12_model bert_seq128_model
-mv bert_chinese_L-12_H-768_A-12_client bert_seq128_client
-```
-若使用bert_chinese_L-12_H-768_A-12_model模型，将下面命令中的bert_seq128_model字段替换为bert_chinese_L-12_H-768_A-12_model，bert_seq128_client字段替换为bert_chinese_L-12_H-768_A-12_client.
-
-
-
-
-### 获取词典和样例数据
-
-```
-sh get_data.sh
-```
-脚本将下载中文词典vocab.txt和中文样例数据data-c.txt
-
-### 启动预测服务（支持BRPC-Client、GRPC-Client、HTTP-Client三种方式访问）
-启动cpu预测服务，执行
-```
-python3 -m paddle_serving_server.serve --model bert_seq128_model/ --port 9292  #启动cpu预测服务
-
-```
-或者，启动gpu预测服务，执行
-```
-python3 -m paddle_serving_server.serve --model bert_seq128_model/ --port 9292 --gpu_ids 0 #在gpu 0上启动gpu预测服务
-
-```
-
-### 执行预测
-
-执行预测前需要安装paddle_serving_app，模块中提供了BERT模型的数据预处理方法。
-```
-pip3 install paddle_serving_app
-```
-
-#### BRPC-Client
-执行
-```
-head data-c.txt | python3 bert_client.py --model bert_seq128_client/serving_client_conf.prototxt
-
-```
-启动client读取data-c.txt中的数据进行预测，预测结果为文本的向量表示（由于数据较多，脚本中没有将输出进行打印），server端的地址在脚本中修改。
-
-#### GRPC-Client/HTTP-Client
-执行
-```
-head data-c.txt | python3 bert_httpclient.py --model bert_seq128_client/serving_client_conf.prototxt
-
-```
-
-## 性能测试
-``` shell
-bash benchmark.sh bert_seq128_model bert_seq128_client
-```
-性能测试的日志文件为profile_log_bert_seq128_model
-
-如需修改性能测试用例的参数，请修改benchmark.sh中的配置信息。
-
-注意:bert_seq128_model和bert_seq128_client路径后不要加'/'符号,示例需要在GPU机器上运行。
--- a/python/examples/bert/batching.py
+++ b/python/examples/bert/batching.py
-#coding:utf-8
-#   Copyright (c) 2019 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-"""Mask, padding and batching."""
-
-from __future__ import absolute_import
-from __future__ import division
-from __future__ import print_function
-
-import numpy as np
-
-
-def prepare_batch_data(insts,
-                       total_token_num,
-                       max_seq_len=128,
-                       pad_id=None,
-                       cls_id=None,
-                       sep_id=None,
-                       mask_id=None,
-                       return_input_mask=True,
-                       return_max_len=True,
-                       return_num_token=False):
-    """
-    1. generate Tensor of data
-    2. generate Tensor of position
-    3. generate self attention mask, [shape: batch_size *  max_len * max_len]
-    """
-
-    batch_src_ids = [inst[0] for inst in insts]
-    batch_sent_ids = [inst[1] for inst in insts]
-    batch_pos_ids = [inst[2] for inst in insts]
-    labels_list = []
-    # compatible with squad, whose example includes start/end positions,
-    # or unique id
-
-    for i in range(3, len(insts[0]), 1):
-        labels = [inst[i] for inst in insts]
-        labels = np.array(labels).astype("int64").reshape([-1, 1])
-        labels_list.append(labels)
-
-    out = batch_src_ids
-    # Second step: padding
-    src_id, self_input_mask = pad_batch_data(
-        out, pad_idx=pad_id, max_seq_len=max_seq_len, return_input_mask=True)
-    pos_id = pad_batch_data(
-        batch_pos_ids,
-        pad_idx=pad_id,
-        max_seq_len=max_seq_len,
-        return_pos=False,
-        return_input_mask=False)
-    sent_id = pad_batch_data(
-        batch_sent_ids,
-        pad_idx=pad_id,
-        max_seq_len=max_seq_len,
-        return_pos=False,
-        return_input_mask=False)
-
-    return_list = [src_id, pos_id, sent_id, self_input_mask] + labels_list
-
-    return return_list if len(return_list) > 1 else return_list[0]
-
-
-def pad_batch_data(insts,
-                   pad_idx=0,
-                   max_seq_len=128,
-                   return_pos=False,
-                   return_input_mask=False,
-                   return_max_len=False,
-                   return_num_token=False,
-                   return_seq_lens=False):
-    """
-    Pad the instances to the max sequence length in batch, and generate the
-    corresponding position data and input mask.
-    """
-    return_list = []
-    #max_len = max(len(inst) for inst in insts)
-    max_len = max_seq_len
-    # Any token included in dict can be used to pad, since the paddings' loss
-    # will be masked out by weights and make no effect on parameter gradients.
-
-    inst_data = np.array([
-        list(inst) + list([pad_idx] * (max_len - len(inst))) for inst in insts
-    ])
-    return_list += [inst_data.astype("int64").reshape([-1, max_len, 1])]
-
-    # position data
-    if return_pos:
-        inst_pos = np.array([
-            list(range(0, len(inst))) + [pad_idx] * (max_len - len(inst))
-            for inst in insts
-        ])
-
-        return_list += [inst_pos.astype("int64").reshape([-1, max_len, 1])]
-
-    if return_input_mask:
-        # This is used to avoid attention on paddings.
-        input_mask_data = np.array(
-            [[1] * len(inst) + [0] * (max_len - len(inst)) for inst in insts])
-        input_mask_data = np.expand_dims(input_mask_data, axis=-1)
-        return_list += [input_mask_data.astype("float32")]
-
-    if return_max_len:
-        return_list += [max_len]
-
-    if return_num_token:
-        num_token = 0
-        for inst in insts:
-            num_token += len(inst)
-        return_list += [num_token]
-
-    if return_seq_lens:
-        seq_lens = np.array([len(inst) for inst in insts])
-        return_list += [seq_lens.astype("int64").reshape([-1, 1])]
-
-    return return_list if len(return_list) > 1 else return_list[0]
--- a/python/examples/bert/benchmark.py
+++ b/python/examples/bert/benchmark.py
-# -*- coding: utf-8 -*-
-#
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-from __future__ import unicode_literals, absolute_import
-import os
-import sys
-import time
-import json
-import requests
-import numpy as np
-from paddle_serving_client import Client
-from paddle_serving_client.utils import MultiThreadRunner
-from paddle_serving_client.utils import benchmark_args, show_latency
-from paddle_serving_app.reader import ChineseBertReader
-
-args = benchmark_args()
-
-
-def single_func(idx, resource):
-    fin = open("data-c.txt")
-    dataset = []
-    for line in fin:
-        dataset.append(line.strip())
-
-    profile_flags = False
-    latency_flags = False
-    if os.getenv("FLAGS_profile_client"):
-        profile_flags = True
-    if os.getenv("FLAGS_serving_latency"):
-        latency_flags = True
-        latency_list = []
-
-    if args.request == "rpc":
-        reader = ChineseBertReader({"max_seq_len": 128})
-        fetch = ["pooled_output"]
-        client = Client()
-        client.load_client_config(args.model)
-        client.connect([resource["endpoint"][idx % len(resource["endpoint"])]])
-        start = time.time()
-        for i in range(turns):
-            if args.batch_size >= 1:
-                l_start = time.time()
-                feed_batch = []
-                b_start = time.time()
-                for bi in range(args.batch_size):
-                    feed_dict = reader.process(dataset[bi])
-                    for key in feed_dict.keys():
-                        feed_dict[key] = np.array(feed_dict[key]).reshape(
-                            (1, 128, 1))
-                    feed_batch.append(feed_dict)
-                b_end = time.time()
-
-                if profile_flags:
-                    sys.stderr.write(
-                        "PROFILE\tpid:{}\tbert_pre_0:{} bert_pre_1:{}\n".format(
-                            os.getpid(),
-                            int(round(b_start * 1000000)),
-                            int(round(b_end * 1000000))))
-                result = client.predict(
-                    feed=feed_batch, fetch=fetch, batch=True)
-
-                l_end = time.time()
-                if latency_flags:
-                    latency_list.append(l_end * 1000 - l_start * 1000)
-            else:
-                print("unsupport batch size {}".format(args.batch_size))
-
-    elif args.request == "http":
-        reader = ChineseBertReader({"max_seq_len": 128})
-        fetch = ["pooled_output"]
-        server = "http://" + resource["endpoint"][idx % len(resource[
-            "endpoint"])] + "/bert/prediction"
-        start = time.time()
-        for i in range(turns):
-            if args.batch_size >= 1:
-                l_start = time.time()
-                feed_batch = []
-                b_start = time.time()
-                for bi in range(args.batch_size):
-                    feed_batch.append({"words": dataset[bi]})
-                req = json.dumps({"feed": feed_batch, "fetch": fetch})
-                b_end = time.time()
-
-                if profile_flags:
-                    sys.stderr.write(
-                        "PROFILE\tpid:{}\tbert_pre_0:{} bert_pre_1:{}\n".format(
-                            os.getpid(),
-                            int(round(b_start * 1000000)),
-                            int(round(b_end * 1000000))))
-                result = requests.post(
-                    server,
-                    data=req,
-                    headers={"Content-Type": "application/json"})
-                l_end = time.time()
-                if latency_flags:
-                    latency_list.append(l_end * 1000 - l_start * 1000)
-            else:
-                print("unsupport batch size {}".format(args.batch_size))
-
-    else:
-        raise ValueError("not implemented {} request".format(args.request))
-    end = time.time()
-    if latency_flags:
-        return [[end - start], latency_list]
-    else:
-        return [[end - start]]
-
-
-if __name__ == '__main__':
-    multi_thread_runner = MultiThreadRunner()
-    endpoint_list = ["127.0.0.1:9292", "127.0.0.1:9293"]
-    turns = 100
-    start = time.time()
-    result = multi_thread_runner.run(
-        single_func, args.thread, {"endpoint": endpoint_list,
-                                   "turns": turns})
-    end = time.time()
-    total_cost = end - start
-
-    avg_cost = 0
-    for i in range(args.thread):
-        avg_cost += result[0][i]
-    avg_cost = avg_cost / args.thread
-
-    print("total cost: {}s".format(total_cost))
-    print("each thread cost: {}s. ".format(avg_cost))
-    print("qps: {}samples/s".format(args.batch_size * args.thread * turns /
-                                    total_cost))
-    if os.getenv("FLAGS_serving_latency"):
-        show_latency(result[1])
--- a/python/examples/bert/benchmark.sh
+++ b/python/examples/bert/benchmark.sh
-rm profile_log*
-export CUDA_VISIBLE_DEVICES=0,1
-export FLAGS_profile_server=1
-export FLAGS_profile_client=1
-export FLAGS_serving_latency=1
-
-gpu_id=0
-#save cpu and gpu utilization log
-if [ -d utilization ];then
-    rm -rf utilization
-else
-    mkdir utilization
-fi
-#start server
-$PYTHONROOT/bin/python3 -m paddle_serving_server.serve --model $1 --port 9292 --thread 4 --gpu_ids 0,1 --mem_optim  --ir_optim >  elog  2>&1 &
-sleep 5
-
-#warm up
-$PYTHONROOT/bin/python3 benchmark.py --thread 4 --batch_size 1 --model $2/serving_client_conf.prototxt --request rpc > profile 2>&1
-echo -e "import psutil\nimport time\nwhile True:\n\tcpu_res = psutil.cpu_percent()\n\twith open('cpu.txt', 'a+') as f:\n\t\tf.write(f'{cpu_res}\\\n')\n\ttime.sleep(0.1)" > cpu.py
-for thread_num in 1 4 8 16
-do
-for batch_size in 1 4 16 64
-do
-    job_bt=`date '+%Y%m%d%H%M%S'`
-    nvidia-smi --id=0 --query-compute-apps=used_memory --format=csv -lms 100 > gpu_memory_use.log 2>&1 &
-    nvidia-smi --id=0 --query-gpu=utilization.gpu --format=csv -lms 100 > gpu_utilization.log 2>&1 &
-    rm -rf cpu.txt
-    $PYTHONROOT/bin/python3 cpu.py &
-    gpu_memory_pid=$!
-    $PYTHONROOT/bin/python3 benchmark.py --thread $thread_num --batch_size $batch_size --model $2/serving_client_conf.prototxt --request rpc > profile 2>&1
-    kill `ps -ef|grep used_memory|awk '{print $2}'` > /dev/null
-    kill `ps -ef|grep utilization.gpu|awk '{print $2}'` > /dev/null
-    kill `ps -ef|grep cpu.py|awk '{print $2}'` > /dev/null
-    echo "model_name:" $1
-    echo "thread_num:" $thread_num
-    echo "batch_size:" $batch_size
-    echo "=================Done===================="
-    echo "model_name:$1" >> profile_log_$1
-    echo "batch_size:$batch_size" >> profile_log_$1
-    job_et=`date '+%Y%m%d%H%M%S'`
-    awk 'BEGIN {max = 0} {if(NR>1){if ($1 > max) max=$1}} END {print "CPU_UTILIZATION:", max}' cpu.txt >> profile_log_$1
-    awk 'BEGIN {max = 0} {if(NR>1){if ($1 > max) max=$1}} END {print "MAX_GPU_MEMORY:", max}' gpu_memory_use.log >> profile_log_$1
-    awk 'BEGIN {max = 0} {if(NR>1){if ($1 > max) max=$1}} END {print "GPU_UTILIZATION:", max}' gpu_utilization.log >> profile_log_$1
-    rm -rf gpu_use.log gpu_utilization.log
-    $PYTHONROOT/bin/python3 ../util/show_profile.py profile $thread_num >> profile_log_$1
-    tail -n 8 profile >> profile_log_$1
-    echo "" >> profile_log_$1
-done
-done
-
-#Divided log
-awk 'BEGIN{RS="\n\n"}{i++}{print > "bert_log_"i}' profile_log_$1
-mkdir bert_log && mv bert_log_* bert_log
-ps -ef|grep 'serving'|grep -v grep|cut -c 9-15 | xargs kill -9
--- a/python/examples/bert/benchmark_with_profile.sh
+++ b/python/examples/bert/benchmark_with_profile.sh
-export CUDA_VISIBLE_DEVICES=0,1
-python -m paddle_serving_server.serve --model bert_seq20_model/ --port 9295 --thread 4 --gpu_ids 0,1 2> elog > stdlog &
-export FLAGS_profile_client=1
-export FLAGS_profile_server=1
-sleep 5
-thread_num=4
-python benchmark_batch.py --thread ${thread_num} --batch_size 64 --model serving_client_conf/serving_client_conf.prototxt 2> profile
-
-python show_profile.py profile ${thread_num}
-python timeline_trace.py profile trace
--- a/python/examples/bert/bert_client.py
+++ b/python/examples/bert/bert_client.py
-# coding:utf-8
-# pylint: disable=doc-string-missing
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-import sys
-from paddle_serving_client import Client
-from paddle_serving_client.utils import benchmark_args
-from paddle_serving_app.reader import ChineseBertReader
-import numpy as np
-args = benchmark_args()
-
-reader = ChineseBertReader({"max_seq_len": 128})
-fetch = ["pooled_output"]
-endpoint_list = ['127.0.0.1:9292']
-client = Client()
-client.load_client_config(args.model)
-client.connect(endpoint_list)
-
-for line in sys.stdin:
-    feed_dict = reader.process(line)
-    for key in feed_dict.keys():
-        feed_dict[key] = np.array(feed_dict[key]).reshape((128, 1))
-    #print(feed_dict)
-    result = client.predict(feed=feed_dict, fetch=fetch, batch=False)
-print(result)
--- a/python/examples/bert/bert_gpu_server.py
+++ b/python/examples/bert/bert_gpu_server.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-import os
-import sys
-from paddle_serving_server import OpMaker
-from paddle_serving_server import OpSeqMaker
-from paddle_serving_server import Server
-
-op_maker = OpMaker()
-read_op = op_maker.create('general_reader')
-general_infer_op = op_maker.create('general_infer')
-general_response_op = op_maker.create('general_response')
-
-op_seq_maker = OpSeqMaker()
-op_seq_maker.add_op(read_op)
-op_seq_maker.add_op(general_infer_op)
-op_seq_maker.add_op(general_response_op)
-
-server = Server()
-server.set_op_sequence(op_seq_maker.get_op_sequence())
-server.set_num_threads(8)
-server.set_memory_optimize(True)
-server.set_gpuid(1)
-
-server.load_model_config(sys.argv[1])
-port = int(sys.argv[2])
-gpuid = sys.argv[3]
-server.set_gpuid(gpuid)
-server.prepare_server(workdir="work_dir1", port=port, device="gpu")
-server.run_server()
--- a/python/examples/bert/bert_httpclient.py
+++ b/python/examples/bert/bert_httpclient.py
-# coding:utf-8
-# pylint: disable=doc-string-missing
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-import sys
-from paddle_serving_client import HttpClient
-from paddle_serving_client.utils import benchmark_args
-from paddle_serving_app.reader import ChineseBertReader
-import numpy as np
-args = benchmark_args()
-
-reader = ChineseBertReader({"max_seq_len": 128})
-fetch = ["pooled_output"]
-endpoint_list = ['127.0.0.1:9292']
-client = HttpClient()
-client.load_client_config(args.model)
-''' 
-if you want use GRPC-client, set_use_grpc_client(True)
-or you can directly use client.grpc_client_predict(...)
-as for HTTP-client,set_use_grpc_client(False)(which is default)
-or you can directly use client.http_client_predict(...)
-'''
-#client.set_use_grpc_client(True)
-'''
-if you want to enable Encrypt Module,uncommenting the following line
-'''
-#client.use_key("./key")
-'''
-if you want to compress,uncommenting the following line
-'''
-#client.set_response_compress(True)
-#client.set_request_compress(True)
-'''
-we recommend use Proto data format in HTTP-body, set True(which is default)
-if you want use JSON data format in HTTP-body, set False
-'''
-#client.set_http_proto(True)
-client.connect(endpoint_list)
-
-for line in sys.stdin:
-    feed_dict = reader.process(line)
-    for key in feed_dict.keys():
-        feed_dict[key] = np.array(feed_dict[key]).reshape((128, 1))
-    #print(feed_dict)
-    result = client.predict(feed=feed_dict, fetch=fetch, batch=False)
-print(result)
--- a/python/examples/bert/bert_reader.py
+++ b/python/examples/bert/bert_reader.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-from batching import pad_batch_data
-import tokenization
-
-
-class BertReader():
-    def __init__(self, vocab_file="", max_seq_len=128):
-        self.vocab_file = vocab_file
-        self.tokenizer = tokenization.FullTokenizer(vocab_file=vocab_file)
-        self.max_seq_len = max_seq_len
-        self.vocab = self.tokenizer.vocab
-        self.pad_id = self.vocab["[PAD]"]
-        self.cls_id = self.vocab["[CLS]"]
-        self.sep_id = self.vocab["[SEP]"]
-        self.mask_id = self.vocab["[MASK]"]
-
-    def pad_batch(self, token_ids, text_type_ids, position_ids):
-        batch_token_ids = [token_ids]
-        batch_text_type_ids = [text_type_ids]
-        batch_position_ids = [position_ids]
-
-        padded_token_ids, input_mask = pad_batch_data(
-            batch_token_ids,
-            max_seq_len=self.max_seq_len,
-            pad_idx=self.pad_id,
-            return_input_mask=True)
-        padded_text_type_ids = pad_batch_data(
-            batch_text_type_ids,
-            max_seq_len=self.max_seq_len,
-            pad_idx=self.pad_id)
-        padded_position_ids = pad_batch_data(
-            batch_position_ids,
-            max_seq_len=self.max_seq_len,
-            pad_idx=self.pad_id)
-        return padded_token_ids, padded_position_ids, padded_text_type_ids, input_mask
-
-    def process(self, sent):
-        text_a = tokenization.convert_to_unicode(sent)
-        tokens_a = self.tokenizer.tokenize(text_a)
-        if len(tokens_a) > self.max_seq_len - 2:
-            tokens_a = tokens_a[0:(self.max_seq_len - 2)]
-        tokens = []
-        text_type_ids = []
-        tokens.append("[CLS]")
-        text_type_ids.append(0)
-        for token in tokens_a:
-            tokens.append(token)
-            text_type_ids.append(0)
-        token_ids = self.tokenizer.convert_tokens_to_ids(tokens)
-        position_ids = list(range(len(token_ids)))
-        p_token_ids, p_pos_ids, p_text_type_ids, input_mask = \
-            self.pad_batch(token_ids, text_type_ids, position_ids)
-        feed_result = {
-            "input_ids": p_token_ids.reshape(-1).tolist(),
-            "position_ids": p_pos_ids.reshape(-1).tolist(),
-            "segment_ids": p_text_type_ids.reshape(-1).tolist(),
-            "input_mask": input_mask.reshape(-1).tolist()
-        }
-        return feed_result
--- a/python/examples/bert/bert_server.py
+++ b/python/examples/bert/bert_server.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-import os
-import sys
-from paddle_serving_server import OpMaker
-from paddle_serving_server import OpSeqMaker
-from paddle_serving_server import Server
-
-op_maker = OpMaker()
-read_op = op_maker.create('general_reader')
-general_infer_op = op_maker.create('general_infer')
-general_response_op = op_maker.create('general_response')
-
-op_seq_maker = OpSeqMaker()
-op_seq_maker.add_op(read_op)
-op_seq_maker.add_op(general_infer_op)
-op_seq_maker.add_op(general_response_op)
-
-server = Server()
-server.set_op_sequence(op_seq_maker.get_op_sequence())
-server.set_num_threads(4)
-
-server.load_model_config(sys.argv[1])
-port = int(sys.argv[2])
-server.prepare_server(workdir="work_dir1", port=port, device="cpu")
-server.run_server()
--- a/python/examples/bert/get_data.sh
+++ b/python/examples/bert/get_data.sh
-wget https://paddle-serving.bj.bcebos.com/bert_example/data-c.txt --no-check-certificate
-wget https://paddle-serving.bj.bcebos.com/bert_example/vocab.txt --no-check-certificate
--- a/python/examples/bert/prepare_model.py
+++ b/python/examples/bert/prepare_model.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-import paddlehub as hub
-import paddle.fluid as fluid
-import sys
-import paddle_serving_client.io as serving_io
-import paddle
-
-paddle.enable_static()
-model_name = "bert_chinese_L-12_H-768_A-12"
-module = hub.Module(name=model_name)
-inputs, outputs, program = module.context(
-    trainable=True, max_seq_len=int(sys.argv[1]))
-place = fluid.core_avx.CPUPlace()
-exe = fluid.Executor(place)
-input_ids = inputs["input_ids"]
-position_ids = inputs["position_ids"]
-segment_ids = inputs["segment_ids"]
-input_mask = inputs["input_mask"]
-pooled_output = outputs["pooled_output"]
-sequence_output = outputs["sequence_output"]
-
-feed_var_names = [
-    input_ids.name, position_ids.name, segment_ids.name, input_mask.name
-]
-
-target_vars = [pooled_output, sequence_output]
-
-serving_io.save_model(
-    "bert_seq{}_model".format(sys.argv[1]),
-    "bert_seq{}_client".format(sys.argv[1]), {
-        "input_ids": input_ids,
-        "position_ids": position_ids,
-        "segment_ids": segment_ids,
-        "input_mask": input_mask,
-    }, {"pooled_output": pooled_output,
-        "sequence_output": sequence_output}, program)
--- a/python/examples/bert/test_multi_fetch_client.py
+++ b/python/examples/bert/test_multi_fetch_client.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-from paddle_serving_client import Client
-from paddle_serving_app.reader import ChineseBertReader
-import sys
-import numpy as np
-
-client = Client()
-client.load_client_config("./bert_seq32_client/serving_client_conf.prototxt")
-client.connect(["127.0.0.1:9292"])
-
-reader = ChineseBertReader({"max_seq_len": 32})
-fetch = ["sequence_10", "sequence_12", "pooled_output"]
-expected_shape = {
-    "sequence_10": (4, 32, 768),
-    "sequence_12": (4, 32, 768),
-    "pooled_output": (4, 768)
-}
-batch_size = 4
-feed_batch = {}
-
-batch_len = 0
-for line in sys.stdin:
-    feed = reader.process(line)
-    if batch_len == 0:
-        for key in feed.keys():
-            val_len = len(feed[key])
-            feed_batch[key] = np.array(feed[key]).reshape((1, val_len, 1))
-        continue
-    if len(feed_batch) < batch_size:
-        for key in feed.keys():
-            np.concatenate([
-                feed_batch[key], np.array(feed[key]).reshape((1, val_len, 1))
-            ])
-    else:
-        fetch_map = client.predict(feed=feed_batch, fetch=fetch)
-        feed_batch = []
-        for var_name in fetch:
-            if fetch_map[var_name].shape != expected_shape[var_name]:
-                print("fetch var {} shape error.".format(var_name))
-                sys.exit(1)
--- a/python/examples/bert/tokenization.py
+++ b/python/examples/bert/tokenization.py
-# coding=utf-8
-# Copyright 2018 The Google AI Language Team Authors.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#         http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-"""Tokenization classes."""
-
-from __future__ import absolute_import
-from __future__ import division
-from __future__ import print_function
-
-import collections
-import io
-import unicodedata
-import six
-import sentencepiece as spm
-import pickle
-
-
-def convert_to_unicode(text):  # pylint: disable=doc-string-with-all-args
-    """Converts `text` to Unicode (if it's not already), assuming utf-8 input."""
-    if six.PY3:
-        if isinstance(text, str):
-            return text
-        elif isinstance(text, bytes):
-            return text.decode("utf-8", "ignore")
-        else:
-            raise ValueError("Unsupported string type: %s" % (type(text)))
-    elif six.PY2:
-        if isinstance(text, str):
-            return text.decode("utf-8", "ignore")
-        elif isinstance(text, unicode):  # noqa
-            return text
-        else:
-            raise ValueError("Unsupported string type: %s" % (type(text)))
-    else:
-        raise ValueError("Not running on Python2 or Python 3?")
-
-
-def printable_text(text):  # pylint: disable=doc-string-with-all-args
-    """Returns text encoded in a way suitable for print or `tf.logging`."""
-
-    # These functions want `str` for both Python2 and Python3, but in one case
-    # it's a Unicode string and in the other it's a byte string.
-    if six.PY3:
-        if isinstance(text, str):
-            return text
-        elif isinstance(text, bytes):
-            return text.decode("utf-8", "ignore")
-        else:
-            raise ValueError("Unsupported string type: %s" % (type(text)))
-    elif six.PY2:
-        if isinstance(text, str):
-            return text
-        elif isinstance(text, unicode):  # noqa
-            return text.encode("utf-8")
-        else:
-            raise ValueError("Unsupported string type: %s" % (type(text)))
-    else:
-        raise ValueError("Not running on Python2 or Python 3?")
-
-
-def load_vocab(vocab_file):  # pylint: disable=doc-string-with-all-args, doc-string-with-returns
-    """Loads a vocabulary file into a dictionary."""
-    vocab = collections.OrderedDict()
-    fin = io.open(vocab_file, "r", encoding="UTF-8")
-    for num, line in enumerate(fin):
-        items = convert_to_unicode(line.strip()).split("\t")
-        if len(items) > 2:
-            break
-        token = items[0]
-        index = items[1] if len(items) == 2 else num
-        token = token.strip()
-        vocab[token] = int(index)
-    fin.close()
-    return vocab
-
-
-def convert_by_vocab(vocab, items):
-    """Converts a sequence of [tokens|ids] using the vocab."""
-    output = []
-    for item in items:
-        output.append(vocab[item])
-    return output
-
-
-def convert_tokens_to_ids(vocab, tokens):
-    return convert_by_vocab(vocab, tokens)
-
-
-def convert_ids_to_tokens(inv_vocab, ids):
-    return convert_by_vocab(inv_vocab, ids)
-
-
-def whitespace_tokenize(text):
-    """Runs basic whitespace cleaning and splitting on a peice of text."""
-    text = text.strip()
-    if not text:
-        return []
-    tokens = text.split()
-    return tokens
-
-
-class FullTokenizer(object):
-    """Runs end-to-end tokenziation."""
-
-    def __init__(self,
-                 vocab_file,
-                 do_lower_case=True,
-                 use_sentence_piece_vocab=False):
-        self.vocab = load_vocab(vocab_file)
-        self.inv_vocab = {v: k for k, v in self.vocab.items()}
-        self.basic_tokenizer = BasicTokenizer(do_lower_case=do_lower_case)
-        self.use_sentence_piece_vocab = use_sentence_piece_vocab
-        self.wordpiece_tokenizer = WordpieceTokenizer(
-            vocab=self.vocab,
-            use_sentence_piece_vocab=self.use_sentence_piece_vocab)
-
-    def tokenize(self, text):
-        split_tokens = []
-        for token in self.basic_tokenizer.tokenize(text):
-            for sub_token in self.wordpiece_tokenizer.tokenize(token):
-                split_tokens.append(sub_token)
-
-        return split_tokens
-
-    def convert_tokens_to_ids(self, tokens):
-        return convert_by_vocab(self.vocab, tokens)
-
-    def convert_ids_to_tokens(self, ids):
-        return convert_by_vocab(self.inv_vocab, ids)
-
-
-class CharTokenizer(object):
-    """Runs end-to-end tokenziation."""
-
-    def __init__(self, vocab_file, do_lower_case=True):
-        self.vocab = load_vocab(vocab_file)
-        self.inv_vocab = {v: k for k, v in self.vocab.items()}
-        self.wordpiece_tokenizer = WordpieceTokenizer(vocab=self.vocab)
-
-    def tokenize(self, text):
-        split_tokens = []
-        for token in text.lower().split(" "):
-            for sub_token in self.wordpiece_tokenizer.tokenize(token):
-                split_tokens.append(sub_token)
-
-        return split_tokens
-
-    def convert_tokens_to_ids(self, tokens):
-        return convert_by_vocab(self.vocab, tokens)
-
-    def convert_ids_to_tokens(self, ids):
-        return convert_by_vocab(self.inv_vocab, ids)
-
-
-class WSSPTokenizer(object):  # pylint: disable=doc-string-missing
-    def __init__(self, vocab_file, sp_model_dir, word_dict, ws=True,
-                 lower=True):
-        self.vocab = load_vocab(vocab_file)
-        self.inv_vocab = {v: k for k, v in self.vocab.items()}
-        self.ws = ws
-        self.lower = lower
-        self.dict = pickle.load(open(word_dict, 'rb'))
-        self.sp_model = spm.SentencePieceProcessor()
-        self.window_size = 5
-        self.sp_model.Load(sp_model_dir)
-
-    def cut(self, chars):  # pylint: disable=doc-string-missing
-        words = []
-        idx = 0
-        while idx < len(chars):
-            matched = False
-            for i in range(self.window_size, 0, -1):
-                cand = chars[idx:idx + i]
-                if cand in self.dict:
-                    words.append(cand)
-                    matched = True
-                    break
-            if not matched:
-                i = 1
-                words.append(chars[idx])
-            idx += i
-        return words
-
-    def tokenize(self, text, unk_token="[UNK]"):  # pylint: disable=doc-string-missing
-        text = convert_to_unicode(text)
-        if self.ws:
-            text = [s for s in self.cut(text) if s != ' ']
-        else:
-            text = text.split(' ')
-        if self.lower:
-            text = [s.lower() for s in text]
-        text = ' '.join(text)
-        tokens = self.sp_model.EncodeAsPieces(text)
-        in_vocab_tokens = []
-        for token in tokens:
-            if token in self.vocab:
-                in_vocab_tokens.append(token)
-            else:
-                in_vocab_tokens.append(unk_token)
-        return in_vocab_tokens
-
-    def convert_tokens_to_ids(self, tokens):
-        return convert_by_vocab(self.vocab, tokens)
-
-    def convert_ids_to_tokens(self, ids):
-        return convert_by_vocab(self.inv_vocab, ids)
-
-
-class BasicTokenizer(object):
-    """Runs basic tokenization (punctuation splitting, lower casing, etc.)."""
-
-    def __init__(self, do_lower_case=True):
-        """Constructs a BasicTokenizer.
-
-        Args:
-            do_lower_case: Whether to lower case the input.
-        """
-        self.do_lower_case = do_lower_case
-
-    def tokenize(self, text):  # pylint: disable=doc-string-with-all-args, doc-string-with-returns
-        """Tokenizes a piece of text."""
-        text = convert_to_unicode(text)
-        text = self._clean_text(text)
-
-        # This was added on November 1st, 2018 for the multilingual and Chinese
-        # models. This is also applied to the English models now, but it doesn't
-        # matter since the English models were not trained on any Chinese data
-        # and generally don't have any Chinese data in them (there are Chinese
-        # characters in the vocabulary because Wikipedia does have some Chinese
-        # words in the English Wikipedia.).
-        text = self._tokenize_chinese_chars(text)
-
-        orig_tokens = whitespace_tokenize(text)
-        split_tokens = []
-        for token in orig_tokens:
-            if self.do_lower_case:
-                token = token.lower()
-                token = self._run_strip_accents(token)
-            split_tokens.extend(self._run_split_on_punc(token))
-
-        output_tokens = whitespace_tokenize(" ".join(split_tokens))
-        return output_tokens
-
-    def _run_strip_accents(self, text):
-        """Strips accents from a piece of text."""
-        text = unicodedata.normalize("NFD", text)
-        output = []
-        for char in text:
-            cat = unicodedata.category(char)
-            if cat == "Mn":
-                continue
-            output.append(char)
-        return "".join(output)
-
-    def _run_split_on_punc(self, text):
-        """Splits punctuation on a piece of text."""
-        chars = list(text)
-        i = 0
-        start_new_word = True
-        output = []
-        while i < len(chars):
-            char = chars[i]
-            if _is_punctuation(char):
-                output.append([char])
-                start_new_word = True
-            else:
-                if start_new_word:
-                    output.append([])
-                start_new_word = False
-                output[-1].append(char)
-            i += 1
-
-        return ["".join(x) for x in output]
-
-    def _tokenize_chinese_chars(self, text):
-        """Adds whitespace around any CJK character."""
-        output = []
-        for char in text:
-            cp = ord(char)
-            if self._is_chinese_char(cp):
-                output.append(" ")
-                output.append(char)
-                output.append(" ")
-            else:
-                output.append(char)
-        return "".join(output)
-
-    def _is_chinese_char(self, cp):
-        """Checks whether CP is the codepoint of a CJK character."""
-        # This defines a "chinese character" as anything in the CJK Unicode block:
-        #     https://en.wikipedia.org/wiki/CJK_Unified_Ideographs_(Unicode_block)
-        #
-        # Note that the CJK Unicode block is NOT all Japanese and Korean characters,
-        # despite its name. The modern Korean Hangul alphabet is a different block,
-        # as is Japanese Hiragana and Katakana. Those alphabets are used to write
-        # space-separated words, so they are not treated specially and handled
-        # like the all of the other languages.
-        if ((cp >= 0x4E00 and cp <= 0x9FFF) or  #
-            (cp >= 0x3400 and cp <= 0x4DBF) or  #
-            (cp >= 0x20000 and cp <= 0x2A6DF) or  #
-            (cp >= 0x2A700 and cp <= 0x2B73F) or  #
-            (cp >= 0x2B740 and cp <= 0x2B81F) or  #
-            (cp >= 0x2B820 and cp <= 0x2CEAF) or
-            (cp >= 0xF900 and cp <= 0xFAFF) or  #
-            (cp >= 0x2F800 and cp <= 0x2FA1F)):  #
-            return True
-
-        return False
-
-    def _clean_text(self, text):
-        """Performs invalid character removal and whitespace cleanup on text."""
-        output = []
-        for char in text:
-            cp = ord(char)
-            if cp == 0 or cp == 0xfffd or _is_control(char):
-                continue
-            if _is_whitespace(char):
-                output.append(" ")
-            else:
-                output.append(char)
-        return "".join(output)
-
-
-class WordpieceTokenizer(object):
-    """Runs WordPiece tokenziation."""
-
-    def __init__(self,
-                 vocab,
-                 unk_token="[UNK]",
-                 max_input_chars_per_word=100,
-                 use_sentence_piece_vocab=False):
-        self.vocab = vocab
-        self.unk_token = unk_token
-        self.max_input_chars_per_word = max_input_chars_per_word
-        self.use_sentence_piece_vocab = use_sentence_piece_vocab
-
-    def tokenize(self, text):  # pylint: disable=doc-string-with-all-args
-        """Tokenizes a piece of text into its word pieces.
-
-        This uses a greedy longest-match-first algorithm to perform tokenization
-        using the given vocabulary.
-
-        For example:
-            input = "unaffable"
-            output = ["un", "##aff", "##able"]
-
-        Args:
-            text: A single token or whitespace separated tokens. This should have
-                already been passed through `BasicTokenizer.
-
-        Returns:
-            A list of wordpiece tokens.
-        """
-
-        text = convert_to_unicode(text)
-
-        output_tokens = []
-        for token in whitespace_tokenize(text):
-            chars = list(token)
-            if len(chars) > self.max_input_chars_per_word:
-                output_tokens.append(self.unk_token)
-                continue
-
-            is_bad = False
-            start = 0
-            sub_tokens = []
-            while start < len(chars):
-                end = len(chars)
-                cur_substr = None
-                while start < end:
-                    substr = "".join(chars[start:end])
-                    if start == 0 and self.use_sentence_piece_vocab:
-                        substr = u'\u2581' + substr
-                    if start > 0 and not self.use_sentence_piece_vocab:
-                        substr = "##" + substr
-                    if substr in self.vocab:
-                        cur_substr = substr
-                        break
-                    end -= 1
-                if cur_substr is None:
-                    is_bad = True
-                    break
-                sub_tokens.append(cur_substr)
-                start = end
-
-            if is_bad:
-                output_tokens.append(self.unk_token)
-            else:
-                output_tokens.extend(sub_tokens)
-        return output_tokens
-
-
-def _is_whitespace(char):
-    """Checks whether `chars` is a whitespace character."""
-    # \t, \n, and \r are technically contorl characters but we treat them
-    # as whitespace since they are generally considered as such.
-    if char == " " or char == "\t" or char == "\n" or char == "\r":
-        return True
-    cat = unicodedata.category(char)
-    if cat == "Zs":
-        return True
-    return False
-
-
-def _is_control(char):
-    """Checks whether `chars` is a control character."""
-    # These are technically control characters but we count them as whitespace
-    # characters.
-    if char == "\t" or char == "\n" or char == "\r":
-        return False
-    cat = unicodedata.category(char)
-    if cat.startswith("C"):
-        return True
-    return False
-
-
-def _is_punctuation(char):
-    """Checks whether `chars` is a punctuation character."""
-    cp = ord(char)
-    # We treat all non-letter/number ASCII as punctuation.
-    # Characters such as "^", "$", and "`" are not in the Unicode
-    # Punctuation class but we treat them as punctuation anyways, for
-    # consistency.
-    if ((cp >= 33 and cp <= 47) or (cp >= 58 and cp <= 64) or
-        (cp >= 91 and cp <= 96) or (cp >= 123 and cp <= 126)):
-        return True
-    cat = unicodedata.category(char)
-    if cat.startswith("P"):
-        return True
-    return False
--- a/python/examples/blazeface/README.md
+++ b/python/examples/blazeface/README.md
-# Blazeface 
-
-## Get Model
-```
-python3 -m paddle_serving_app.package --get_model blazeface
-tar -xf blazeface.tar.gz
-```
-
-## RPC Service
-
-### Start Service
-
-```
-python3 -m paddle_serving_server.serve --model serving_server --port 9494
-```
-
-### Client Prediction
-
-```
-python3 test_client.py serving_client/serving_client_conf.prototxt test.jpg
-```
-
-the result is in `output` folder, including a json file and image file with bounding boxes.
--- a/python/examples/blazeface/test_client.py
+++ b/python/examples/blazeface/test_client.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-from paddle_serving_client import Client
-from paddle_serving_app.reader import *
-import sys
-import numpy as np
-from paddle_serving_app.reader import BlazeFacePostprocess
-
-preprocess = Sequential([
-    File2Image(),
-    Normalize([104, 117, 123], [127.502231, 127.502231, 127.502231], False)
-])
-
-postprocess = BlazeFacePostprocess("label_list.txt", "output")
-client = Client()
-
-client.load_client_config(sys.argv[1])
-client.connect(['127.0.0.1:9494'])
-
-im_0 = preprocess(sys.argv[2])
-tmp = Transpose((2, 0, 1))
-im = tmp(im_0)
-fetch_map = client.predict(
-    feed={"image": im}, fetch=["detection_output_0.tmp_0"])
-fetch_map["image"] = sys.argv[2]
-fetch_map["im_shape"] = im_0.shape
-postprocess(fetch_map)
--- a/python/examples/cascade_rcnn/000000570688.jpg
+++ b/python/examples/cascade_rcnn/000000570688.jpg
--- a/python/examples/cascade_rcnn/README.md
+++ b/python/examples/cascade_rcnn/README.md
-# Cascade RCNN model on Paddle Serving
-
-([简体中文](./README_CN.md)|English)
-
-### Get The Cascade RCNN Model
-```
-sh get_data.sh
-```
-If you want to have more detection models, please refer to [Paddle Detection Model Zoo](https://github.com/PaddlePaddle/PaddleDetection/blob/release/0.2/docs/MODEL_ZOO_cn.md)
-
-### Start the service
-```
-python3 -m paddle_serving_server.serve --model serving_server --port 9292 --gpu_id 0
-```
-
-### Perform prediction
-```
-python3 test_client.py 000000570688.jpg
-```
-
-Image with bounding boxes and json result would be saved in `output` folder.
--- a/python/examples/cascade_rcnn/README_CN.md
+++ b/python/examples/cascade_rcnn/README_CN.md
-# 使用Paddle Serving部署Cascade RCNN模型
-
-(简体中文|[English](./README.md))
-
-## 获得Cascade RCNN模型
-```
-sh get_data.sh
-```
-如果你想要更多的检测模型，请参考[Paddle检测模型库](https://github.com/PaddlePaddle/PaddleDetection/blob/release/0.2/docs/MODEL_ZOO_cn.md)
-
-### 启动服务
-```
-python3 -m paddle_serving_server.serve --model serving_server --port 9292 --gpu_id 0
-```
-
-### 执行预测
-```
-python3 test_client.py 000000570688.jpg
-```
-
-客户端已经为图片做好了后处理，在`output`文件夹下存放各个框的json格式信息还有后处理结果图片。
--- a/python/examples/cascade_rcnn/get_data.sh
+++ b/python/examples/cascade_rcnn/get_data.sh
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/cascade_mask_rcnn_r50_vd_fpn_ssld_2x_coco_serving.tar.gz
-tar xf cascade_mask_rcnn_r50_vd_fpn_ssld_2x_coco_serving.tar.gz
--- a/python/examples/cascade_rcnn/label_list.txt
+++ b/python/examples/cascade_rcnn/label_list.txt
-person
-bicycle
-car
-motorcycle
-airplane
-bus
-train
-truck
-boat
-traffic light
-fire hydrant
-stop sign
-parking meter
-bench
-bird
-cat
-dog
-horse
-sheep
-cow
-elephant
-bear
-zebra
-giraffe
-backpack
-umbrella
-handbag
-tie
-suitcase
-frisbee
-skis
-snowboard
-sports ball
-kite
-baseball bat
-baseball glove
-skateboard
-surfboard
-tennis racket
-bottle
-wine glass
-cup
-fork
-knife
-spoon
-bowl
-banana
-apple
-sandwich
-orange
-broccoli
-carrot
-hot dog
-pizza
-donut
-cake
-chair
-couch
-potted plant
-bed
-dining table
-toilet
-tv
-laptop
-mouse
-remote
-keyboard
-cell phone
-microwave
-oven
-toaster
-sink
-refrigerator
-book
-clock
-vase
-scissors
-teddy bear
-hair drier
-toothbrush
--- a/python/examples/cascade_rcnn/test_client.py
+++ b/python/examples/cascade_rcnn/test_client.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-import sys
-import numpy as np
-from paddle_serving_client import Client
-from paddle_serving_app.reader import *
-import cv2
-
-preprocess = DetectionSequential([
-        DetectionFile2Image(),
-        DetectionResize((800, 1333), True, interpolation=2), 
-        DetectionNormalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225], True),
-        DetectionTranspose((2,0,1)),
-        DetectionPadStride(32)
-])
-
-postprocess = RCNNPostprocess("label_list.txt", "output")
-client = Client()
-
-client.load_client_config("serving_client/serving_client_conf.prototxt")
-client.connect(['127.0.0.1:9292'])
-
-im, im_info = preprocess(sys.argv[1])
-fetch_map = client.predict(
-    feed={
-        "image": im,
-        "im_shape": np.array(list(im.shape[1:])).reshape(-1),
-        "scale_factor": im_info['scale_factor'],
-    },
-    fetch=["save_infer_model/scale_0.tmp_1"],
-    batch=False)
-print(fetch_map)
-fetch_map["image"] = sys.argv[1]
-postprocess(fetch_map)
--- a/python/examples/criteo_ctr/README.md
+++ b/python/examples/criteo_ctr/README.md
-## CTR Prediction Service
-
-([简体中文](./README_CN.md)|English)
-
-### download criteo dataset
-```
-sh get_data.sh
-```
-
-### download inference model
-```
-wget https://paddle-serving.bj.bcebos.com/criteo_ctr_example/criteo_ctr_demo_model.tar.gz
-tar xf criteo_ctr_demo_model.tar.gz
-mv models/ctr_client_conf .
-mv models/ctr_serving_model .
-```
-the directories like `ctr_serving_model` and `ctr_client_conf` will appear.
-
-### Start RPC Inference Service
-
-```
-python3 -m paddle_serving_server.serve --model ctr_serving_model/ --port 9292 #CPU RPC Service
-python3 -m paddle_serving_server.serve --model ctr_serving_model/ --port 9292 --gpu_ids 0 #RPC Service on GPU 0
-```
-
-### RPC Infer
-
-```
-python3 test_client.py ctr_client_conf/serving_client_conf.prototxt raw_data/part-0
-```
-the latency will display in the end.
--- a/python/examples/criteo_ctr/README_CN.md
+++ b/python/examples/criteo_ctr/README_CN.md
-## CTR预测服务
-
-(简体中文|[English](./README.md))
-
-### 获取样例数据
-```
-sh get_data.sh
-```
-
-### 下载模型
-```
-wget https://paddle-serving.bj.bcebos.com/criteo_ctr_example/criteo_ctr_demo_model.tar.gz
-tar xf criteo_ctr_demo_model.tar.gz
-mv models/ctr_client_conf .
-mv models/ctr_serving_model .
-```
-会在当前目录出现`ctr_serving_model` 和 `ctr_client_conf`文件夹。
-
-### 启动RPC预测服务
-
-```
-python3 -m paddle_serving_server.serve --model ctr_serving_model/ --port 9292 #启动CPU预测服务
-python3 -m paddle_serving_server.serve --model ctr_serving_model/ --port 9292 --gpu_ids 0 #在GPU 0上启动预测服务
-```
-
-### 执行预测
-
-```
-python3 test_client.py ctr_client_conf/serving_client_conf.prototxt raw_data/part-0
-```
-预测完毕会输出预测过程的耗时。
--- a/python/examples/criteo_ctr/args.py
+++ b/python/examples/criteo_ctr/args.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-import argparse
-
-
-def parse_args():
-    parser = argparse.ArgumentParser(description="PaddlePaddle CTR example")
-    parser.add_argument(
-        '--train_data_path',
-        type=str,
-        default='./data/raw/train.txt',
-        help="The path of training dataset")
-    parser.add_argument(
-        '--sparse_only',
-        type=bool,
-        default=False,
-        help="Whether we use sparse features only")
-    parser.add_argument(
-        '--test_data_path',
-        type=str,
-        default='./data/raw/valid.txt',
-        help="The path of testing dataset")
-    parser.add_argument(
-        '--batch_size',
-        type=int,
-        default=1000,
-        help="The size of mini-batch (default:1000)")
-    parser.add_argument(
-        '--embedding_size',
-        type=int,
-        default=10,
-        help="The size for embedding layer (default:10)")
-    parser.add_argument(
-        '--num_passes',
-        type=int,
-        default=10,
-        help="The number of passes to train (default: 10)")
-    parser.add_argument(
-        '--model_output_dir',
-        type=str,
-        default='models',
-        help='The path for model to store (default: models)')
-    parser.add_argument(
-        '--sparse_feature_dim',
-        type=int,
-        default=1000001,
-        help='sparse feature hashing space for index processing')
-    parser.add_argument(
-        '--is_local',
-        type=int,
-        default=1,
-        help='Local train or distributed train (default: 1)')
-    parser.add_argument(
-        '--cloud_train',
-        type=int,
-        default=0,
-        help='Local train or distributed train on paddlecloud (default: 0)')
-    parser.add_argument(
-        '--async_mode',
-        action='store_true',
-        default=False,
-        help='Whether start pserver in async mode to support ASGD')
-    parser.add_argument(
-        '--no_split_var',
-        action='store_true',
-        default=False,
-        help='Whether split variables into blocks when update_method is pserver')
-    parser.add_argument(
-        '--role',
-        type=str,
-        default='pserver',  # trainer or pserver
-        help='The path for model to store (default: models)')
-    parser.add_argument(
-        '--endpoints',
-        type=str,
-        default='127.0.0.1:6000',
-        help='The pserver endpoints, like: 127.0.0.1:6000,127.0.0.1:6001')
-    parser.add_argument(
-        '--current_endpoint',
-        type=str,
-        default='127.0.0.1:6000',
-        help='The path for model to store (default: 127.0.0.1:6000)')
-    parser.add_argument(
-        '--trainer_id',
-        type=int,
-        default=0,
-        help='The path for model to store (default: models)')
-    parser.add_argument(
-        '--trainers',
-        type=int,
-        default=1,
-        help='The num of trianers, (default: 1)')
-    return parser.parse_args()
--- a/python/examples/criteo_ctr/benchmark.py
+++ b/python/examples/criteo_ctr/benchmark.py
-# -*- coding: utf-8 -*-
-#
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-from __future__ import unicode_literals, absolute_import
-import os
-import sys
-import time
-from paddle_serving_client import Client
-from paddle_serving_client.utils import MultiThreadRunner
-from paddle_serving_client.utils import benchmark_args
-import requests
-import json
-import criteo_reader as criteo
-
-args = benchmark_args()
-
-
-def single_func(idx, resource):
-    batch = 1
-    buf_size = 100
-    dataset = criteo.CriteoDataset()
-    dataset.setup(1000001)
-    test_filelists = [
-        "./raw_data/part-%d" % x for x in range(len(os.listdir("./raw_data")))
-    ]
-    reader = dataset.infer_reader(test_filelists[len(test_filelists) - 40:],
-                                  batch, buf_size)
-    if args.request == "rpc":
-        fetch = ["prob"]
-        client = Client()
-        client.load_client_config(args.model)
-        client.connect([resource["endpoint"][idx % len(resource["endpoint"])]])
-
-        start = time.time()
-        for i in range(1000):
-            if args.batch_size == 1:
-                data = reader().next()
-                feed_dict = {}
-                for i in range(1, 27):
-                    feed_dict["sparse_{}".format(i - 1)] = data[0][i]
-                result = client.predict(feed=feed_dict, fetch=fetch)
-            else:
-                print("unsupport batch size {}".format(args.batch_size))
-
-    elif args.request == "http":
-        raise ("Not support http service.")
-    end = time.time()
-    return [[end - start]]
-
-
-if __name__ == '__main__':
-    multi_thread_runner = MultiThreadRunner()
-    endpoint_list = ["127.0.0.1:9292"]
-    #endpoint_list = endpoint_list + endpoint_list + endpoint_list
-    result = multi_thread_runner.run(single_func, args.thread,
-                                     {"endpoint": endpoint_list})
-    #result = single_func(0, {"endpoint": endpoint_list})
-    avg_cost = 0
-    for i in range(args.thread):
-        avg_cost += result[0][i]
-    avg_cost = avg_cost / args.thread
-    print("average total cost {} s.".format(avg_cost))
--- a/python/examples/criteo_ctr/benchmark.sh
+++ b/python/examples/criteo_ctr/benchmark.sh
-rm profile_log
-for thread_num in 1 2 4 8 16
-do
-    $PYTHONROOT/bin/python benchmark.py --thread $thread_num --model ctr_client_conf/serving_client_conf.prototxt --request rpc > profile 2>&1
-    echo "========================================"
-    echo "batch size : $batch_size" >> profile_log
-    $PYTHONROOT/bin/python ../util/show_profile.py profile $thread_num >> profile_log
-    tail -n 1 profile >> profile_log
-done
--- a/python/examples/criteo_ctr/benchmark_batch.py
+++ b/python/examples/criteo_ctr/benchmark_batch.py
-# -*- coding: utf-8 -*-
-#
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-from __future__ import unicode_literals, absolute_import
-import os
-import sys
-import time
-from paddle_serving_client import Client
-from paddle_serving_client.utils import MultiThreadRunner
-from paddle_serving_client.utils import benchmark_args
-import requests
-import json
-import criteo_reader as criteo
-
-args = benchmark_args()
-
-
-def single_func(idx, resource):
-    batch = 1
-    buf_size = 100
-    dataset = criteo.CriteoDataset()
-    dataset.setup(1000001)
-    test_filelists = [
-        "./raw_data/part-%d" % x for x in range(len(os.listdir("./raw_data")))
-    ]
-    reader = dataset.infer_reader(test_filelists[len(test_filelists) - 40:],
-                                  batch, buf_size)
-    if args.request == "rpc":
-        fetch = ["prob"]
-        client = Client()
-        client.load_client_config(args.model)
-        client.connect([resource["endpoint"][idx % len(resource["endpoint"])]])
-
-        start = time.time()
-        for i in range(1000):
-            if args.batch_size >= 1:
-                feed_batch = []
-                for bi in range(args.batch_size):
-                    feed_dict = {}
-                    data = reader().next()
-                    for i in range(1, 27):
-                        feed_dict["sparse_{}".format(i - 1)] = data[0][i]
-                    feed_batch.append(feed_dict)
-                result = client.predict(feed=feed_batch, fetch=fetch)
-            else:
-                print("unsupport batch size {}".format(args.batch_size))
-
-    elif args.request == "http":
-        raise ("no batch predict for http")
-    end = time.time()
-    return [[end - start]]
-
-
-if __name__ == '__main__':
-    multi_thread_runner = MultiThreadRunner()
-    endpoint_list = ["127.0.0.1:9292"]
-    #endpoint_list = endpoint_list + endpoint_list + endpoint_list
-    result = multi_thread_runner.run(single_func, args.thread,
-                                     {"endpoint": endpoint_list})
-    #result = single_func(0, {"endpoint": endpoint_list})
-    avg_cost = 0
-    for i in range(args.thread):
-        avg_cost += result[0][i]
-    avg_cost = avg_cost / args.thread
-    print("average total cost {} s.".format(avg_cost))
--- a/python/examples/criteo_ctr/benchmark_batch.sh
+++ b/python/examples/criteo_ctr/benchmark_batch.sh
-rm profile_log
-for thread_num in 1 2 4 8 16
-do
-for batch_size in 1 2 4 8 16 32 64 128 256 512
-do
-    $PYTHONROOT/bin/python benchmark_batch.py --thread $thread_num --batch_size $batch_size --model serving_client_conf/serving_client_conf.prototxt --request rpc > profile 2>&1
-    echo "========================================"
-    echo "batch size : $batch_size" >> profile_log
-    $PYTHONROOT/bin/python ../util/show_profile.py profile $thread_num >> profile_log
-    tail -n 1 profile >> profile_log
-done
-done
--- a/python/examples/criteo_ctr/clean.sh
+++ b/python/examples/criteo_ctr/clean.sh
-rm -rf *pyc kvdb raw_data ctr_client_conf ctr_serving_model ctr_data.tar.gz *~
--- a/python/examples/criteo_ctr/get_data.sh
+++ b/python/examples/criteo_ctr/get_data.sh
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/data/ctr_prediction/ctr_data.tar.gz
-tar -zxvf ctr_data.tar.gz
--- a/python/examples/criteo_ctr/local_train.py
+++ b/python/examples/criteo_ctr/local_train.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-from __future__ import print_function
-
-from args import parse_args
-import os
-import paddle.fluid as fluid
-import sys
-from network_conf import dnn_model
-
-dense_feature_dim = 13
-
-
-def train():
-    args = parse_args()
-    sparse_only = args.sparse_only
-    if not os.path.isdir(args.model_output_dir):
-        os.mkdir(args.model_output_dir)
-    dense_input = fluid.layers.data(
-        name="dense_input", shape=[dense_feature_dim], dtype='float32')
-    sparse_input_ids = [
-        fluid.layers.data(
-            name="C" + str(i), shape=[1], lod_level=1, dtype="int64")
-        for i in range(1, 27)
-    ]
-    label = fluid.layers.data(name='label', shape=[1], dtype='int64')
-
-    #nn_input = None if sparse_only else dense_input
-    nn_input = dense_input
-    predict_y, loss, auc_var, batch_auc_var = dnn_model(
-        nn_input, sparse_input_ids, label, args.embedding_size,
-        args.sparse_feature_dim)
-
-    optimizer = fluid.optimizer.SGD(learning_rate=1e-4)
-    optimizer.minimize(loss)
-
-    exe = fluid.Executor(fluid.CPUPlace())
-    exe.run(fluid.default_startup_program())
-    dataset = fluid.DatasetFactory().create_dataset("InMemoryDataset")
-    dataset.set_use_var([dense_input] + sparse_input_ids + [label])
-
-    python_executable = "python"
-    pipe_command = "{} criteo_reader.py {}".format(python_executable,
-                                                   args.sparse_feature_dim)
-
-    dataset.set_pipe_command(pipe_command)
-    dataset.set_batch_size(128)
-    thread_num = 10
-    dataset.set_thread(thread_num)
-
-    whole_filelist = [
-        "raw_data/part-%d" % x for x in range(len(os.listdir("raw_data")))
-    ]
-
-    dataset.set_filelist(whole_filelist[:thread_num])
-    dataset.load_into_memory()
-
-    epochs = 1
-    for i in range(epochs):
-        exe.train_from_dataset(
-            program=fluid.default_main_program(), dataset=dataset, debug=True)
-        print("epoch {} finished".format(i))
-
-    import paddle_serving_client.io as server_io
-    feed_var_dict = {}
-    for i, sparse in enumerate(sparse_input_ids):
-        feed_var_dict["sparse_{}".format(i)] = sparse
-    fetch_var_dict = {"prob": predict_y}
-
-    server_io.save_model("ctr_serving_model", "ctr_client_conf", feed_var_dict,
-                         fetch_var_dict, fluid.default_main_program())
-
-
-if __name__ == '__main__':
-    train()
--- a/python/examples/criteo_ctr/network_conf.py
+++ b/python/examples/criteo_ctr/network_conf.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-import paddle.fluid as fluid
-import math
-
-
-def dnn_model(dense_input, sparse_inputs, label, embedding_size,
-              sparse_feature_dim):
-    def embedding_layer(input):
-        emb = fluid.layers.embedding(
-            input=input,
-            is_sparse=True,
-            is_distributed=False,
-            size=[sparse_feature_dim, embedding_size],
-            param_attr=fluid.ParamAttr(
-                name="SparseFeatFactors",
-                initializer=fluid.initializer.Uniform()))
-        return fluid.layers.sequence_pool(input=emb, pool_type='sum')
-
-    def mlp_input_tensor(emb_sums, dense_tensor):
-        if isinstance(dense_tensor, fluid.Variable):
-            return fluid.layers.concat(emb_sums, axis=1)
-        else:
-            return fluid.layers.concat(emb_sums + [dense_tensor], axis=1)
-
-    def mlp(mlp_input):
-        fc1 = fluid.layers.fc(input=mlp_input,
-                              size=400,
-                              act='relu',
-                              param_attr=fluid.ParamAttr(
-                                  initializer=fluid.initializer.Normal(
-                                      scale=1 / math.sqrt(mlp_input.shape[1]))))
-        fc2 = fluid.layers.fc(input=fc1,
-                              size=400,
-                              act='relu',
-                              param_attr=fluid.ParamAttr(
-                                  initializer=fluid.initializer.Normal(
-                                      scale=1 / math.sqrt(fc1.shape[1]))))
-        fc3 = fluid.layers.fc(input=fc2,
-                              size=400,
-                              act='relu',
-                              param_attr=fluid.ParamAttr(
-                                  initializer=fluid.initializer.Normal(
-                                      scale=1 / math.sqrt(fc2.shape[1]))))
-        pre = fluid.layers.fc(input=fc3,
-                              size=2,
-                              act='softmax',
-                              param_attr=fluid.ParamAttr(
-                                  initializer=fluid.initializer.Normal(
-                                      scale=1 / math.sqrt(fc3.shape[1]))))
-        return pre
-
-    emb_sums = list(map(embedding_layer, sparse_inputs))
-    mlp_in = mlp_input_tensor(emb_sums, dense_input)
-    predict = mlp(mlp_in)
-    cost = fluid.layers.cross_entropy(input=predict, label=label)
-    avg_cost = fluid.layers.reduce_sum(cost)
-    accuracy = fluid.layers.accuracy(input=predict, label=label)
-    auc_var, batch_auc_var, auc_states = \
-        fluid.layers.auc(input=predict, label=label, num_thresholds=2 ** 12, slide_steps=20)
-    return predict, avg_cost, auc_var, batch_auc_var
--- a/python/examples/criteo_ctr/test_client.py
+++ b/python/examples/criteo_ctr/test_client.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-from paddle_serving_client import Client
-import sys
-import os
-import time
-from paddle_serving_client.metric import auc
-import numpy as np
-import sys
-
-class CriteoReader(object):
-    def __init__(self, sparse_feature_dim):
-        self.cont_min_ = [0, -3, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
-        self.cont_max_ = [
-            20, 600, 100, 50, 64000, 500, 100, 50, 500, 10, 10, 10, 50
-        ]
-        self.cont_diff_ = [
-            20, 603, 100, 50, 64000, 500, 100, 50, 500, 10, 10, 10, 50
-        ]
-        self.hash_dim_ = sparse_feature_dim
-        # here, training data are lines with line_index < train_idx_
-        self.train_idx_ = 41256555
-        self.continuous_range_ = range(1, 14)
-        self.categorical_range_ = range(14, 40)
-
-    def process_line(self, line):
-        features = line.rstrip('\n').split('\t')
-        dense_feature = []
-        sparse_feature = []
-        for idx in self.continuous_range_:
-            if features[idx] == '':
-                dense_feature.append(0.0)
-            else:
-                dense_feature.append((float(features[idx]) - self.cont_min_[idx - 1]) / \
-                                     self.cont_diff_[idx - 1])
-        for idx in self.categorical_range_:
-            sparse_feature.append(
-                [hash(str(idx) + features[idx]) % self.hash_dim_])
-
-        return sparse_feature
-
-py_version = sys.version_info[0]
-
-client = Client()
-client.load_client_config(sys.argv[1])
-client.connect(["127.0.0.1:9292"])
-reader = CriteoReader(1000001)
-batch = 1
-buf_size = 100
-label_list = []
-prob_list = []
-start = time.time()
-f = open(sys.argv[2], 'r')
-for ei in range(10):
-    data = reader.process_line(f.readline())
-    feed_dict = {}
-    for i in range(1, 27):
-        feed_dict["sparse_{}".format(i - 1)] = np.array(data[i-1]).reshape(-1)
-        feed_dict["sparse_{}.lod".format(i - 1)] = [0, len(data[i-1])]
-    fetch_map = client.predict(feed=feed_dict, fetch=["prob"])
-    print(fetch_map)
-end = time.time()
-f.close()
--- a/python/examples/criteo_ctr/test_server.py
+++ b/python/examples/criteo_ctr/test_server.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-import os
-import sys
-from paddle_serving_server import OpMaker
-from paddle_serving_server import OpSeqMaker
-from paddle_serving_server import Server
-
-op_maker = OpMaker()
-read_op = op_maker.create('general_reader')
-general_infer_op = op_maker.create('general_infer')
-response_op = op_maker.create('general_response')
-
-op_seq_maker = OpSeqMaker()
-op_seq_maker.add_op(read_op)
-op_seq_maker.add_op(general_infer_op)
-op_seq_maker.add_op(response_op)
-
-server = Server()
-server.set_op_sequence(op_seq_maker.get_op_sequence())
-server.load_model_config(sys.argv[1])
-server.prepare_server(workdir="work_dir1", port=9292, device="cpu")
-server.run_server()
--- a/python/examples/criteo_ctr_with_cube/README.md
+++ b/python/examples/criteo_ctr_with_cube/README.md
-## Criteo CTR with Sparse Parameter Indexing Service
-
-([简体中文](./README_CN.md)|English)
-
-### Get Sample Dataset
-
-go to directory `python/examples/criteo_ctr_with_cube`
-```
-sh get_data.sh
-```
-
-### Download Model and Sparse Parameter Sequence Files
-```
-wget https://paddle-serving.bj.bcebos.com/unittest/ctr_cube_unittest.tar.gz
-tar xf ctr_cube_unittest.tar.gz
-mv models/ctr_client_conf ./
-mv models/ctr_serving_model_kv ./
-mv models/data ./cube/
-```
-the model will be in ./ctr_server_model_kv and ./ctr_client_config.
-
-### Start Sparse Parameter Indexing Service
-```
-wget https://paddle-serving.bj.bcebos.com/others/cube_app.tar.gz
-tar xf cube_app.tar.gz
-mv cube_app/cube* ./cube/
-sh cube_prepare.sh &
-```
-
-Here, the sparse parameter is loaded by cube sparse parameter indexing service Cube.
-
-### Start RPC Predictor, the number of serving thread is 4（configurable in test_server.py）
-
-```
-python3 test_server.py ctr_serving_model_kv 
-```
-
-### Run Prediction
-
-```
-python3 test_client.py ctr_client_conf/serving_client_conf.prototxt ./raw_data
-```
-
-### Benchmark
-
-CPU ：Intel(R) Xeon(R) CPU 6148 @ 2.40GHz 
-
-Model ：[Criteo CTR](https://github.com/PaddlePaddle/Serving/blob/develop/python/examples/criteo_ctr_with_cube/network_conf.py)
-
-server core/thread num ： 4/8
-
-Run
-```
-bash benchmark.sh
-```
-1000 batches will be sent by every client
-
-| client  thread num | prepro | client infer | op0    | op1   | op2    | postpro | avg_latency | qps   |
-| ------------------ | ------ | ------------ | ------ | ----- | ------ | ------- | ----- | ----- |
-| 1                  | 0.035  | 1.596        | 0.021  | 0.518 | 0.0024 | 0.0025  | 6.774 | 147.7 |
-| 2                  | 0.034  | 1.780        | 0.027  | 0.463 | 0.0020 | 0.0023  | 6.931 | 288.3 |
-| 4                  | 0.038  | 2.954        | 0.025  | 0.455 | 0.0019 | 0.0027  | 8.378 | 477.5 |
-| 8                  | 0.044  | 8.230        | 0.028  | 0.464 | 0.0023 | 0.0034  | 14.191 | 563.8 |
-| 16                 | 0.048  | 21.037       | 0.028  | 0.455 | 0.0025 | 0.0041  | 27.236 | 587.5 |
-
-the average latency of threads
-
-![avg cost](../../../doc/images/criteo-cube-benchmark-avgcost.png)
-
-The QPS is 
-
-![qps](../../../doc/images/criteo-cube-benchmark-qps.png)
--- a/python/examples/criteo_ctr_with_cube/README_CN.md
+++ b/python/examples/criteo_ctr_with_cube/README_CN.md
-## 带稀疏参数索引服务的CTR预测服务
-(简体中文|[English](./README.md))
-
-### 获取样例数据
-进入目录 `python/examples/criteo_ctr_with_cube`
-```
-sh get_data.sh
-```
-
-### 下载模型和稀疏参数序列文件
-```
-wget https://paddle-serving.bj.bcebos.com/unittest/ctr_cube_unittest.tar.gz
-tar xf ctr_cube_unittest.tar.gz
-mv models/ctr_client_conf ./
-mv models/ctr_serving_model_kv ./
-mv models/data ./cube/
-```
-执行脚本后会在当前目录有ctr_server_model_kv和ctr_client_config文件夹。
-
-### 启动稀疏参数索引服务
-```
-wget https://paddle-serving.bj.bcebos.com/others/cube_app.tar.gz
-tar xf cube_app.tar.gz
-mv cube_app/cube* ./cube/
-sh cube_prepare.sh &
-```
-
-此处，模型当中的稀疏参数会被存放在稀疏参数索引服务Cube当中。
-
-### 启动RPC预测服务，服务端线程数为4（可在test_server.py配置）
-
-```
-python3 test_server.py ctr_serving_model_kv 
-```
-
-### 执行预测
-
-```
-python3 test_client.py ctr_client_conf/serving_client_conf.prototxt ./raw_data
-```
-
-### Benchmark
-
-设备 ：Intel(R) Xeon(R) CPU 6148 @ 2.40GHz 
-
-模型 ：[Criteo CTR](https://github.com/PaddlePaddle/Serving/blob/develop/python/examples/criteo_ctr_with_cube/network_conf.py)
-
-server core/thread num ： 4/8
-
-执行
-```
-bash benchmark.sh
-```
-客户端每个线程会发送1000个batch
-
-| client  thread num | prepro | client infer | op0    | op1   | op2    | postpro | avg_latency | qps   |
-| ------------------ | ------ | ------------ | ------ | ----- | ------ | ------- | ----- | ----- |
-| 1                  | 0.035  | 1.596        | 0.021  | 0.518 | 0.0024 | 0.0025  | 6.774 | 147.7 |
-| 2                  | 0.034  | 1.780        | 0.027  | 0.463 | 0.0020 | 0.0023  | 6.931 | 288.3 |
-| 4                  | 0.038  | 2.954        | 0.025  | 0.455 | 0.0019 | 0.0027  | 8.378 | 477.5 |
-| 8                  | 0.044  | 8.230        | 0.028  | 0.464 | 0.0023 | 0.0034  | 14.191 | 563.8 |
-| 16                 | 0.048  | 21.037       | 0.028  | 0.455 | 0.0025 | 0.0041  | 27.236 | 587.5 |
-
-平均每个线程耗时图如下
-
-![avg cost](../../../doc/images/criteo-cube-benchmark-avgcost.png)
-
-每个线程QPS耗时如下
-
-![qps](../../../doc/images/criteo-cube-benchmark-qps.png)
--- a/python/examples/criteo_ctr_with_cube/criteo_reader.py
+++ b/python/examples/criteo_ctr_with_cube/criteo_reader.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-import sys
-import paddle.fluid.incubate.data_generator as dg
-
-
-class CriteoDataset(dg.MultiSlotDataGenerator):
-    def setup(self, sparse_feature_dim):
-        self.cont_min_ = [0, -3, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
-        self.cont_max_ = [
-            20, 600, 100, 50, 64000, 500, 100, 50, 500, 10, 10, 10, 50
-        ]
-        self.cont_diff_ = [
-            20, 603, 100, 50, 64000, 500, 100, 50, 500, 10, 10, 10, 50
-        ]
-        self.hash_dim_ = sparse_feature_dim
-        # here, training data are lines with line_index < train_idx_
-        self.train_idx_ = 41256555
-        self.continuous_range_ = range(1, 14)
-        self.categorical_range_ = range(14, 40)
-
-    def _process_line(self, line):
-        features = line.rstrip('\n').split('\t')
-        dense_feature = []
-        sparse_feature = []
-        for idx in self.continuous_range_:
-            if features[idx] == '':
-                dense_feature.append(0.0)
-            else:
-                dense_feature.append((float(features[idx]) - self.cont_min_[idx - 1]) / \
-                                     self.cont_diff_[idx - 1])
-        for idx in self.categorical_range_:
-            sparse_feature.append(
-                [hash(str(idx) + features[idx]) % self.hash_dim_])
-
-        return dense_feature, sparse_feature, [int(features[0])]
-
-    def infer_reader(self, filelist, batch, buf_size):
-        def local_iter():
-            for fname in filelist:
-                with open(fname.strip(), "r") as fin:
-                    for line in fin:
-                        dense_feature, sparse_feature, label = self._process_line(
-                            line)
-                        #yield dense_feature, sparse_feature, label
-                        yield [dense_feature] + sparse_feature + [label]
-
-        import paddle
-        batch_iter = paddle.batch(
-            paddle.reader.shuffle(
-                local_iter, buf_size=buf_size),
-            batch_size=batch)
-        return batch_iter
-
-    def generate_sample(self, line):
-        def data_iter():
-            dense_feature, sparse_feature, label = self._process_line(line)
-            feature_name = ["dense_input"]
-            for idx in self.categorical_range_:
-                feature_name.append("C" + str(idx - 13))
-            feature_name.append("label")
-            yield zip(feature_name, [dense_feature] + sparse_feature + [label])
-
-        return data_iter
-
-
-if __name__ == "__main__":
-    criteo_dataset = CriteoDataset()
-    criteo_dataset.setup(int(sys.argv[1]))
-    criteo_dataset.run_from_stdin()
--- a/python/examples/criteo_ctr_with_cube/cube/conf/cube.conf
+++ b/python/examples/criteo_ctr_with_cube/cube/conf/cube.conf
-[{
-    "dict_name": "test_dict",
-    "shard": 1,
-    "dup": 1,
-    "timeout": 200,
-    "retry": 3,
-    "backup_request": 100,
-    "type": "ipport_list",
-    "load_balancer": "rr",
-    "nodes": [{
-        "ipport_list": "list://127.0.0.1:8027"
-    }]
-}]
--- a/python/examples/criteo_ctr_with_cube/cube/conf/gflags.conf
+++ b/python/examples/criteo_ctr_with_cube/cube/conf/gflags.conf
--port=8027
--dict_split=1
--in_mem=true
--log_dir=./log/
--- a/python/examples/criteo_ctr_with_cube/cube/keys
+++ b/python/examples/criteo_ctr_with_cube/cube/keys
-1
-2
-3
-4
-5
-6
-7
-8
-9
-10
--- a/python/examples/criteo_ctr_with_cube/cube_prepare.sh
+++ b/python/examples/criteo_ctr_with_cube/cube_prepare.sh
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-#! /bin/bash
-
-mkdir -p cube_model
-mkdir -p cube/data
-./cube/cube-builder -dict_name=test_dict -job_mode=base -last_version=0 -cur_version=0 -depend_version=0 -input_path=./cube_model -output_path=${PWD}/cube/data -shard_num=1  -only_build=false
-cd cube && ./cube
--- a/python/examples/criteo_ctr_with_cube/get_data.sh
+++ b/python/examples/criteo_ctr_with_cube/get_data.sh
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/data/ctr_prediction/ctr_data.tar.gz
-tar -zxvf ctr_data.tar.gz
--- a/python/examples/criteo_ctr_with_cube/local_train.py
+++ b/python/examples/criteo_ctr_with_cube/local_train.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-from __future__ import print_function
-
-from args import parse_args
-import os
-import paddle.fluid as fluid
-import paddle
-import sys
-from network_conf import dnn_model
-
-dense_feature_dim = 13
-
-paddle.enable_static()
-def train():
-    args = parse_args()
-    sparse_only = args.sparse_only
-    if not os.path.isdir(args.model_output_dir):
-        os.mkdir(args.model_output_dir)
-    dense_input = fluid.layers.data(
-        name="dense_input", shape=[dense_feature_dim], dtype='float32')
-    sparse_input_ids = [
-        fluid.layers.data(
-            name="C" + str(i), shape=[1], lod_level=1, dtype="int64")
-        for i in range(1, 27)
-    ]
-    label = fluid.layers.data(name='label', shape=[1], dtype='int64')
-
-    #nn_input = None if sparse_only else dense_input
-    nn_input = dense_input
-    predict_y, loss, auc_var, batch_auc_var, infer_vars = dnn_model(
-        nn_input, sparse_input_ids, label, args.embedding_size,
-        args.sparse_feature_dim)
-
-    optimizer = fluid.optimizer.SGD(learning_rate=1e-4)
-    optimizer.minimize(loss)
-
-    exe = fluid.Executor(fluid.CPUPlace())
-    exe.run(fluid.default_startup_program())
-    dataset = fluid.DatasetFactory().create_dataset("InMemoryDataset")
-    dataset.set_use_var([dense_input] + sparse_input_ids + [label])
-
-    python_executable = "python3.6"
-    pipe_command = "{} criteo_reader.py {}".format(python_executable,
-                                                   args.sparse_feature_dim)
-
-    dataset.set_pipe_command(pipe_command)
-    dataset.set_batch_size(128)
-    thread_num = 10
-    dataset.set_thread(thread_num)
-
-    whole_filelist = [
-        "raw_data/part-%d" % x for x in range(len(os.listdir("raw_data")))
-    ]
-
-    print(whole_filelist)
-    dataset.set_filelist(whole_filelist[:100])
-    dataset.load_into_memory()
-    fluid.layers.Print(auc_var)
-    epochs = 1
-    for i in range(epochs):
-        exe.train_from_dataset(
-            program=fluid.default_main_program(), dataset=dataset, debug=True)
-        print("epoch {} finished".format(i))
-
-    import paddle_serving_client.io as server_io
-    feed_var_dict = {}
-    feed_var_dict['dense_input'] = dense_input
-    for i, sparse in enumerate(sparse_input_ids):
-        feed_var_dict["embedding_{}.tmp_0".format(i)] = sparse
-    fetch_var_dict = {"prob": predict_y}
-
-    feed_kv_dict = {}
-    feed_kv_dict['dense_input'] = dense_input
-    for i, emb in enumerate(infer_vars):
-        feed_kv_dict["embedding_{}.tmp_0".format(i)] = emb
-    fetch_var_dict = {"prob": predict_y}
-
-    server_io.save_model("ctr_serving_model", "ctr_client_conf", feed_var_dict,
-                         fetch_var_dict, fluid.default_main_program())
-
-    server_io.save_model("ctr_serving_model_kv", "ctr_client_conf_kv",
-                         feed_kv_dict, fetch_var_dict,
-                         fluid.default_main_program())
-
-
-if __name__ == '__main__':
-    train()
--- a/python/examples/criteo_ctr_with_cube/network_conf.py
+++ b/python/examples/criteo_ctr_with_cube/network_conf.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-import paddle.fluid as fluid
-import math
-
-
-def dnn_model(dense_input, sparse_inputs, label, embedding_size,
-              sparse_feature_dim):
-    def embedding_layer(input):
-        emb = fluid.layers.embedding(
-            input=input,
-            is_sparse=True,
-            is_distributed=False,
-            size=[sparse_feature_dim, embedding_size],
-            param_attr=fluid.ParamAttr(
-                name="SparseFeatFactors",
-                initializer=fluid.initializer.Uniform()))
-        x = fluid.layers.sequence_pool(input=emb, pool_type='sum')
-        return emb, x
-
-    def mlp_input_tensor(emb_sums, dense_tensor):
-        #if isinstance(dense_tensor, fluid.Variable):
-        #    return fluid.layers.concat(emb_sums, axis=1)
-        #else:
-        return fluid.layers.concat(emb_sums + [dense_tensor], axis=1)
-
-    def mlp(mlp_input):
-        fc1 = fluid.layers.fc(input=mlp_input,
-                              size=400,
-                              act='relu',
-                              param_attr=fluid.ParamAttr(
-                                  initializer=fluid.initializer.Normal(
-                                      scale=1 / math.sqrt(mlp_input.shape[1]))))
-        fc2 = fluid.layers.fc(input=fc1,
-                              size=400,
-                              act='relu',
-                              param_attr=fluid.ParamAttr(
-                                  initializer=fluid.initializer.Normal(
-                                      scale=1 / math.sqrt(fc1.shape[1]))))
-        fc3 = fluid.layers.fc(input=fc2,
-                              size=400,
-                              act='relu',
-                              param_attr=fluid.ParamAttr(
-                                  initializer=fluid.initializer.Normal(
-                                      scale=1 / math.sqrt(fc2.shape[1]))))
-        pre = fluid.layers.fc(input=fc3,
-                              size=2,
-                              act='softmax',
-                              param_attr=fluid.ParamAttr(
-                                  initializer=fluid.initializer.Normal(
-                                      scale=1 / math.sqrt(fc3.shape[1]))))
-        return pre
-
-    emb_pair_sums = list(map(embedding_layer, sparse_inputs))
-    emb_sums = [x[1] for x in emb_pair_sums]
-    infer_vars = [x[0] for x in emb_pair_sums]
-    mlp_in = mlp_input_tensor(emb_sums, dense_input)
-    predict = mlp(mlp_in)
-    cost = fluid.layers.cross_entropy(input=predict, label=label)
-    avg_cost = fluid.layers.reduce_sum(cost)
-    accuracy = fluid.layers.accuracy(input=predict, label=label)
-    auc_var, batch_auc_var, auc_states = \
-        fluid.layers.auc(input=predict, label=label, num_thresholds=2 ** 12, slide_steps=20)
-    return predict, avg_cost, auc_var, batch_auc_var, infer_vars
--- a/python/examples/criteo_ctr_with_cube/test_client.py
+++ b/python/examples/criteo_ctr_with_cube/test_client.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-from paddle_serving_client import Client
-import sys
-import os
-import criteo_reader as criteo
-import time
-from paddle_serving_client.metric import auc
-import numpy as np
-py_version = sys.version_info[0]
-
-client = Client()
-client.load_client_config(sys.argv[1])
-client.connect(["127.0.0.1:9292"])
-
-batch = 1
-buf_size = 100
-dataset = criteo.CriteoDataset()
-dataset.setup(1000001)
-test_filelists = ["{}/part-0".format(sys.argv[2])]
-reader = dataset.infer_reader(test_filelists, batch, buf_size)
-label_list = []
-prob_list = []
-start = time.time()
-for ei in range(100):
-    if py_version == 2:
-        data = reader().next()
-    else:
-        data = reader().__next__()
-    feed_dict = {}
-    feed_dict['dense_input'] = np.array(data[0][0]).reshape(1, len(data[0][0]))
-
-    for i in range(1, 27):
-        feed_dict["embedding_{}.tmp_0".format(i - 1)] = np.array(data[0][i]).reshape(len(data[0][i]))
-        feed_dict["embedding_{}.tmp_0.lod".format(i - 1)] = [0, len(data[0][i])]
-    fetch_map = client.predict(feed=feed_dict, fetch=["prob"],batch=True)
-    print(fetch_map)
-    prob_list.append(fetch_map['prob'][0][1])
-    label_list.append(data[0][-1][0])
-
-
-end = time.time()
-print(end - start)
-
--- a/python/examples/criteo_ctr_with_cube/test_server.py
+++ b/python/examples/criteo_ctr_with_cube/test_server.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# pylint: disable=doc-string-missing
-
-import os
-import sys
-from paddle_serving_server import OpMaker
-from paddle_serving_server import OpSeqMaker
-from paddle_serving_server import Server
-
-op_maker = OpMaker()
-read_op = op_maker.create('general_reader')
-general_dist_kv_infer_op = op_maker.create('general_dist_kv_infer')
-response_op = op_maker.create('general_response')
-
-op_seq_maker = OpSeqMaker()
-op_seq_maker.add_op(read_op)
-op_seq_maker.add_op(general_dist_kv_infer_op)
-op_seq_maker.add_op(response_op)
-
-server = Server()
-server.set_op_sequence(op_seq_maker.get_op_sequence())
-server.set_num_threads(4)
-server.load_model_config(sys.argv[1])
-server.prepare_server(
-    workdir="work_dir1",
-    port=9292,
-    device="cpu",
-    cube_conf="./cube/conf/cube.conf")
-server.run_server()
--- a/python/examples/deeplabv3/N0060.jpg
+++ b/python/examples/deeplabv3/N0060.jpg
--- a/python/examples/deeplabv3/README.md
+++ b/python/examples/deeplabv3/README.md
-# Image Segmentation
-
-## Get Model
-
-```
-python3 -m paddle_serving_app.package --get_model deeplabv3
-tar -xzvf deeplabv3.tar.gz
-```
-
-## RPC Service
-
-### Start Service
-
-```
-python3 -m paddle_serving_server.serve --model deeplabv3_server --gpu_ids 0 --port 9494
-```
-
-### Client Prediction
-
-```
-python3 deeplabv3_client.py
-```
--- a/python/examples/deeplabv3/README_CN.md
+++ b/python/examples/deeplabv3/README_CN.md
-# 图像分割
-
-## 获取模型
-
-```
-python3 -m paddle_serving_app.package --get_model deeplabv3
-tar -xzvf deeplabv3.tar.gz
-```
-
-## RPC 服务
-
-### 启动服务端
-
-```
-python3 -m paddle_serving_server.serve --model deeplabv3_server --gpu_ids 0 --port 9494
-```
-
-### 客户端预测
-
-```
-python3 deeplabv3_client.py
--- a/python/examples/deeplabv3/deeplabv3_client.py
+++ b/python/examples/deeplabv3/deeplabv3_client.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-from paddle_serving_client import Client
-from paddle_serving_app.reader import Sequential, File2Image, Resize, Transpose, BGR2RGB, SegPostprocess
-import sys
-import cv2
-
-client = Client()
-client.load_client_config("deeplabv3_client/serving_client_conf.prototxt")
-client.connect(["127.0.0.1:9494"])
-
-preprocess = Sequential(
-    [File2Image(), Resize(
-        (512, 512), interpolation=cv2.INTER_LINEAR)])
-
-postprocess = SegPostprocess(2)
-
-filename = "N0060.jpg"
-im = preprocess(filename)
-fetch_map = client.predict(feed={"image": im}, fetch=["output"])
-fetch_map["filename"] = filename
-postprocess(fetch_map)
--- a/python/examples/detection/README.md
+++ b/python/examples/detection/README.md
-# Serve models from Paddle Detection
-
-(English|[简体中文](./README_CN.md))
-
-### Introduction
-
-PaddleDetection flying paddle target detection development kit is designed to help developers complete the whole development process of detection model formation, training, optimization and deployment faster and better. For details, see [Github](https://github.com/PaddlePaddle/PaddleDetection/tree/master)
-
-This article mainly introduces the deployment of Paddle Detection's dynamic graph model on Serving.
-
-Paddle Detection provides a large number of [Model Zoo](https://github.com/PaddlePaddle/PaddleDetection/blob/master/docs/MODEL_ZOO_cn.md), these model libraries can be used in Paddle Serving with export tools Model. For the export tutorial, please refer to [Paddle Detection Export Model Tutorial (Simplified Chinese)](https://github.com/PaddlePaddle/PaddleDetection/blob/master/deploy/EXPORT_MODEL.md).
-
-### Serving example
-Several examples of PaddleDetection models used in Serving are given in this folder
-All examples support TensorRT.
-
- [Faster RCNN](./faster_rcnn_r50_fpn_1x_coco)
- [PPYOLO](./ppyolo_r50vd_dcn_1x_coco)
- [TTFNet](./ttfnet_darknet53_1x_coco)
- [YOLOv3](./yolov3_darknet53_270e_coco)
- [HRNet](./faster_rcnn_hrnetv2p_w18_1x)
- [Fcos](./fcos_dcn_r50_fpn_1x_coco)
- [SSD](./ssd_vgg16_300_240e_voc/)
--- a/python/examples/detection/README_CN.md
+++ b/python/examples/detection/README_CN.md
-## 使用Paddle Detection模型
-
-([English](./README.md)|简体中文)
-
-### 简介
-
-PaddleDetection飞桨目标检测开发套件，旨在帮助开发者更快更好地完成检测模型的组建、训练、优化及部署等全开发流程。详情参见[Github](https://github.com/PaddlePaddle/PaddleDetection/tree/master)
-
-本文主要是介绍Paddle Detection的动态图模型在Serving上的部署。
-
-### 导出模型
-
-Paddle Detection提供了大量的[模型库](https://github.com/PaddlePaddle/PaddleDetection/blob/master/docs/MODEL_ZOO_cn.md), 这些模型库配合导出工具都可以得到可以用于Paddle Serving的模型。导出教程参见[Paddle Detection模型导出教程](https://github.com/PaddlePaddle/PaddleDetection/blob/master/deploy/EXPORT_MODEL.md)。
-
-### Serving示例
-本文件夹下给出了多个PaddleDetection模型用于Serving的范例
-
- [Faster RCNN](./faster_rcnn_r50_fpn_1x_coco)
- [PPYOLO](./ppyolo_r50vd_dcn_1x_coco)
- [TTFNet](./ttfnet_darknet53_1x_coco)
- [YOLOv3](./yolov3_darknet53_270e_coco)
- [HRNet](./faster_rcnn_hrnetv2p_w18_1x)
- [Fcos](./fcos_dcn_r50_fpn_1x_coco)
- [SSD](./ssd_vgg16_300_240e_voc/)
--- a/python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/000000570688.jpg
+++ b/python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/000000570688.jpg
--- a/python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/README.md
+++ b/python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/README.md
-# Faster RCNN HRNet model on Paddle Serving
-
-([简体中文](./README_CN.md)|English)
-
-### Get The Faster RCNN HRNet Model
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/faster_rcnn_hrnetv2p_w18_1x.tar.gz
-```
-
-### Start the service
-```
-tar xf faster_rcnn_hrnetv2p_w18_1x.tar.gz
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-
-This model support TensorRT, if you want a faster inference, please use `--use_trt`. But you need to do some extra work.
-Please reference to https://github.com/PaddlePaddle/Paddle-Inference-Demo/blob/master/c%2B%2B/paddle-trt/trt_dynamic_shape_test.cc#L40 
-
-
-### Prediction
-```
-python3 test_client.py 000000570688.jpg
-```
--- a/python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/README_CN.md
+++ b/python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/README_CN.md
-# 使用Paddle Serving部署Faster RCNN HRNet模型
-
-(简体中文|[English](./README.md))
-
-## 获得Faster RCNN HRNet模型
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/faster_rcnn_hrnetv2p_w18_1x.tar.gz
-```
-
-
-### 启动服务
-```
-tar xf faster_rcnn_hrnetv2p_w18_1x.tar.gz
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-该模型支持TensorRT，如果想要更快的预测速度，可以开启`--use_trt`选项,但此时需要额外设置子图的TRT变长最大最小最优shape.
-请参考https://github.com/PaddlePaddle/Paddle-Inference-Demo/blob/master/c%2B%2B/paddle-trt/trt_dynamic_shape_test.cc#L40
-
-### 执行预测
-```
-python3 test_client.py 000000570688.jpg
-```
--- a/python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/label_list.txt
+++ b/python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/label_list.txt
-person
-bicycle
-car
-motorcycle
-airplane
-bus
-train
-truck
-boat
-traffic light
-fire hydrant
-stop sign
-parking meter
-bench
-bird
-cat
-dog
-horse
-sheep
-cow
-elephant
-bear
-zebra
-giraffe
-backpack
-umbrella
-handbag
-tie
-suitcase
-frisbee
-skis
-snowboard
-sports ball
-kite
-baseball bat
-baseball glove
-skateboard
-surfboard
-tennis racket
-bottle
-wine glass
-cup
-fork
-knife
-spoon
-bowl
-banana
-apple
-sandwich
-orange
-broccoli
-carrot
-hot dog
-pizza
-donut
-cake
-chair
-couch
-potted plant
-bed
-dining table
-toilet
-tv
-laptop
-mouse
-remote
-keyboard
-cell phone
-microwave
-oven
-toaster
-sink
-refrigerator
-book
-clock
-vase
-scissors
-teddy bear
-hair drier
-toothbrush
--- a/python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/test_client.py
+++ b/python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/test_client.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-import sys
-import numpy as np
-from paddle_serving_client import Client
-from paddle_serving_app.reader import *
-import cv2
-
-preprocess = DetectionSequential([
-       DetectionFile2Image(),
-       DetectionResize((800, 1333), True, interpolation=2), 
-       DetectionNormalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225], True),
-       DetectionTranspose((2,0,1)),
-       DetectionPadStride(32)
-])
-
-postprocess = RCNNPostprocess("label_list.txt", "output")
-client = Client()
-
-client.load_client_config("serving_client/serving_client_conf.prototxt")
-client.connect(['127.0.0.1:9494'])
-
-im, im_info = preprocess(sys.argv[1])
-fetch_map = client.predict(
-    feed={
-        "image": im,
-        "im_shape": np.array(list(im.shape[1:])).reshape(-1),
-        "scale_factor": im_info['scale_factor'],
-    },
-    fetch=["save_infer_model/scale_0.tmp_1"],
-    batch=False)
-print(fetch_map)
-fetch_map["image"] = sys.argv[1]
-postprocess(fetch_map)
--- a/python/examples/detection/faster_rcnn_r50_fpn_1x_coco/000000570688.jpg
+++ b/python/examples/detection/faster_rcnn_r50_fpn_1x_coco/000000570688.jpg
--- a/python/examples/detection/faster_rcnn_r50_fpn_1x_coco/README.md
+++ b/python/examples/detection/faster_rcnn_r50_fpn_1x_coco/README.md
-# Faster RCNN model on Paddle Serving
-
-([简体中文](./README_CN.md)|English)
-
-### Get The Faster RCNN Model
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/faster_rcnn_r50_fpn_1x_coco.tar
-```
-
-### Start the service
-```
-tar xf faster_rcnn_r50_fpn_1x_coco.tar
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-
-This model support TensorRT, if you want a faster inference, please use `--use_trt`. But you need to do some extra work.
-Please reference to https://github.com/PaddlePaddle/Paddle-Inference-Demo/blob/master/c%2B%2B/paddle-trt/trt_dynamic_shape_test.cc#L40 
-
-
-### Perform prediction
-```
-python3 test_client.py 000000570688.jpg
-```
-
-## 3. Result analysis
-<p align = "center">
-    <br>
-<img src = '000000570688.jpg'>
-    <br>
-<p>
-This is the input picture
-  
-<p align = "center">
-    <br>
-<img src = '000000570688_bbox.jpg'>
-    <br>
-<p>
-  
-This is the picture after adding bbox. You can see that the client has done post-processing for the picture. In addition, the output/bbox.json also has the number and coordinate information of each box.
--- a/python/examples/detection/faster_rcnn_r50_fpn_1x_coco/README_CN.md
+++ b/python/examples/detection/faster_rcnn_r50_fpn_1x_coco/README_CN.md
-# 使用Paddle Serving部署Faster RCNN模型
-
-(简体中文|[English](./README.md))
-
-## 获得Faster RCNN模型
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/faster_rcnn_r50_fpn_1x_coco.tar
-```
-
-
-### 启动服务
-```
-tar xf faster_rcnn_r50_fpn_1x_coco.tar
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-该模型支持TensorRT，如果想要更快的预测速度，可以开启`--use_trt`选项,但此时需要额外设置子图的TRT变长最大最小最优shape.
-请参考https://github.com/PaddlePaddle/Paddle-Inference-Demo/blob/master/c%2B%2B/paddle-trt/trt_dynamic_shape_test.cc#L40
-
-### 执行预测
-```
-python3 test_client.py 000000570688.jpg
-```
-
-## 3. 结果分析
-<p align="center">
-    <br>
-<img src='000000570688.jpg' >
-    <br>
-<p>
-这是输入图片
-  
-<p align="center">
-    <br>
-<img src='000000570688_bbox.jpg' >
-    <br>
-<p>
-这是实现添加了bbox之后的图片，可以看到客户端已经为图片做好了后处理，此外在output/bbox.json也有各个框的编号和坐标信息。
--- a/python/examples/detection/faster_rcnn_r50_fpn_1x_coco/label_list.txt
+++ b/python/examples/detection/faster_rcnn_r50_fpn_1x_coco/label_list.txt
-person
-bicycle
-car
-motorcycle
-airplane
-bus
-train
-truck
-boat
-traffic light
-fire hydrant
-stop sign
-parking meter
-bench
-bird
-cat
-dog
-horse
-sheep
-cow
-elephant
-bear
-zebra
-giraffe
-backpack
-umbrella
-handbag
-tie
-suitcase
-frisbee
-skis
-snowboard
-sports ball
-kite
-baseball bat
-baseball glove
-skateboard
-surfboard
-tennis racket
-bottle
-wine glass
-cup
-fork
-knife
-spoon
-bowl
-banana
-apple
-sandwich
-orange
-broccoli
-carrot
-hot dog
-pizza
-donut
-cake
-chair
-couch
-potted plant
-bed
-dining table
-toilet
-tv
-laptop
-mouse
-remote
-keyboard
-cell phone
-microwave
-oven
-toaster
-sink
-refrigerator
-book
-clock
-vase
-scissors
-teddy bear
-hair drier
-toothbrush
--- a/python/examples/detection/faster_rcnn_r50_fpn_1x_coco/test_client.py
+++ b/python/examples/detection/faster_rcnn_r50_fpn_1x_coco/test_client.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-import sys
-import numpy as np
-from paddle_serving_client import Client
-from paddle_serving_app.reader import *
-import cv2
-
-preprocess = DetectionSequential([
-        DetectionFile2Image(),
-        DetectionNormalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225], True),
-        DetectionResize(
-        (800, 1333), True, interpolation=cv2.INTER_LINEAR), 
-        DetectionTranspose((2,0,1)),
-        DetectionPadStride(128)
-])
-
-postprocess = RCNNPostprocess("label_list.txt", "output")
-client = Client()
-
-client.load_client_config("serving_client/serving_client_conf.prototxt")
-client.connect(['127.0.0.1:9494'])
-
-im, im_info = preprocess(sys.argv[1])
-fetch_map = client.predict(
-    feed={
-        "image": im,
-        "im_shape": np.array(list(im.shape[1:])).reshape(-1),
-        "scale_factor": im_info['scale_factor'],
-    },
-    fetch=["save_infer_model/scale_0.tmp_1"],
-    batch=False)
-fetch_map["image"] = sys.argv[1]
-postprocess(fetch_map)
--- a/python/examples/detection/fcos_dcn_r50_fpn_1x_coco/000000014439.jpg
+++ b/python/examples/detection/fcos_dcn_r50_fpn_1x_coco/000000014439.jpg
--- a/python/examples/detection/fcos_dcn_r50_fpn_1x_coco/README.md
+++ b/python/examples/detection/fcos_dcn_r50_fpn_1x_coco/README.md
-# FCOS model on Paddle Serving
-
-([简体中文](./README_CN.md)|English)
-
-### Get Model
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/fcos_dcn_r50_fpn_1x_coco.tar
-```
-
-### Start the service
-```
-tar xf fcos_dcn_r50_fpn_1x_coco.tar
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-This model support TensorRT, if you want a faster inference, please use `--use_trt`.
-
-### Perform prediction
-```
-python3 test_client.py 000000014439.jpg
-```
--- a/python/examples/detection/fcos_dcn_r50_fpn_1x_coco/README_CN.md
+++ b/python/examples/detection/fcos_dcn_r50_fpn_1x_coco/README_CN.md
-# 使用Paddle Serving部署FCOS模型
-
-(简体中文|[English](./README.md))
-
-## 获得模型
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/fcos_dcn_r50_fpn_1x_coco.tar
-```
-
-
-### 启动服务
-```
-tar xf fcos_dcn_r50_fpn_1x_coco.tar
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-
-该模型支持TensorRT，如果想要更快的预测速度，可以开启`--use_trt`选项。
-
-### 执行预测
-```
-python3 test_client.py 000000014439.jpg
-```
--- a/python/examples/detection/fcos_dcn_r50_fpn_1x_coco/label_list.txt
+++ b/python/examples/detection/fcos_dcn_r50_fpn_1x_coco/label_list.txt
-person
-bicycle
-car
-motorcycle
-airplane
-bus
-train
-truck
-boat
-traffic light
-fire hydrant
-stop sign
-parking meter
-bench
-bird
-cat
-dog
-horse
-sheep
-cow
-elephant
-bear
-zebra
-giraffe
-backpack
-umbrella
-handbag
-tie
-suitcase
-frisbee
-skis
-snowboard
-sports ball
-kite
-baseball bat
-baseball glove
-skateboard
-surfboard
-tennis racket
-bottle
-wine glass
-cup
-fork
-knife
-spoon
-bowl
-banana
-apple
-sandwich
-orange
-broccoli
-carrot
-hot dog
-pizza
-donut
-cake
-chair
-couch
-potted plant
-bed
-dining table
-toilet
-tv
-laptop
-mouse
-remote
-keyboard
-cell phone
-microwave
-oven
-toaster
-sink
-refrigerator
-book
-clock
-vase
-scissors
-teddy bear
-hair drier
-toothbrush
--- a/python/examples/detection/fcos_dcn_r50_fpn_1x_coco/test_client.py
+++ b/python/examples/detection/fcos_dcn_r50_fpn_1x_coco/test_client.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-import sys
-import numpy as np
-from paddle_serving_client import Client
-from paddle_serving_app.reader import *
-import cv2
-
-preprocess = DetectionSequential([
-        DetectionFile2Image(),
-        DetectionNormalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225], True),
-        DetectionResize(
-        (800, 1333), True, interpolation=cv2.INTER_LINEAR), 
-        DetectionTranspose((2,0,1)),
-        DetectionPadStride(128)
-])
-
-postprocess = RCNNPostprocess("label_list.txt", "output")
-client = Client()
-
-client.load_client_config("serving_client/serving_client_conf.prototxt")
-client.connect(['127.0.0.1:9494'])
-
-im, im_info = preprocess(sys.argv[1])
-fetch_map = client.predict(
-    feed={
-        "image": im,
-        "scale_factor": im_info['scale_factor'],
-    },
-    fetch=["save_infer_model/scale_0.tmp_1"],
-    batch=False)
-print(fetch_map)
-fetch_map["image"] = sys.argv[1]
-postprocess(fetch_map)
--- a/python/examples/detection/ppyolo_r50vd_dcn_1x_coco/000000570688.jpg
+++ b/python/examples/detection/ppyolo_r50vd_dcn_1x_coco/000000570688.jpg
--- a/python/examples/detection/ppyolo_r50vd_dcn_1x_coco/README.md
+++ b/python/examples/detection/ppyolo_r50vd_dcn_1x_coco/README.md
-# PP-YOLO model on Paddle Serving
-
-([简体中文](./README_CN.md)|English)
-
-### Get The Model
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/ppyolo_r50vd_dcn_1x_coco.tar
-```
-
-### Start the service
-```
-tar xf ppyolo_r50vd_dcn_1x_coco.tar
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-
-This model support TensorRT, if you want a faster inference, please use `--use_trt`.
-
-### Perform prediction
-```
-python3 test_client.py 000000570688.jpg
-```
--- a/python/examples/detection/ppyolo_r50vd_dcn_1x_coco/README_CN.md
+++ b/python/examples/detection/ppyolo_r50vd_dcn_1x_coco/README_CN.md
-# 使用Paddle Serving部署PP-YOLO模型
-
-(简体中文|[English](./README.md))
-
-## 获得模型
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/ppyolo_r50vd_dcn_1x_coco.tar
-```
-
-
-### 启动服务
-```
-tar xf ppyolo_r50vd_dcn_1x_coco.tar
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-
-该模型支持TensorRT，如果想要更快的预测速度，可以开启`--use_trt`选项。
-
-### 执行预测
-```
-python3 test_client.py 000000570688.jpg
-```
--- a/python/examples/detection/ppyolo_r50vd_dcn_1x_coco/label_list.txt
+++ b/python/examples/detection/ppyolo_r50vd_dcn_1x_coco/label_list.txt
-person
-bicycle
-car
-motorcycle
-airplane
-bus
-train
-truck
-boat
-traffic light
-fire hydrant
-stop sign
-parking meter
-bench
-bird
-cat
-dog
-horse
-sheep
-cow
-elephant
-bear
-zebra
-giraffe
-backpack
-umbrella
-handbag
-tie
-suitcase
-frisbee
-skis
-snowboard
-sports ball
-kite
-baseball bat
-baseball glove
-skateboard
-surfboard
-tennis racket
-bottle
-wine glass
-cup
-fork
-knife
-spoon
-bowl
-banana
-apple
-sandwich
-orange
-broccoli
-carrot
-hot dog
-pizza
-donut
-cake
-chair
-couch
-potted plant
-bed
-dining table
-toilet
-tv
-laptop
-mouse
-remote
-keyboard
-cell phone
-microwave
-oven
-toaster
-sink
-refrigerator
-book
-clock
-vase
-scissors
-teddy bear
-hair drier
-toothbrush
--- a/python/examples/detection/ppyolo_r50vd_dcn_1x_coco/test_client.py
+++ b/python/examples/detection/ppyolo_r50vd_dcn_1x_coco/test_client.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-import sys
-import numpy as np
-from paddle_serving_client import Client
-from paddle_serving_app.reader import *
-import cv2
-
-preprocess = DetectionSequential([
-        DetectionFile2Image(),
-        DetectionNormalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225], True),
-        DetectionResize(
-        (608, 608), False, interpolation=2), 
-        DetectionTranspose((2,0,1))
-])
-
-postprocess = RCNNPostprocess("label_list.txt", "output")
-client = Client()
-
-client.load_client_config("serving_client/serving_client_conf.prototxt")
-client.connect(['127.0.0.1:9494'])
-
-im, im_info = preprocess(sys.argv[1])
-fetch_map = client.predict(
-    feed={
-        "image": im,
-        "im_shape": np.array(list(im.shape[1:])).reshape(-1),
-        "scale_factor": im_info['scale_factor'],
-    },
-    fetch=["save_infer_model/scale_0.tmp_1"],
-    batch=False)
-fetch_map["image"] = sys.argv[1]
-postprocess(fetch_map)
--- a/python/examples/detection/ssd_vgg16_300_240e_voc/000000014439.jpg
+++ b/python/examples/detection/ssd_vgg16_300_240e_voc/000000014439.jpg
--- a/python/examples/detection/ssd_vgg16_300_240e_voc/README.md
+++ b/python/examples/detection/ssd_vgg16_300_240e_voc/README.md
-# SSD model on Paddle Serving
-
-([简体中文](./README_CN.md)|English)
-
-### Get Model
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/ssd_vgg16_300_240e_voc.tar
-```
-
-### Start the service
-```
-tar xf ssd_vgg16_300_240e_voc.tar
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-This model support TensorRT, if you want a faster inference, please use `--use_trt`.
-
-### Perform prediction
-```
-python3 test_client.py 000000014439.jpg
-```
--- a/python/examples/detection/ssd_vgg16_300_240e_voc/README_CN.md
+++ b/python/examples/detection/ssd_vgg16_300_240e_voc/README_CN.md
-# 使用Paddle Serving部署SSD模型
-
-(简体中文|[English](./README.md))
-
-## 获得模型
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/ssd_vgg16_300_240e_voc.tar
-```
-
-
-### 启动服务
-```
-tar xf ssd_vgg16_300_240e_voc.tar
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-
-该模型支持TensorRT，如果想要更快的预测速度，可以开启`--use_trt`选项。
-
-### 执行预测
-```
-python3 test_client.py 000000014439.jpg
-```
--- a/python/examples/detection/ssd_vgg16_300_240e_voc/label_list.txt
+++ b/python/examples/detection/ssd_vgg16_300_240e_voc/label_list.txt
-aeroplane
-bicycle
-bird
-boat
-bottle
-bus
-car
-cat
-chair
-cow
-diningtable
-dog
-horse
-motorbike
-person
-pottedplant
-sheep
-sofa
-train
-tvmonitor
--- a/python/examples/detection/ssd_vgg16_300_240e_voc/test_client.py
+++ b/python/examples/detection/ssd_vgg16_300_240e_voc/test_client.py
-# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-#     http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-import sys
-import numpy as np
-from paddle_serving_client import Client
-from paddle_serving_app.reader import *
-import cv2
-
-preprocess = DetectionSequential([
-        DetectionFile2Image(),
-        DetectionResize(
-        (300, 300), False, interpolation=cv2.INTER_LINEAR), 
-        DetectionNormalize([104.0, 117.0, 123.0], [1.0, 1.0, 1.0], False),
-        DetectionTranspose((2,0,1)),
-])
-
-postprocess = RCNNPostprocess("label_list.txt", "output")
-client = Client()
-
-client.load_client_config("serving_client/serving_client_conf.prototxt")
-client.connect(['127.0.0.1:9494'])
-
-im, im_info = preprocess(sys.argv[1])
-fetch_map = client.predict(
-    feed={
-        "image": im,
-        "im_shape": np.array(list(im.shape[1:])).reshape(-1),
-        "scale_factor": im_info['scale_factor'],
-    },
-    fetch=["save_infer_model/scale_0.tmp_1"],
-    batch=False)
-print(fetch_map)
-fetch_map["image"] = sys.argv[1]
-postprocess(fetch_map)
--- a/python/examples/detection/ttfnet_darknet53_1x_coco/000000570688.jpg
+++ b/python/examples/detection/ttfnet_darknet53_1x_coco/000000570688.jpg
--- a/python/examples/detection/ttfnet_darknet53_1x_coco/README.md
+++ b/python/examples/detection/ttfnet_darknet53_1x_coco/README.md
-# TTF-Net model on Paddle Serving
-
-([简体中文](./README_CN.md)|English)
-
-### Get Model
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/ttfnet_darknet53_1x_coco.tar
-```
-
-### Start the service
-```
-tar xf ttfnet_darknet53_1x_coco.tar
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-This model support TensorRT, if you want a faster inference, please use `--use_trt`.
-
-### Perform prediction
-```
-python3 test_client.py 000000570688.jpg
-```
--- a/python/examples/detection/ttfnet_darknet53_1x_coco/README_CN.md
+++ b/python/examples/detection/ttfnet_darknet53_1x_coco/README_CN.md
-# 使用Paddle Serving部署TTF-Net模型
-
-(简体中文|[English](./README.md))
-
-## 获得模型
-```
-wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/ttfnet_darknet53_1x_coco.tar
-```
-
-
-### 启动服务
-```
-tar xf ttfnet_darknet53_1x_coco.tar
-python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
-```
-
-该模型支持TensorRT，如果想要更快的预测速度，可以开启`--use_trt`选项。
-
-### 执行预测
-```
-python3 test_client.py 000000570688.jpg
-```
--- a/python/examples/detection/ttfnet_darknet53_1x_coco/label_list.txt
+++ b/python/examples/detection/ttfnet_darknet53_1x_coco/label_list.txt
-person
-bicycle
-car
-motorcycle
-airplane
-bus
-train
-truck
-boat
-traffic light
-fire hydrant
-stop sign
-parking meter
-bench
-bird
-cat
-dog
-horse
-sheep
-cow
-elephant
-bear
-zebra
-giraffe
-backpack
-umbrella
-handbag
-tie
-suitcase
-frisbee
-skis
-snowboard
-sports ball
-kite
-baseball bat
-baseball glove
-skateboard
-surfboard
-tennis racket
-bottle
-wine glass
-cup
-fork
-knife
-spoon
-bowl
-banana
-apple
-sandwich
-orange
-broccoli
-carrot
-hot dog
-pizza
-donut
-cake
-chair
-couch
-potted plant
-bed
-dining table
-toilet
-tv
-laptop
-mouse
-remote
-keyboard
-cell phone
-microwave
-oven
-toaster
-sink
-refrigerator
-book
-clock
-vase
-scissors
-teddy bear
-hair drier
-toothbrush
--- a/python/examples/detection/ttfnet_darknet53_1x_coco/test_client.py
+++ b/python/examples/detection/ttfnet_darknet53_1x_coco/test_client.py
--- a/python/examples/detection/yolov3_darknet53_270e_coco/000000570688.jpg
+++ b/python/examples/detection/yolov3_darknet53_270e_coco/000000570688.jpg
--- a/python/examples/detection/yolov3_darknet53_270e_coco/README.md
+++ b/python/examples/detection/yolov3_darknet53_270e_coco/README.md
--- a/python/examples/detection/yolov3_darknet53_270e_coco/README_CN.md
+++ b/python/examples/detection/yolov3_darknet53_270e_coco/README_CN.md
--- a/python/examples/detection/yolov3_darknet53_270e_coco/label_list.txt
+++ b/python/examples/detection/yolov3_darknet53_270e_coco/label_list.txt
--- a/python/examples/detection/yolov3_darknet53_270e_coco/test_client.py
+++ b/python/examples/detection/yolov3_darknet53_270e_coco/test_client.py
--- a/python/examples/encryption/README.md
+++ b/python/examples/encryption/README.md
--- a/python/examples/encryption/README_CN.md
+++ b/python/examples/encryption/README_CN.md
--- a/python/examples/encryption/encrypt.py
+++ b/python/examples/encryption/encrypt.py
--- a/python/examples/encryption/get_data.sh
+++ b/python/examples/encryption/get_data.sh
--- a/python/examples/encryption/test_client.py
+++ b/python/examples/encryption/test_client.py
--- a/python/examples/fit_a_line/README.md
+++ b/python/examples/fit_a_line/README.md
--- a/python/examples/fit_a_line/README_CN.md
+++ b/python/examples/fit_a_line/README_CN.md
--- a/python/examples/fit_a_line/benchmark.py
+++ b/python/examples/fit_a_line/benchmark.py
--- a/python/examples/fit_a_line/benchmark.sh
+++ b/python/examples/fit_a_line/benchmark.sh
--- a/python/examples/fit_a_line/get_data.sh
+++ b/python/examples/fit_a_line/get_data.sh
--- a/python/examples/fit_a_line/local_train.py
+++ b/python/examples/fit_a_line/local_train.py
--- a/python/examples/fit_a_line/test_client.py
+++ b/python/examples/fit_a_line/test_client.py
--- a/python/examples/fit_a_line/test_httpclient.py
+++ b/python/examples/fit_a_line/test_httpclient.py
--- a/python/examples/fit_a_line/test_multi_process_client.py
+++ b/python/examples/fit_a_line/test_multi_process_client.py
--- a/python/examples/fit_a_line/test_server.py
+++ b/python/examples/fit_a_line/test_server.py
--- a/python/examples/imagenet/README.md
+++ b/python/examples/imagenet/README.md
--- a/python/examples/imagenet/README_CN.md
+++ b/python/examples/imagenet/README_CN.md
--- a/python/examples/imagenet/benchmark.py
+++ b/python/examples/imagenet/benchmark.py
--- a/python/examples/imagenet/benchmark.sh
+++ b/python/examples/imagenet/benchmark.sh
--- a/python/examples/imagenet/daisy.jpg
+++ b/python/examples/imagenet/daisy.jpg
--- a/python/examples/imagenet/data/n01440764_10026.JPEG
+++ b/python/examples/imagenet/data/n01440764_10026.JPEG
--- a/python/examples/imagenet/flower.jpg
+++ b/python/examples/imagenet/flower.jpg
--- a/python/examples/imagenet/get_model.sh
+++ b/python/examples/imagenet/get_model.sh
--- a/python/examples/imagenet/imagenet.label
+++ b/python/examples/imagenet/imagenet.label
--- a/python/examples/imagenet/resnet50_http_client.py
+++ b/python/examples/imagenet/resnet50_http_client.py
--- a/python/examples/imagenet/resnet50_rpc_client.py
+++ b/python/examples/imagenet/resnet50_rpc_client.py
--- a/python/examples/imagenet/test_image_reader.py
+++ b/python/examples/imagenet/test_image_reader.py
--- a/python/examples/imdb/README.md
+++ b/python/examples/imdb/README.md
--- a/python/examples/imdb/README_CN.md
+++ b/python/examples/imdb/README_CN.md
--- a/python/examples/imdb/abtest_client.py
+++ b/python/examples/imdb/abtest_client.py
--- a/python/examples/imdb/abtest_get_data.py
+++ b/python/examples/imdb/abtest_get_data.py
--- a/python/examples/imdb/benchmark.py
+++ b/python/examples/imdb/benchmark.py
--- a/python/examples/imdb/benchmark.sh
+++ b/python/examples/imdb/benchmark.sh
--- a/python/examples/imdb/clean_data.sh
+++ b/python/examples/imdb/clean_data.sh
--- a/python/examples/imdb/get_data.sh
+++ b/python/examples/imdb/get_data.sh
--- a/python/examples/imdb/imdb_reader.py
+++ b/python/examples/imdb/imdb_reader.py
--- a/python/examples/imdb/local_train.py
+++ b/python/examples/imdb/local_train.py
--- a/python/examples/imdb/nets.py
+++ b/python/examples/imdb/nets.py
--- a/python/examples/imdb/test_client.py
+++ b/python/examples/imdb/test_client.py
--- a/python/examples/imdb/test_http_client.py
+++ b/python/examples/imdb/test_http_client.py
--- a/python/examples/lac/README.md
+++ b/python/examples/lac/README.md
--- a/python/examples/lac/README_CN.md
+++ b/python/examples/lac/README_CN.md
--- a/python/examples/lac/benchmark.py
+++ b/python/examples/lac/benchmark.py
--- a/python/examples/lac/lac_client.py
+++ b/python/examples/lac/lac_client.py
--- a/python/examples/lac/lac_http_client.py
+++ b/python/examples/lac/lac_http_client.py
--- a/python/examples/lac/lac_reader.py
+++ b/python/examples/lac/lac_reader.py
--- a/python/examples/lac/utils.py
+++ b/python/examples/lac/utils.py
--- a/python/examples/low_precision/resnet50/README.md
+++ b/python/examples/low_precision/resnet50/README.md
--- a/python/examples/low_precision/resnet50/README_CN.md
+++ b/python/examples/low_precision/resnet50/README_CN.md
--- a/python/examples/low_precision/resnet50/daisy.jpg
+++ b/python/examples/low_precision/resnet50/daisy.jpg
--- a/python/examples/low_precision/resnet50/resnet50_client.py
+++ b/python/examples/low_precision/resnet50/resnet50_client.py
--- a/python/examples/mobilenet/README.md
+++ b/python/examples/mobilenet/README.md
--- a/python/examples/mobilenet/README_CN.md
+++ b/python/examples/mobilenet/README_CN.md
--- a/python/examples/mobilenet/daisy.jpg
+++ b/python/examples/mobilenet/daisy.jpg
--- a/python/examples/mobilenet/mobilenet_tutorial.py
+++ b/python/examples/mobilenet/mobilenet_tutorial.py
--- a/python/examples/ocr/README.md
+++ b/python/examples/ocr/README.md
--- a/python/examples/ocr/README_CN.md
+++ b/python/examples/ocr/README_CN.md
--- a/python/examples/ocr/det_debugger_server.py
+++ b/python/examples/ocr/det_debugger_server.py
--- a/python/examples/ocr/det_web_server.py
+++ b/python/examples/ocr/det_web_server.py
--- a/python/examples/ocr/imgs/1.jpg
+++ b/python/examples/ocr/imgs/1.jpg
--- a/python/examples/ocr/ocr_cpp_client.py
+++ b/python/examples/ocr/ocr_cpp_client.py
--- a/python/examples/ocr/ocr_debugger_server.py
+++ b/python/examples/ocr/ocr_debugger_server.py
--- a/python/examples/ocr/ocr_web_client.py
+++ b/python/examples/ocr/ocr_web_client.py
--- a/python/examples/ocr/ocr_web_server.py
+++ b/python/examples/ocr/ocr_web_server.py
--- a/python/examples/ocr/rec_debugger_server.py
+++ b/python/examples/ocr/rec_debugger_server.py
--- a/python/examples/ocr/rec_img/ch_doc3.jpg
+++ b/python/examples/ocr/rec_img/ch_doc3.jpg
--- a/python/examples/ocr/rec_web_client.py
+++ b/python/examples/ocr/rec_web_client.py
--- a/python/examples/ocr/rec_web_server.py
+++ b/python/examples/ocr/rec_web_server.py
--- a/python/examples/pipeline/PaddleClas/DarkNet53/README.md
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/README.md
--- a/python/examples/pipeline/PaddleClas/DarkNet53/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/README_CN.md
--- a/python/examples/pipeline/PaddleClas/DarkNet53/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/benchmark.py
--- a/python/examples/pipeline/PaddleClas/DarkNet53/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/DarkNet53/benchmark_cfg.yaml
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/benchmark_cfg.yaml
--- a/python/examples/pipeline/PaddleClas/DarkNet53/config.yml
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/config.yml
--- a/python/examples/pipeline/PaddleClas/DarkNet53/cpu_utilization.py
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/cpu_utilization.py
--- a/python/examples/pipeline/PaddleClas/DarkNet53/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/DarkNet53/get_model.sh
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/get_model.sh
--- a/python/examples/pipeline/PaddleClas/DarkNet53/imagenet.label
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/imagenet.label
--- a/python/examples/pipeline/PaddleClas/DarkNet53/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/DarkNet53/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/DarkNet53/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/DarkNet53/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/README.md
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/README.md
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/README_CN.md
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/benchmark.py
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/config.yml
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/config.yml
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/cpu_utilization.py
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/cpu_utilization.py
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/get_model.sh
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/get_model.sh
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/imagenet.label
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/imagenet.label
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/HRNet_W18_C/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/HRNet_W18_C/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/README.md
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/README.md
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/README_CN.md
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/benchmark.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/config.yml
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/config.yml
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/cpu_utilization.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/cpu_utilization.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/get_model.sh
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/get_model.sh
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/imagenet.label
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/imagenet.label
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV1/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV1/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/README.md
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/README.md
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/README_CN.md
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/benchmark.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/config.yml
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/config.yml
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/cpu_utilization.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/cpu_utilization.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/get_model.sh
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/get_model.sh
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/imagenet.label
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/imagenet.label
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV2/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV2/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/README.md
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/README.md
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/README_CN.md
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/benchmark.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/config.yml
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/config.yml
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/cpu_utilization.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/cpu_utilization.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/get_model.sh
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/get_model.sh
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/imagenet.label
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/imagenet.label
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/README.md
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/README.md
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/README_CN.md
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/benchmark.py
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/config.yml
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/config.yml
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/cpu_utilization.py
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/cpu_utilization.py
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/get_model.sh
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/get_model.sh
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/imagenet.label
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/imagenet.label
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/README.md
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/README.md
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/README_CN.md
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/benchmark.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/config.yml
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/config.yml
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/cpu_utilization.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/cpu_utilization.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/get_model.sh
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/get_model.sh
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/imagenet.label
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/imagenet.label
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/README.md
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/README.md
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/README_CN.md
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/benchmark.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/config.yml
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/config.yml
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/cpu_utilization.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/cpu_utilization.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/get_model.sh
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/get_model.sh
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/imagenet.label
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/imagenet.label
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/README.md
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/README.md
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/README_CN.md
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/benchmark.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/config.yml
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/config.yml
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/cpu_utilization.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/cpu_utilization.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/get_model.sh
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/get_model.sh
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/imagenet.label
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/imagenet.label
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_KL/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/README.md
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/README.md
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/README_CN.md
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/benchmark.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/config.yml
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/config.yml
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/cpu_utilization.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/cpu_utilization.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/get_model.sh
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/get_model.sh
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/imagenet.label
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/imagenet.label
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/ResNet50_vd_PACT/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleClas/ResNet_V2_50/README.md
+++ b/python/examples/pipeline/PaddleClas/ResNet_V2_50/README.md
--- a/python/examples/pipeline/PaddleClas/ResNet_V2_50/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/ResNet_V2_50/README_CN.md
--- a/python/examples/pipeline/PaddleClas/ResNet_V2_50/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/ResNet_V2_50/benchmark.py
--- a/python/examples/pipeline/PaddleClas/ResNet_V2_50/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/ResNet_V2_50/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/ResNet_V2_50/config.yml
+++ b/python/examples/pipeline/PaddleClas/ResNet_V2_50/config.yml
--- a/python/examples/pipeline/PaddleClas/ResNet_V2_50/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/ResNet_V2_50/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/ResNet_V2_50/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNet_V2_50/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/ResNet_V2_50/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/ResNet_V2_50/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/ResNet_V2_50/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/ResNet_V2_50/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/README.md
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/README.md
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/README_CN.md
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/README_CN.md
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/benchmark.py
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/benchmark.py
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/benchmark.sh
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/benchmark.sh
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/config.yml
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/config.yml
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/cpu_utilization.py
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/cpu_utilization.py
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/daisy.jpg
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/daisy.jpg
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/get_model.sh
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/get_model.sh
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/imagenet.label
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/imagenet.label
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/pipeline_rpc_client.py
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/pipeline_rpc_client.py
--- a/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/resnet50_web_service.py
+++ b/python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0/resnet50_web_service.py
--- a/python/examples/pipeline/PaddleDetection/faster_rcnn/000000570688.jpg
+++ b/python/examples/pipeline/PaddleDetection/faster_rcnn/000000570688.jpg
--- a/python/examples/pipeline/PaddleDetection/faster_rcnn/README.md
+++ b/python/examples/pipeline/PaddleDetection/faster_rcnn/README.md
--- a/python/examples/pipeline/PaddleDetection/faster_rcnn/benchmark.py
+++ b/python/examples/pipeline/PaddleDetection/faster_rcnn/benchmark.py
--- a/python/examples/pipeline/PaddleDetection/faster_rcnn/benchmark.sh
+++ b/python/examples/pipeline/PaddleDetection/faster_rcnn/benchmark.sh
--- a/python/examples/pipeline/PaddleDetection/faster_rcnn/benchmark_config.yaml
+++ b/python/examples/pipeline/PaddleDetection/faster_rcnn/benchmark_config.yaml
--- a/python/examples/pipeline/PaddleDetection/faster_rcnn/config.yml
+++ b/python/examples/pipeline/PaddleDetection/faster_rcnn/config.yml
--- a/python/examples/pipeline/PaddleDetection/faster_rcnn/label_list.txt
+++ b/python/examples/pipeline/PaddleDetection/faster_rcnn/label_list.txt
--- a/python/examples/pipeline/PaddleDetection/faster_rcnn/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleDetection/faster_rcnn/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleDetection/faster_rcnn/web_service.py
+++ b/python/examples/pipeline/PaddleDetection/faster_rcnn/web_service.py
--- a/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/000000570688.jpg
+++ b/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/000000570688.jpg
--- a/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/README.md
+++ b/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/README.md
--- a/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/README_CN.md
+++ b/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/README_CN.md
--- a/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/benchmark.py
+++ b/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/benchmark.py
--- a/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/benchmark.sh
+++ b/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/benchmark.sh
--- a/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/benchmark_config.yaml
+++ b/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/benchmark_config.yaml
--- a/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/config.yml
+++ b/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/config.yml
--- a/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/label_list.txt
+++ b/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/label_list.txt
--- a/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/web_service.py
+++ b/python/examples/pipeline/PaddleDetection/ppyolo_mbv3/web_service.py
--- a/python/examples/pipeline/PaddleDetection/yolov3/000000570688.jpg
+++ b/python/examples/pipeline/PaddleDetection/yolov3/000000570688.jpg
--- a/python/examples/pipeline/PaddleDetection/yolov3/README.md
+++ b/python/examples/pipeline/PaddleDetection/yolov3/README.md
--- a/python/examples/pipeline/PaddleDetection/yolov3/README_CN.md
+++ b/python/examples/pipeline/PaddleDetection/yolov3/README_CN.md
--- a/python/examples/pipeline/PaddleDetection/yolov3/benchmark.py
+++ b/python/examples/pipeline/PaddleDetection/yolov3/benchmark.py
--- a/python/examples/pipeline/PaddleDetection/yolov3/benchmark.sh
+++ b/python/examples/pipeline/PaddleDetection/yolov3/benchmark.sh
--- a/python/examples/pipeline/PaddleDetection/yolov3/benchmark_config.yaml
+++ b/python/examples/pipeline/PaddleDetection/yolov3/benchmark_config.yaml
--- a/python/examples/pipeline/PaddleDetection/yolov3/config.yml
+++ b/python/examples/pipeline/PaddleDetection/yolov3/config.yml
--- a/python/examples/pipeline/PaddleDetection/yolov3/label_list.txt
+++ b/python/examples/pipeline/PaddleDetection/yolov3/label_list.txt
--- a/python/examples/pipeline/PaddleDetection/yolov3/pipeline_http_client.py
+++ b/python/examples/pipeline/PaddleDetection/yolov3/pipeline_http_client.py
--- a/python/examples/pipeline/PaddleDetection/yolov3/web_service.py
+++ b/python/examples/pipeline/PaddleDetection/yolov3/web_service.py
--- a/python/examples/pipeline/bert/README.md
+++ b/python/examples/pipeline/bert/README.md
--- a/python/examples/pipeline/bert/README_CN.md
+++ b/python/examples/pipeline/bert/README_CN.md
--- a/python/examples/pipeline/bert/benchmark.py
+++ b/python/examples/pipeline/bert/benchmark.py
--- a/python/examples/pipeline/bert/benchmark.sh
+++ b/python/examples/pipeline/bert/benchmark.sh
--- a/python/examples/pipeline/bert/config.yml
+++ b/python/examples/pipeline/bert/config.yml
--- a/python/examples/pipeline/bert/get_data.sh
+++ b/python/examples/pipeline/bert/get_data.sh
--- a/python/examples/pipeline/bert/pipeline_rpc_client.py
+++ b/python/examples/pipeline/bert/pipeline_rpc_client.py
--- a/python/examples/pipeline/bert/web_service.py
+++ b/python/examples/pipeline/bert/web_service.py
--- a/python/examples/pipeline/imagenet/README.md
+++ b/python/examples/pipeline/imagenet/README.md
--- a/python/examples/pipeline/imagenet/README_CN.md
+++ b/python/examples/pipeline/imagenet/README_CN.md
--- a/python/examples/pipeline/imagenet/config.yml
+++ b/python/examples/pipeline/imagenet/config.yml
--- a/python/examples/pipeline/imagenet/daisy.jpg
+++ b/python/examples/pipeline/imagenet/daisy.jpg
--- a/python/examples/pipeline/imagenet/get_model.sh
+++ b/python/examples/pipeline/imagenet/get_model.sh
--- a/python/examples/pipeline/imagenet/imagenet.label
+++ b/python/examples/pipeline/imagenet/imagenet.label
--- a/python/examples/pipeline/imagenet/pipeline_rpc_client.py
+++ b/python/examples/pipeline/imagenet/pipeline_rpc_client.py
--- a/python/examples/pipeline/imagenet/resnet50_web_service.py
+++ b/python/examples/pipeline/imagenet/resnet50_web_service.py
--- a/python/examples/pipeline/imdb_model_ensemble/README.md
+++ b/python/examples/pipeline/imdb_model_ensemble/README.md
--- a/python/examples/pipeline/imdb_model_ensemble/README_CN.md
+++ b/python/examples/pipeline/imdb_model_ensemble/README_CN.md
--- a/python/examples/pipeline/imdb_model_ensemble/analyse.py
+++ b/python/examples/pipeline/imdb_model_ensemble/analyse.py
--- a/python/examples/pipeline/imdb_model_ensemble/analyse.yaml
+++ b/python/examples/pipeline/imdb_model_ensemble/analyse.yaml
--- a/python/examples/pipeline/imdb_model_ensemble/config.yml
+++ b/python/examples/pipeline/imdb_model_ensemble/config.yml
--- a/python/examples/pipeline/imdb_model_ensemble/get_data.sh
+++ b/python/examples/pipeline/imdb_model_ensemble/get_data.sh
--- a/python/examples/pipeline/imdb_model_ensemble/test_pipeline_client.py
+++ b/python/examples/pipeline/imdb_model_ensemble/test_pipeline_client.py
--- a/python/examples/pipeline/imdb_model_ensemble/test_pipeline_server.py
+++ b/python/examples/pipeline/imdb_model_ensemble/test_pipeline_server.py
--- a/python/examples/pipeline/ocr/README.md
+++ b/python/examples/pipeline/ocr/README.md
--- a/python/examples/pipeline/ocr/README_CN.md
+++ b/python/examples/pipeline/ocr/README_CN.md
--- a/python/examples/pipeline/ocr/benchmark.py
+++ b/python/examples/pipeline/ocr/benchmark.py
--- a/python/examples/pipeline/ocr/benchmark.sh
+++ b/python/examples/pipeline/ocr/benchmark.sh
--- a/python/examples/pipeline/ocr/config.yml
+++ b/python/examples/pipeline/ocr/config.yml
--- a/python/examples/pipeline/ocr/imgs/1.jpg
+++ b/python/examples/pipeline/ocr/imgs/1.jpg
--- a/python/examples/pipeline/ocr/pipeline_http_client.py
+++ b/python/examples/pipeline/ocr/pipeline_http_client.py
--- a/python/examples/pipeline/ocr/pipeline_rpc_client.py
+++ b/python/examples/pipeline/ocr/pipeline_rpc_client.py
--- a/python/examples/pipeline/ocr/web_service.py
+++ b/python/examples/pipeline/ocr/web_service.py
--- a/python/examples/pipeline/simple_web_service/README.md
+++ b/python/examples/pipeline/simple_web_service/README.md
--- a/python/examples/pipeline/simple_web_service/README_CN.md
+++ b/python/examples/pipeline/simple_web_service/README_CN.md
--- a/python/examples/pipeline/simple_web_service/benchmark.py
+++ b/python/examples/pipeline/simple_web_service/benchmark.py
--- a/python/examples/pipeline/simple_web_service/benchmark.sh
+++ b/python/examples/pipeline/simple_web_service/benchmark.sh
--- a/python/examples/pipeline/simple_web_service/config.yml
+++ b/python/examples/pipeline/simple_web_service/config.yml
--- a/python/examples/pipeline/simple_web_service/get_data.sh
+++ b/python/examples/pipeline/simple_web_service/get_data.sh
--- a/python/examples/pipeline/simple_web_service/web_service.py
+++ b/python/examples/pipeline/simple_web_service/web_service.py
--- a/python/examples/pipeline/simple_web_service/web_service_java.py
+++ b/python/examples/pipeline/simple_web_service/web_service_java.py
--- a/python/examples/resnet_v2_50/README.md
+++ b/python/examples/resnet_v2_50/README.md
--- a/python/examples/resnet_v2_50/README_CN.md
+++ b/python/examples/resnet_v2_50/README_CN.md
--- a/python/examples/resnet_v2_50/benchmark.py
+++ b/python/examples/resnet_v2_50/benchmark.py
--- a/python/examples/resnet_v2_50/benchmark.sh
+++ b/python/examples/resnet_v2_50/benchmark.sh
--- a/python/examples/resnet_v2_50/daisy.jpg
+++ b/python/examples/resnet_v2_50/daisy.jpg
--- a/python/examples/resnet_v2_50/resnet50_debug.py
+++ b/python/examples/resnet_v2_50/resnet50_debug.py
--- a/python/examples/resnet_v2_50/resnet50_v2_tutorial.py
+++ b/python/examples/resnet_v2_50/resnet50_v2_tutorial.py
--- a/python/examples/resnet_v2_50/run_benchmark.sh
+++ b/python/examples/resnet_v2_50/run_benchmark.sh
--- a/python/examples/senta/README.md
+++ b/python/examples/senta/README.md
--- a/python/examples/senta/README_CN.md
+++ b/python/examples/senta/README_CN.md
--- a/python/examples/senta/get_data.sh
+++ b/python/examples/senta/get_data.sh
--- a/python/examples/senta/senta_web_service.py
+++ b/python/examples/senta/senta_web_service.py
--- a/python/examples/unet_for_image_seg/N0060.jpg
+++ b/python/examples/unet_for_image_seg/N0060.jpg
--- a/python/examples/unet_for_image_seg/README.md
+++ b/python/examples/unet_for_image_seg/README.md
--- a/python/examples/unet_for_image_seg/README_CN.md
+++ b/python/examples/unet_for_image_seg/README_CN.md
--- a/python/examples/unet_for_image_seg/seg_client.py
+++ b/python/examples/unet_for_image_seg/seg_client.py
--- a/python/examples/unet_for_image_seg/unet_benchmark/README.md
+++ b/python/examples/unet_for_image_seg/unet_benchmark/README.md
--- a/python/examples/unet_for_image_seg/unet_benchmark/img_data/N0060.jpg
+++ b/python/examples/unet_for_image_seg/unet_benchmark/img_data/N0060.jpg
--- a/python/examples/unet_for_image_seg/unet_benchmark/launch_benckmark.sh
+++ b/python/examples/unet_for_image_seg/unet_benchmark/launch_benckmark.sh
--- a/python/examples/unet_for_image_seg/unet_benchmark/unet_benchmark.py
+++ b/python/examples/unet_for_image_seg/unet_benchmark/unet_benchmark.py
--- a/python/examples/util/README.md
+++ b/python/examples/util/README.md
--- a/python/examples/util/README_CN.md
+++ b/python/examples/util/README_CN.md
--- a/python/examples/util/get_acc.py
+++ b/python/examples/util/get_acc.py
--- a/python/examples/util/show_profile.py
+++ b/python/examples/util/show_profile.py
--- a/python/examples/util/timeline_trace.py
+++ b/python/examples/util/timeline_trace.py
--- a/python/examples/xpu/bert/README.md
+++ b/python/examples/xpu/bert/README.md
--- a/python/examples/xpu/bert/bert_client.py
+++ b/python/examples/xpu/bert/bert_client.py
--- a/python/examples/xpu/bert/chinese_bert_reader.py
+++ b/python/examples/xpu/bert/chinese_bert_reader.py
--- a/python/examples/xpu/bert/get_data.sh
+++ b/python/examples/xpu/bert/get_data.sh
--- a/python/examples/xpu/ernie/README.md
+++ b/python/examples/xpu/ernie/README.md
--- a/python/examples/xpu/ernie/chinese_ernie_reader.py
+++ b/python/examples/xpu/ernie/chinese_ernie_reader.py
--- a/python/examples/xpu/ernie/ernie_client.py
+++ b/python/examples/xpu/ernie/ernie_client.py
--- a/python/examples/xpu/ernie/get_data.sh
+++ b/python/examples/xpu/ernie/get_data.sh
--- a/python/examples/xpu/fit_a_line_xpu/README.md
+++ b/python/examples/xpu/fit_a_line_xpu/README.md
--- a/python/examples/xpu/fit_a_line_xpu/README_CN.md
+++ b/python/examples/xpu/fit_a_line_xpu/README_CN.md
--- a/python/examples/xpu/fit_a_line_xpu/benchmark.py
+++ b/python/examples/xpu/fit_a_line_xpu/benchmark.py
--- a/python/examples/xpu/fit_a_line_xpu/get_data.sh
+++ b/python/examples/xpu/fit_a_line_xpu/get_data.sh
--- a/python/examples/xpu/fit_a_line_xpu/local_train.py
+++ b/python/examples/xpu/fit_a_line_xpu/local_train.py
--- a/python/examples/xpu/fit_a_line_xpu/test_client.py
+++ b/python/examples/xpu/fit_a_line_xpu/test_client.py
--- a/python/examples/xpu/resnet_v2_50_xpu/README.md
+++ b/python/examples/xpu/resnet_v2_50_xpu/README.md
--- a/python/examples/xpu/resnet_v2_50_xpu/README_CN.md
+++ b/python/examples/xpu/resnet_v2_50_xpu/README_CN.md
--- a/python/examples/xpu/resnet_v2_50_xpu/daisy.jpg
+++ b/python/examples/xpu/resnet_v2_50_xpu/daisy.jpg
--- a/python/examples/xpu/resnet_v2_50_xpu/localpredict.py
+++ b/python/examples/xpu/resnet_v2_50_xpu/localpredict.py
--- a/python/examples/xpu/resnet_v2_50_xpu/resnet50_client.py
+++ b/python/examples/xpu/resnet_v2_50_xpu/resnet50_client.py
--- a/python/examples/xpu/vgg19/README.md
+++ b/python/examples/xpu/vgg19/README.md
--- a/python/examples/xpu/vgg19/daisy.jpg
+++ b/python/examples/xpu/vgg19/daisy.jpg
--- a/python/examples/xpu/vgg19/vgg19_client.py
+++ b/python/examples/xpu/vgg19/vgg19_client.py
--- a/python/examples/yolov4/000000570688.jpg
+++ b/python/examples/yolov4/000000570688.jpg
--- a/python/examples/yolov4/README.md
+++ b/python/examples/yolov4/README.md
--- a/python/examples/yolov4/README_CN.md
+++ b/python/examples/yolov4/README_CN.md
--- a/python/examples/yolov4/label_list.txt
+++ b/python/examples/yolov4/label_list.txt
--- a/python/examples/yolov4/test_client.py
+++ b/python/examples/yolov4/test_client.py