Unverified commit c3ebc026, authored by H huangjianhui, committed by GitHub

Delete python/examples directory

Parent f11629c8
## Examples
### Support `--use_trt`
The following models support `--use_trt`, which means you can use TensorRT to accelerate inference on CUDA 10.1 or higher.
- imagenet ResNet50/ResNet101
- detection faster_rcnn/yolov3/pp-yolo/ttf-net
## Serving Model Examples
### Models that support TensorRT (`--use_trt`)
The following models support TensorRT; `--use_trt` can be enabled to accelerate online inference. Other models must not enable it.
- imagenet ResNet50/ResNet101
- detection faster_rcnn/yolov3/pp-yolo/ttf-net
## Bert as Service
([简体中文](./README_CN.md)|English)
In this example, a BERT model is used for semantic understanding prediction; the text is represented as a vector, which can be used for further analysis and prediction.
If your Python version is 3.X, replace 'pip' in the following commands with 'pip3' and 'python' with 'python3'.
### Getting Model
Method 1:
This example uses the [BERT Chinese Model](https://www.paddlepaddle.org.cn/hubdetail?name=bert_chinese_L-12_H-768_A-12&en_category=SemanticModel) from [PaddleHub](https://github.com/PaddlePaddle/PaddleHub).
Install PaddleHub first:
```
pip3 install paddlehub
```
Then run:
```
python3 prepare_model.py 128
```
**PaddleHub only supports Python 3.5+.**
The 128 in the command above is the max_seq_len of the BERT model, i.e. the sample length after preprocessing.
The server-side config and model files are saved in the bert_seq128_model folder.
The client-side config file is saved in the bert_seq128_client folder.
Method 2:
You can also download the above model (max_seq_len=128) from BOS. After decompression, the server-side config and model files are stored in the bert_chinese_L-12_H-768_A-12_model folder, and the client-side config file is stored in the bert_chinese_L-12_H-768_A-12_client folder:
```shell
wget https://paddle-serving.bj.bcebos.com/paddle_hub_models/text/SemanticModel/bert_chinese_L-12_H-768_A-12.tar.gz
tar -xzf bert_chinese_L-12_H-768_A-12.tar.gz
mv bert_chinese_L-12_H-768_A-12_model bert_seq128_model
mv bert_chinese_L-12_H-768_A-12_client bert_seq128_client
```
If your model is bert_chinese_L-12_H-768_A-12_model, replace 'bert_seq128_model' in the following commands with 'bert_chinese_L-12_H-768_A-12_model' and 'bert_seq128_client' with 'bert_chinese_L-12_H-768_A-12_client'.
### Getting Dict and Sample Dataset
```
sh get_data.sh
```
This script downloads the Chinese dictionary file vocab.txt and the Chinese sample data data-c.txt.
### Inference Service (supports BRPC-Client, GRPC-Client, and HTTP-Client)
Start the CPU inference service:
```
python3 -m paddle_serving_server.serve --model bert_seq128_model/ --port 9292 #cpu inference service
```
Or start the GPU inference service:
```
python3 -m paddle_serving_server.serve --model bert_seq128_model/ --port 9292 --gpu_ids 0 #launch gpu inference service at GPU 0
```
### BRPC-Client Inference
Before prediction we should install paddle_serving_app, which provides data preprocessing for the BERT model.
```
pip3 install paddle_serving_app
```
Run
```
head data-c.txt | python3 bert_client.py --model bert_seq128_client/serving_client_conf.prototxt
```
The client reads data from data-c.txt and sends prediction requests; the prediction is returned as a word vector (because the vector is large, it is not printed).
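For reference, a minimal programmatic call might look like the sketch below (it mirrors bert_client.py in this directory; the endpoint 127.0.0.1:9292 matches the service started above):
```python
import numpy as np
from paddle_serving_client import Client
from paddle_serving_app.reader import ChineseBertReader

# Preprocess one sentence and send a single BRPC prediction request.
reader = ChineseBertReader({"max_seq_len": 128})
client = Client()
client.load_client_config("bert_seq128_client/serving_client_conf.prototxt")
client.connect(["127.0.0.1:9292"])

feed_dict = reader.process("需要预测的中文句子")
for key in feed_dict:
    feed_dict[key] = np.array(feed_dict[key]).reshape((128, 1))
result = client.predict(feed=feed_dict, fetch=["pooled_output"], batch=False)
```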
#### GRPC-Client/HTTP-Client
Run
```
head data-c.txt | python3 bert_httpclient.py --model bert_seq128_client/serving_client_conf.prototxt
```
## Benchmark
``` shell
bash benchmark.sh bert_seq128_model bert_seq128_client
```
The benchmark output log file is named `profile_log_bert_seq128_model`.
## Semantic Understanding Prediction Service
(Simplified Chinese|[English](./README.md))
In this example, a BERT model is used for semantic understanding prediction; the text is represented as a vector, which can be used for further analysis and prediction.
If your Python version is 3.X, replace 'pip' in the following commands with 'pip3' and 'python' with 'python3'.
### Get the Model
Method 1:
This example uses the [BERT Chinese Model](https://www.paddlepaddle.org.cn/hubdetail?name=bert_chinese_L-12_H-768_A-12&en_category=SemanticModel) from [PaddleHub](https://github.com/PaddlePaddle/PaddleHub).
Install PaddleHub first:
```
pip3 install paddlehub
```
Then run:
```
python3 prepare_model.py 128
```
The argument 128 is the max_seq_len of the BERT model, i.e. the sample length after preprocessing.
The generated server-side config and model files are stored in the bert_seq128_model folder.
The generated client-side config file is stored in the bert_seq128_client folder.
Method 2:
You can also download the above model (max_seq_len=128) directly from BOS. After decompression, the server-side config and model files are stored in the bert_chinese_L-12_H-768_A-12_model folder, and the client-side config file is stored in the bert_chinese_L-12_H-768_A-12_client folder:
```shell
wget https://paddle-serving.bj.bcebos.com/paddle_hub_models/text/SemanticModel/bert_chinese_L-12_H-768_A-12.tar.gz
tar -xzf bert_chinese_L-12_H-768_A-12.tar.gz
mv bert_chinese_L-12_H-768_A-12_model bert_seq128_model
mv bert_chinese_L-12_H-768_A-12_client bert_seq128_client
```
If you use the bert_chinese_L-12_H-768_A-12_model model, replace bert_seq128_model in the following commands with bert_chinese_L-12_H-768_A-12_model and bert_seq128_client with bert_chinese_L-12_H-768_A-12_client.
### Get the Dictionary and Sample Data
```
sh get_data.sh
```
This script downloads the Chinese dictionary vocab.txt and the Chinese sample data data-c.txt.
### Start the Inference Service (accessible via BRPC-Client, GRPC-Client, or HTTP-Client)
Start the CPU inference service:
```
python3 -m paddle_serving_server.serve --model bert_seq128_model/ --port 9292 #launch the CPU inference service
```
Or start the GPU inference service:
```
python3 -m paddle_serving_server.serve --model bert_seq128_model/ --port 9292 --gpu_ids 0 #launch the GPU inference service on GPU 0
```
### Run Prediction
Before prediction, install paddle_serving_app, which provides data preprocessing for the BERT model.
```
pip3 install paddle_serving_app
```
#### BRPC-Client
Run:
```
head data-c.txt | python3 bert_client.py --model bert_seq128_client/serving_client_conf.prototxt
```
The client reads the data in data-c.txt and sends prediction requests; the prediction result is the vector representation of the text (because the output is large, the script does not print it). The server address can be changed in the script.
#### GRPC-Client/HTTP-Client
Run:
```
head data-c.txt | python3 bert_httpclient.py --model bert_seq128_client/serving_client_conf.prototxt
```
## Benchmark
``` shell
bash benchmark.sh bert_seq128_model bert_seq128_client
```
The benchmark log file is named profile_log_bert_seq128_model.
To change the parameters of the benchmark cases, modify the configuration in benchmark.sh.
Note: do not append a '/' to the bert_seq128_model and bert_seq128_client paths; this example must be run on a GPU machine.
#coding:utf-8
# Copyright (c) 2019 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""Mask, padding and batching."""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
import numpy as np
def prepare_batch_data(insts,
total_token_num,
max_seq_len=128,
pad_id=None,
cls_id=None,
sep_id=None,
mask_id=None,
return_input_mask=True,
return_max_len=True,
return_num_token=False):
"""
1. generate Tensor of data
2. generate Tensor of position
3. generate self attention mask, [shape: batch_size * max_len * max_len]
"""
batch_src_ids = [inst[0] for inst in insts]
batch_sent_ids = [inst[1] for inst in insts]
batch_pos_ids = [inst[2] for inst in insts]
labels_list = []
# compatible with squad, whose example includes start/end positions,
# or unique id
for i in range(3, len(insts[0]), 1):
labels = [inst[i] for inst in insts]
labels = np.array(labels).astype("int64").reshape([-1, 1])
labels_list.append(labels)
out = batch_src_ids
# Second step: padding
src_id, self_input_mask = pad_batch_data(
out, pad_idx=pad_id, max_seq_len=max_seq_len, return_input_mask=True)
pos_id = pad_batch_data(
batch_pos_ids,
pad_idx=pad_id,
max_seq_len=max_seq_len,
return_pos=False,
return_input_mask=False)
sent_id = pad_batch_data(
batch_sent_ids,
pad_idx=pad_id,
max_seq_len=max_seq_len,
return_pos=False,
return_input_mask=False)
return_list = [src_id, pos_id, sent_id, self_input_mask] + labels_list
return return_list if len(return_list) > 1 else return_list[0]
def pad_batch_data(insts,
pad_idx=0,
max_seq_len=128,
return_pos=False,
return_input_mask=False,
return_max_len=False,
return_num_token=False,
return_seq_lens=False):
"""
Pad the instances to the max sequence length in batch, and generate the
corresponding position data and input mask.
"""
return_list = []
#max_len = max(len(inst) for inst in insts)
max_len = max_seq_len
# Any token included in dict can be used to pad, since the paddings' loss
# will be masked out by weights and make no effect on parameter gradients.
inst_data = np.array([
list(inst) + list([pad_idx] * (max_len - len(inst))) for inst in insts
])
return_list += [inst_data.astype("int64").reshape([-1, max_len, 1])]
# position data
if return_pos:
inst_pos = np.array([
list(range(0, len(inst))) + [pad_idx] * (max_len - len(inst))
for inst in insts
])
return_list += [inst_pos.astype("int64").reshape([-1, max_len, 1])]
if return_input_mask:
# This is used to avoid attention on paddings.
input_mask_data = np.array(
[[1] * len(inst) + [0] * (max_len - len(inst)) for inst in insts])
input_mask_data = np.expand_dims(input_mask_data, axis=-1)
return_list += [input_mask_data.astype("float32")]
if return_max_len:
return_list += [max_len]
if return_num_token:
num_token = 0
for inst in insts:
num_token += len(inst)
return_list += [num_token]
if return_seq_lens:
seq_lens = np.array([len(inst) for inst in insts])
return_list += [seq_lens.astype("int64").reshape([-1, 1])]
return return_list if len(return_list) > 1 else return_list[0]
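if __name__ == "__main__":
    # Illustrative usage sketch (added for documentation; not part of the original
    # file): pad a toy batch of two token-id sequences and build the attention mask.
    toy_batch = [[101, 2009, 102], [101, 2023, 2003, 1037, 102]]
    padded_ids, input_mask = pad_batch_data(
        toy_batch, pad_idx=0, max_seq_len=8, return_input_mask=True)
    print(padded_ids.shape)  # (2, 8, 1): every sequence padded to max_seq_len
    print(input_mask.shape)  # (2, 8, 1): 1.0 for real tokens, 0.0 for padding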
# -*- coding: utf-8 -*-
#
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
from __future__ import unicode_literals, absolute_import
import os
import sys
import time
import json
import requests
import numpy as np
from paddle_serving_client import Client
from paddle_serving_client.utils import MultiThreadRunner
from paddle_serving_client.utils import benchmark_args, show_latency
from paddle_serving_app.reader import ChineseBertReader
args = benchmark_args()
def single_func(idx, resource):
fin = open("data-c.txt")
dataset = []
for line in fin:
dataset.append(line.strip())
profile_flags = False
latency_flags = False
if os.getenv("FLAGS_profile_client"):
profile_flags = True
if os.getenv("FLAGS_serving_latency"):
latency_flags = True
latency_list = []
if args.request == "rpc":
reader = ChineseBertReader({"max_seq_len": 128})
fetch = ["pooled_output"]
client = Client()
client.load_client_config(args.model)
client.connect([resource["endpoint"][idx % len(resource["endpoint"])]])
start = time.time()
for i in range(turns):
if args.batch_size >= 1:
l_start = time.time()
feed_batch = []
b_start = time.time()
for bi in range(args.batch_size):
feed_dict = reader.process(dataset[bi])
for key in feed_dict.keys():
feed_dict[key] = np.array(feed_dict[key]).reshape(
(1, 128, 1))
feed_batch.append(feed_dict)
b_end = time.time()
if profile_flags:
sys.stderr.write(
"PROFILE\tpid:{}\tbert_pre_0:{} bert_pre_1:{}\n".format(
os.getpid(),
int(round(b_start * 1000000)),
int(round(b_end * 1000000))))
result = client.predict(
feed=feed_batch, fetch=fetch, batch=True)
l_end = time.time()
if latency_flags:
latency_list.append(l_end * 1000 - l_start * 1000)
else:
print("unsupport batch size {}".format(args.batch_size))
elif args.request == "http":
reader = ChineseBertReader({"max_seq_len": 128})
fetch = ["pooled_output"]
server = "http://" + resource["endpoint"][idx % len(resource[
"endpoint"])] + "/bert/prediction"
start = time.time()
for i in range(turns):
if args.batch_size >= 1:
l_start = time.time()
feed_batch = []
b_start = time.time()
for bi in range(args.batch_size):
feed_batch.append({"words": dataset[bi]})
req = json.dumps({"feed": feed_batch, "fetch": fetch})
b_end = time.time()
if profile_flags:
sys.stderr.write(
"PROFILE\tpid:{}\tbert_pre_0:{} bert_pre_1:{}\n".format(
os.getpid(),
int(round(b_start * 1000000)),
int(round(b_end * 1000000))))
result = requests.post(
server,
data=req,
headers={"Content-Type": "application/json"})
l_end = time.time()
if latency_flags:
latency_list.append(l_end * 1000 - l_start * 1000)
else:
print("unsupport batch size {}".format(args.batch_size))
else:
raise ValueError("not implemented {} request".format(args.request))
end = time.time()
if latency_flags:
return [[end - start], latency_list]
else:
return [[end - start]]
if __name__ == '__main__':
multi_thread_runner = MultiThreadRunner()
endpoint_list = ["127.0.0.1:9292", "127.0.0.1:9293"]
turns = 100
start = time.time()
result = multi_thread_runner.run(
single_func, args.thread, {"endpoint": endpoint_list,
"turns": turns})
end = time.time()
total_cost = end - start
avg_cost = 0
for i in range(args.thread):
avg_cost += result[0][i]
avg_cost = avg_cost / args.thread
print("total cost: {}s".format(total_cost))
print("each thread cost: {}s. ".format(avg_cost))
print("qps: {}samples/s".format(args.batch_size * args.thread * turns /
total_cost))
if os.getenv("FLAGS_serving_latency"):
show_latency(result[1])
rm profile_log*
export CUDA_VISIBLE_DEVICES=0,1
export FLAGS_profile_server=1
export FLAGS_profile_client=1
export FLAGS_serving_latency=1
gpu_id=0
#save cpu and gpu utilization log
if [ -d utilization ];then
rm -rf utilization
else
mkdir utilization
fi
#start server
$PYTHONROOT/bin/python3 -m paddle_serving_server.serve --model $1 --port 9292 --thread 4 --gpu_ids 0,1 --mem_optim --ir_optim > elog 2>&1 &
sleep 5
#warm up
$PYTHONROOT/bin/python3 benchmark.py --thread 4 --batch_size 1 --model $2/serving_client_conf.prototxt --request rpc > profile 2>&1
echo -e "import psutil\nimport time\nwhile True:\n\tcpu_res = psutil.cpu_percent()\n\twith open('cpu.txt', 'a+') as f:\n\t\tf.write(f'{cpu_res}\\\n')\n\ttime.sleep(0.1)" > cpu.py
for thread_num in 1 4 8 16
do
for batch_size in 1 4 16 64
do
job_bt=`date '+%Y%m%d%H%M%S'`
nvidia-smi --id=0 --query-compute-apps=used_memory --format=csv -lms 100 > gpu_memory_use.log 2>&1 &
nvidia-smi --id=0 --query-gpu=utilization.gpu --format=csv -lms 100 > gpu_utilization.log 2>&1 &
rm -rf cpu.txt
$PYTHONROOT/bin/python3 cpu.py &
gpu_memory_pid=$!
$PYTHONROOT/bin/python3 benchmark.py --thread $thread_num --batch_size $batch_size --model $2/serving_client_conf.prototxt --request rpc > profile 2>&1
kill `ps -ef|grep used_memory|awk '{print $2}'` > /dev/null
kill `ps -ef|grep utilization.gpu|awk '{print $2}'` > /dev/null
kill `ps -ef|grep cpu.py|awk '{print $2}'` > /dev/null
echo "model_name:" $1
echo "thread_num:" $thread_num
echo "batch_size:" $batch_size
echo "=================Done===================="
echo "model_name:$1" >> profile_log_$1
echo "batch_size:$batch_size" >> profile_log_$1
job_et=`date '+%Y%m%d%H%M%S'`
awk 'BEGIN {max = 0} {if(NR>1){if ($1 > max) max=$1}} END {print "CPU_UTILIZATION:", max}' cpu.txt >> profile_log_$1
awk 'BEGIN {max = 0} {if(NR>1){if ($1 > max) max=$1}} END {print "MAX_GPU_MEMORY:", max}' gpu_memory_use.log >> profile_log_$1
awk 'BEGIN {max = 0} {if(NR>1){if ($1 > max) max=$1}} END {print "GPU_UTILIZATION:", max}' gpu_utilization.log >> profile_log_$1
rm -rf gpu_memory_use.log gpu_utilization.log
$PYTHONROOT/bin/python3 ../util/show_profile.py profile $thread_num >> profile_log_$1
tail -n 8 profile >> profile_log_$1
echo "" >> profile_log_$1
done
done
#Split the profile log into per-case files
awk 'BEGIN{RS="\n\n"}{i++}{print > "bert_log_"i}' profile_log_$1
mkdir bert_log && mv bert_log_* bert_log
ps -ef|grep 'serving'|grep -v grep|cut -c 9-15 | xargs kill -9
export CUDA_VISIBLE_DEVICES=0,1
python -m paddle_serving_server.serve --model bert_seq20_model/ --port 9295 --thread 4 --gpu_ids 0,1 2> elog > stdlog &
export FLAGS_profile_client=1
export FLAGS_profile_server=1
sleep 5
thread_num=4
python benchmark_batch.py --thread ${thread_num} --batch_size 64 --model serving_client_conf/serving_client_conf.prototxt 2> profile
python show_profile.py profile ${thread_num}
python timeline_trace.py profile trace
# coding:utf-8
# pylint: disable=doc-string-missing
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import sys
from paddle_serving_client import Client
from paddle_serving_client.utils import benchmark_args
from paddle_serving_app.reader import ChineseBertReader
import numpy as np
args = benchmark_args()
reader = ChineseBertReader({"max_seq_len": 128})
fetch = ["pooled_output"]
endpoint_list = ['127.0.0.1:9292']
client = Client()
client.load_client_config(args.model)
client.connect(endpoint_list)
for line in sys.stdin:
feed_dict = reader.process(line)
for key in feed_dict.keys():
feed_dict[key] = np.array(feed_dict[key]).reshape((128, 1))
#print(feed_dict)
result = client.predict(feed=feed_dict, fetch=fetch, batch=False)
print(result)
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import os
import sys
from paddle_serving_server import OpMaker
from paddle_serving_server import OpSeqMaker
from paddle_serving_server import Server
op_maker = OpMaker()
read_op = op_maker.create('general_reader')
general_infer_op = op_maker.create('general_infer')
general_response_op = op_maker.create('general_response')
op_seq_maker = OpSeqMaker()
op_seq_maker.add_op(read_op)
op_seq_maker.add_op(general_infer_op)
op_seq_maker.add_op(general_response_op)
server = Server()
server.set_op_sequence(op_seq_maker.get_op_sequence())
server.set_num_threads(8)
server.set_memory_optimize(True)
server.set_gpuid(1)
server.load_model_config(sys.argv[1])
port = int(sys.argv[2])
gpuid = sys.argv[3]
server.set_gpuid(gpuid)
server.prepare_server(workdir="work_dir1", port=port, device="gpu")
server.run_server()
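# Usage sketch (illustrative; argument meanings follow the code above):
#   python3 <this script> bert_seq128_model/ 9292 0
# sys.argv[1] = server model config dir, sys.argv[2] = port, sys.argv[3] = gpu id.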
# coding:utf-8
# pylint: disable=doc-string-missing
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import sys
from paddle_serving_client import HttpClient
from paddle_serving_client.utils import benchmark_args
from paddle_serving_app.reader import ChineseBertReader
import numpy as np
args = benchmark_args()
reader = ChineseBertReader({"max_seq_len": 128})
fetch = ["pooled_output"]
endpoint_list = ['127.0.0.1:9292']
client = HttpClient()
client.load_client_config(args.model)
'''
If you want to use the GRPC client, call set_use_grpc_client(True),
or directly call client.grpc_client_predict(...).
For the HTTP client, call set_use_grpc_client(False) (the default),
or directly call client.http_client_predict(...).
'''
#client.set_use_grpc_client(True)
'''
If you want to enable the encryption module, uncomment the following line.
'''
#client.use_key("./key")
'''
If you want to enable compression, uncomment the following lines.
'''
#client.set_response_compress(True)
#client.set_request_compress(True)
'''
We recommend the Proto data format for the HTTP body: set True (the default).
If you want the JSON data format for the HTTP body, set False.
'''
#client.set_http_proto(True)
client.connect(endpoint_list)
for line in sys.stdin:
feed_dict = reader.process(line)
for key in feed_dict.keys():
feed_dict[key] = np.array(feed_dict[key]).reshape((128, 1))
#print(feed_dict)
result = client.predict(feed=feed_dict, fetch=fetch, batch=False)
print(result)
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
from batching import pad_batch_data
import tokenization
class BertReader():
def __init__(self, vocab_file="", max_seq_len=128):
self.vocab_file = vocab_file
self.tokenizer = tokenization.FullTokenizer(vocab_file=vocab_file)
self.max_seq_len = max_seq_len
self.vocab = self.tokenizer.vocab
self.pad_id = self.vocab["[PAD]"]
self.cls_id = self.vocab["[CLS]"]
self.sep_id = self.vocab["[SEP]"]
self.mask_id = self.vocab["[MASK]"]
def pad_batch(self, token_ids, text_type_ids, position_ids):
batch_token_ids = [token_ids]
batch_text_type_ids = [text_type_ids]
batch_position_ids = [position_ids]
padded_token_ids, input_mask = pad_batch_data(
batch_token_ids,
max_seq_len=self.max_seq_len,
pad_idx=self.pad_id,
return_input_mask=True)
padded_text_type_ids = pad_batch_data(
batch_text_type_ids,
max_seq_len=self.max_seq_len,
pad_idx=self.pad_id)
padded_position_ids = pad_batch_data(
batch_position_ids,
max_seq_len=self.max_seq_len,
pad_idx=self.pad_id)
return padded_token_ids, padded_position_ids, padded_text_type_ids, input_mask
def process(self, sent):
text_a = tokenization.convert_to_unicode(sent)
tokens_a = self.tokenizer.tokenize(text_a)
if len(tokens_a) > self.max_seq_len - 2:
tokens_a = tokens_a[0:(self.max_seq_len - 2)]
tokens = []
text_type_ids = []
tokens.append("[CLS]")
text_type_ids.append(0)
for token in tokens_a:
tokens.append(token)
text_type_ids.append(0)
token_ids = self.tokenizer.convert_tokens_to_ids(tokens)
position_ids = list(range(len(token_ids)))
p_token_ids, p_pos_ids, p_text_type_ids, input_mask = \
self.pad_batch(token_ids, text_type_ids, position_ids)
feed_result = {
"input_ids": p_token_ids.reshape(-1).tolist(),
"position_ids": p_pos_ids.reshape(-1).tolist(),
"segment_ids": p_text_type_ids.reshape(-1).tolist(),
"input_mask": input_mask.reshape(-1).tolist()
}
return feed_result
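if __name__ == "__main__":
    # Illustrative sketch (added for documentation; not part of the original file).
    # vocab.txt is the dictionary downloaded by get_data.sh.
    reader = BertReader(vocab_file="vocab.txt", max_seq_len=128)
    feed = reader.process("今天天气不错")
    print(sorted(feed.keys()))     # ['input_ids', 'input_mask', 'position_ids', 'segment_ids']
    print(len(feed["input_ids"]))  # 128 == max_seq_len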
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import os
import sys
from paddle_serving_server import OpMaker
from paddle_serving_server import OpSeqMaker
from paddle_serving_server import Server
op_maker = OpMaker()
read_op = op_maker.create('general_reader')
general_infer_op = op_maker.create('general_infer')
general_response_op = op_maker.create('general_response')
op_seq_maker = OpSeqMaker()
op_seq_maker.add_op(read_op)
op_seq_maker.add_op(general_infer_op)
op_seq_maker.add_op(general_response_op)
server = Server()
server.set_op_sequence(op_seq_maker.get_op_sequence())
server.set_num_threads(4)
server.load_model_config(sys.argv[1])
port = int(sys.argv[2])
server.prepare_server(workdir="work_dir1", port=port, device="cpu")
server.run_server()
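# Usage sketch (illustrative; argument meanings follow the code above):
#   python3 <this script> bert_seq128_model/ 9292
# sys.argv[1] = server model config dir, sys.argv[2] = port.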
wget https://paddle-serving.bj.bcebos.com/bert_example/data-c.txt --no-check-certificate
wget https://paddle-serving.bj.bcebos.com/bert_example/vocab.txt --no-check-certificate
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
import paddlehub as hub
import paddle.fluid as fluid
import sys
import paddle_serving_client.io as serving_io
import paddle
paddle.enable_static()
model_name = "bert_chinese_L-12_H-768_A-12"
module = hub.Module(name=model_name)
inputs, outputs, program = module.context(
trainable=True, max_seq_len=int(sys.argv[1]))
place = fluid.core_avx.CPUPlace()
exe = fluid.Executor(place)
input_ids = inputs["input_ids"]
position_ids = inputs["position_ids"]
segment_ids = inputs["segment_ids"]
input_mask = inputs["input_mask"]
pooled_output = outputs["pooled_output"]
sequence_output = outputs["sequence_output"]
feed_var_names = [
input_ids.name, position_ids.name, segment_ids.name, input_mask.name
]
target_vars = [pooled_output, sequence_output]
serving_io.save_model(
"bert_seq{}_model".format(sys.argv[1]),
"bert_seq{}_client".format(sys.argv[1]), {
"input_ids": input_ids,
"position_ids": position_ids,
"segment_ids": segment_ids,
"input_mask": input_mask,
}, {"pooled_output": pooled_output,
"sequence_output": sequence_output}, program)
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
from paddle_serving_client import Client
from paddle_serving_app.reader import ChineseBertReader
import sys
import numpy as np
client = Client()
client.load_client_config("./bert_seq32_client/serving_client_conf.prototxt")
client.connect(["127.0.0.1:9292"])
reader = ChineseBertReader({"max_seq_len": 32})
fetch = ["sequence_10", "sequence_12", "pooled_output"]
expected_shape = {
"sequence_10": (4, 32, 768),
"sequence_12": (4, 32, 768),
"pooled_output": (4, 768)
}
batch_size = 4
feed_batch = {}
batch_len = 0
for line in sys.stdin:
feed = reader.process(line)
    if batch_len == 0:
        for key in feed.keys():
            val_len = len(feed[key])
            feed_batch[key] = np.array(feed[key]).reshape((1, val_len, 1))
        batch_len += 1
        continue
    if batch_len < batch_size:
        for key in feed.keys():
            feed_batch[key] = np.concatenate([
                feed_batch[key], np.array(feed[key]).reshape((1, val_len, 1))
            ])
        batch_len += 1
    else:
        fetch_map = client.predict(feed=feed_batch, fetch=fetch)
        feed_batch = {}
        batch_len = 0
for var_name in fetch:
if fetch_map[var_name].shape != expected_shape[var_name]:
print("fetch var {} shape error.".format(var_name))
sys.exit(1)
# coding=utf-8
# Copyright 2018 The Google AI Language Team Authors.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""Tokenization classes."""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
import collections
import io
import unicodedata
import six
import sentencepiece as spm
import pickle
def convert_to_unicode(text): # pylint: disable=doc-string-with-all-args
"""Converts `text` to Unicode (if it's not already), assuming utf-8 input."""
if six.PY3:
if isinstance(text, str):
return text
elif isinstance(text, bytes):
return text.decode("utf-8", "ignore")
else:
raise ValueError("Unsupported string type: %s" % (type(text)))
elif six.PY2:
if isinstance(text, str):
return text.decode("utf-8", "ignore")
elif isinstance(text, unicode): # noqa
return text
else:
raise ValueError("Unsupported string type: %s" % (type(text)))
else:
raise ValueError("Not running on Python2 or Python 3?")
def printable_text(text): # pylint: disable=doc-string-with-all-args
"""Returns text encoded in a way suitable for print or `tf.logging`."""
# These functions want `str` for both Python2 and Python3, but in one case
# it's a Unicode string and in the other it's a byte string.
if six.PY3:
if isinstance(text, str):
return text
elif isinstance(text, bytes):
return text.decode("utf-8", "ignore")
else:
raise ValueError("Unsupported string type: %s" % (type(text)))
elif six.PY2:
if isinstance(text, str):
return text
elif isinstance(text, unicode): # noqa
return text.encode("utf-8")
else:
raise ValueError("Unsupported string type: %s" % (type(text)))
else:
raise ValueError("Not running on Python2 or Python 3?")
def load_vocab(vocab_file): # pylint: disable=doc-string-with-all-args, doc-string-with-returns
"""Loads a vocabulary file into a dictionary."""
vocab = collections.OrderedDict()
fin = io.open(vocab_file, "r", encoding="UTF-8")
for num, line in enumerate(fin):
items = convert_to_unicode(line.strip()).split("\t")
if len(items) > 2:
break
token = items[0]
index = items[1] if len(items) == 2 else num
token = token.strip()
vocab[token] = int(index)
fin.close()
return vocab
def convert_by_vocab(vocab, items):
"""Converts a sequence of [tokens|ids] using the vocab."""
output = []
for item in items:
output.append(vocab[item])
return output
def convert_tokens_to_ids(vocab, tokens):
return convert_by_vocab(vocab, tokens)
def convert_ids_to_tokens(inv_vocab, ids):
return convert_by_vocab(inv_vocab, ids)
def whitespace_tokenize(text):
"""Runs basic whitespace cleaning and splitting on a peice of text."""
text = text.strip()
if not text:
return []
tokens = text.split()
return tokens
class FullTokenizer(object):
"""Runs end-to-end tokenziation."""
def __init__(self,
vocab_file,
do_lower_case=True,
use_sentence_piece_vocab=False):
self.vocab = load_vocab(vocab_file)
self.inv_vocab = {v: k for k, v in self.vocab.items()}
self.basic_tokenizer = BasicTokenizer(do_lower_case=do_lower_case)
self.use_sentence_piece_vocab = use_sentence_piece_vocab
self.wordpiece_tokenizer = WordpieceTokenizer(
vocab=self.vocab,
use_sentence_piece_vocab=self.use_sentence_piece_vocab)
def tokenize(self, text):
split_tokens = []
for token in self.basic_tokenizer.tokenize(text):
for sub_token in self.wordpiece_tokenizer.tokenize(token):
split_tokens.append(sub_token)
return split_tokens
def convert_tokens_to_ids(self, tokens):
return convert_by_vocab(self.vocab, tokens)
def convert_ids_to_tokens(self, ids):
return convert_by_vocab(self.inv_vocab, ids)
class CharTokenizer(object):
"""Runs end-to-end tokenziation."""
def __init__(self, vocab_file, do_lower_case=True):
self.vocab = load_vocab(vocab_file)
self.inv_vocab = {v: k for k, v in self.vocab.items()}
self.wordpiece_tokenizer = WordpieceTokenizer(vocab=self.vocab)
def tokenize(self, text):
split_tokens = []
for token in text.lower().split(" "):
for sub_token in self.wordpiece_tokenizer.tokenize(token):
split_tokens.append(sub_token)
return split_tokens
def convert_tokens_to_ids(self, tokens):
return convert_by_vocab(self.vocab, tokens)
def convert_ids_to_tokens(self, ids):
return convert_by_vocab(self.inv_vocab, ids)
class WSSPTokenizer(object): # pylint: disable=doc-string-missing
def __init__(self, vocab_file, sp_model_dir, word_dict, ws=True,
lower=True):
self.vocab = load_vocab(vocab_file)
self.inv_vocab = {v: k for k, v in self.vocab.items()}
self.ws = ws
self.lower = lower
self.dict = pickle.load(open(word_dict, 'rb'))
self.sp_model = spm.SentencePieceProcessor()
self.window_size = 5
self.sp_model.Load(sp_model_dir)
def cut(self, chars): # pylint: disable=doc-string-missing
words = []
idx = 0
while idx < len(chars):
matched = False
for i in range(self.window_size, 0, -1):
cand = chars[idx:idx + i]
if cand in self.dict:
words.append(cand)
matched = True
break
if not matched:
i = 1
words.append(chars[idx])
idx += i
return words
def tokenize(self, text, unk_token="[UNK]"): # pylint: disable=doc-string-missing
text = convert_to_unicode(text)
if self.ws:
text = [s for s in self.cut(text) if s != ' ']
else:
text = text.split(' ')
if self.lower:
text = [s.lower() for s in text]
text = ' '.join(text)
tokens = self.sp_model.EncodeAsPieces(text)
in_vocab_tokens = []
for token in tokens:
if token in self.vocab:
in_vocab_tokens.append(token)
else:
in_vocab_tokens.append(unk_token)
return in_vocab_tokens
def convert_tokens_to_ids(self, tokens):
return convert_by_vocab(self.vocab, tokens)
def convert_ids_to_tokens(self, ids):
return convert_by_vocab(self.inv_vocab, ids)
class BasicTokenizer(object):
"""Runs basic tokenization (punctuation splitting, lower casing, etc.)."""
def __init__(self, do_lower_case=True):
"""Constructs a BasicTokenizer.
Args:
do_lower_case: Whether to lower case the input.
"""
self.do_lower_case = do_lower_case
def tokenize(self, text): # pylint: disable=doc-string-with-all-args, doc-string-with-returns
"""Tokenizes a piece of text."""
text = convert_to_unicode(text)
text = self._clean_text(text)
# This was added on November 1st, 2018 for the multilingual and Chinese
# models. This is also applied to the English models now, but it doesn't
# matter since the English models were not trained on any Chinese data
# and generally don't have any Chinese data in them (there are Chinese
# characters in the vocabulary because Wikipedia does have some Chinese
# words in the English Wikipedia.).
text = self._tokenize_chinese_chars(text)
orig_tokens = whitespace_tokenize(text)
split_tokens = []
for token in orig_tokens:
if self.do_lower_case:
token = token.lower()
token = self._run_strip_accents(token)
split_tokens.extend(self._run_split_on_punc(token))
output_tokens = whitespace_tokenize(" ".join(split_tokens))
return output_tokens
def _run_strip_accents(self, text):
"""Strips accents from a piece of text."""
text = unicodedata.normalize("NFD", text)
output = []
for char in text:
cat = unicodedata.category(char)
if cat == "Mn":
continue
output.append(char)
return "".join(output)
def _run_split_on_punc(self, text):
"""Splits punctuation on a piece of text."""
chars = list(text)
i = 0
start_new_word = True
output = []
while i < len(chars):
char = chars[i]
if _is_punctuation(char):
output.append([char])
start_new_word = True
else:
if start_new_word:
output.append([])
start_new_word = False
output[-1].append(char)
i += 1
return ["".join(x) for x in output]
def _tokenize_chinese_chars(self, text):
"""Adds whitespace around any CJK character."""
output = []
for char in text:
cp = ord(char)
if self._is_chinese_char(cp):
output.append(" ")
output.append(char)
output.append(" ")
else:
output.append(char)
return "".join(output)
def _is_chinese_char(self, cp):
"""Checks whether CP is the codepoint of a CJK character."""
# This defines a "chinese character" as anything in the CJK Unicode block:
# https://en.wikipedia.org/wiki/CJK_Unified_Ideographs_(Unicode_block)
#
# Note that the CJK Unicode block is NOT all Japanese and Korean characters,
# despite its name. The modern Korean Hangul alphabet is a different block,
# as is Japanese Hiragana and Katakana. Those alphabets are used to write
# space-separated words, so they are not treated specially and handled
# like the all of the other languages.
if ((cp >= 0x4E00 and cp <= 0x9FFF) or #
(cp >= 0x3400 and cp <= 0x4DBF) or #
(cp >= 0x20000 and cp <= 0x2A6DF) or #
(cp >= 0x2A700 and cp <= 0x2B73F) or #
(cp >= 0x2B740 and cp <= 0x2B81F) or #
(cp >= 0x2B820 and cp <= 0x2CEAF) or
(cp >= 0xF900 and cp <= 0xFAFF) or #
(cp >= 0x2F800 and cp <= 0x2FA1F)): #
return True
return False
def _clean_text(self, text):
"""Performs invalid character removal and whitespace cleanup on text."""
output = []
for char in text:
cp = ord(char)
if cp == 0 or cp == 0xfffd or _is_control(char):
continue
if _is_whitespace(char):
output.append(" ")
else:
output.append(char)
return "".join(output)
class WordpieceTokenizer(object):
"""Runs WordPiece tokenziation."""
def __init__(self,
vocab,
unk_token="[UNK]",
max_input_chars_per_word=100,
use_sentence_piece_vocab=False):
self.vocab = vocab
self.unk_token = unk_token
self.max_input_chars_per_word = max_input_chars_per_word
self.use_sentence_piece_vocab = use_sentence_piece_vocab
def tokenize(self, text): # pylint: disable=doc-string-with-all-args
"""Tokenizes a piece of text into its word pieces.
This uses a greedy longest-match-first algorithm to perform tokenization
using the given vocabulary.
For example:
input = "unaffable"
output = ["un", "##aff", "##able"]
Args:
text: A single token or whitespace separated tokens. This should have
                already been passed through `BasicTokenizer`.
Returns:
A list of wordpiece tokens.
"""
text = convert_to_unicode(text)
output_tokens = []
for token in whitespace_tokenize(text):
chars = list(token)
if len(chars) > self.max_input_chars_per_word:
output_tokens.append(self.unk_token)
continue
is_bad = False
start = 0
sub_tokens = []
while start < len(chars):
end = len(chars)
cur_substr = None
while start < end:
substr = "".join(chars[start:end])
if start == 0 and self.use_sentence_piece_vocab:
substr = u'\u2581' + substr
if start > 0 and not self.use_sentence_piece_vocab:
substr = "##" + substr
if substr in self.vocab:
cur_substr = substr
break
end -= 1
if cur_substr is None:
is_bad = True
break
sub_tokens.append(cur_substr)
start = end
if is_bad:
output_tokens.append(self.unk_token)
else:
output_tokens.extend(sub_tokens)
return output_tokens
def _is_whitespace(char):
"""Checks whether `chars` is a whitespace character."""
    # \t, \n, and \r are technically control characters but we treat them
# as whitespace since they are generally considered as such.
if char == " " or char == "\t" or char == "\n" or char == "\r":
return True
cat = unicodedata.category(char)
if cat == "Zs":
return True
return False
def _is_control(char):
"""Checks whether `chars` is a control character."""
# These are technically control characters but we count them as whitespace
# characters.
if char == "\t" or char == "\n" or char == "\r":
return False
cat = unicodedata.category(char)
if cat.startswith("C"):
return True
return False
def _is_punctuation(char):
"""Checks whether `chars` is a punctuation character."""
cp = ord(char)
# We treat all non-letter/number ASCII as punctuation.
# Characters such as "^", "$", and "`" are not in the Unicode
# Punctuation class but we treat them as punctuation anyways, for
# consistency.
if ((cp >= 33 and cp <= 47) or (cp >= 58 and cp <= 64) or
(cp >= 91 and cp <= 96) or (cp >= 123 and cp <= 126)):
return True
cat = unicodedata.category(char)
if cat.startswith("P"):
return True
return False
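if __name__ == "__main__":
    # Illustrative sketch (added for documentation; not part of the original file).
    # vocab.txt is the dictionary downloaded by get_data.sh.
    tokenizer = FullTokenizer(vocab_file="vocab.txt")
    tokens = tokenizer.tokenize("今天天气不错")      # CJK characters are split one per token
    token_ids = tokenizer.convert_tokens_to_ids(tokens)
    print(tokens, token_ids)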
# Blazeface
## Get Model
```
python3 -m paddle_serving_app.package --get_model blazeface
tar -xf blazeface.tar.gz
```
## RPC Service
### Start Service
```
python3 -m paddle_serving_server.serve --model serving_server --port 9494
```
### Client Prediction
```
python3 test_client.py serving_client/serving_client_conf.prototxt test.jpg
```
The results are saved in the `output` folder, including a JSON file and an image file with bounding boxes.
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
from paddle_serving_client import Client
from paddle_serving_app.reader import *
import sys
import numpy as np
from paddle_serving_app.reader import BlazeFacePostprocess
preprocess = Sequential([
File2Image(),
Normalize([104, 117, 123], [127.502231, 127.502231, 127.502231], False)
])
postprocess = BlazeFacePostprocess("label_list.txt", "output")
client = Client()
client.load_client_config(sys.argv[1])
client.connect(['127.0.0.1:9494'])
im_0 = preprocess(sys.argv[2])
tmp = Transpose((2, 0, 1))
im = tmp(im_0)
fetch_map = client.predict(
feed={"image": im}, fetch=["detection_output_0.tmp_0"])
fetch_map["image"] = sys.argv[2]
fetch_map["im_shape"] = im_0.shape
postprocess(fetch_map)
# Cascade RCNN model on Paddle Serving
([简体中文](./README_CN.md)|English)
### Get The Cascade RCNN Model
```
sh get_data.sh
```
If you want to have more detection models, please refer to [Paddle Detection Model Zoo](https://github.com/PaddlePaddle/PaddleDetection/blob/release/0.2/docs/MODEL_ZOO_cn.md)
### Start the service
```
python3 -m paddle_serving_server.serve --model serving_server --port 9292 --gpu_id 0
```
### Perform prediction
```
python3 test_client.py 000000570688.jpg
```
The image with bounding boxes and the JSON result will be saved in the `output` folder.
# Deploying the Cascade RCNN Model with Paddle Serving
(Simplified Chinese|[English](./README.md))
## Get the Cascade RCNN Model
```
sh get_data.sh
```
If you want more detection models, please refer to the [Paddle Detection Model Zoo](https://github.com/PaddlePaddle/PaddleDetection/blob/release/0.2/docs/MODEL_ZOO_cn.md).
### Start the Service
```
python3 -m paddle_serving_server.serve --model serving_server --port 9292 --gpu_id 0
```
### Perform Prediction
```
python3 test_client.py 000000570688.jpg
```
The client already post-processes the image; the `output` folder contains the JSON information for each bounding box as well as the post-processed result image.
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/cascade_mask_rcnn_r50_vd_fpn_ssld_2x_coco_serving.tar.gz
tar xf cascade_mask_rcnn_r50_vd_fpn_ssld_2x_coco_serving.tar.gz
person
bicycle
car
motorcycle
airplane
bus
train
truck
boat
traffic light
fire hydrant
stop sign
parking meter
bench
bird
cat
dog
horse
sheep
cow
elephant
bear
zebra
giraffe
backpack
umbrella
handbag
tie
suitcase
frisbee
skis
snowboard
sports ball
kite
baseball bat
baseball glove
skateboard
surfboard
tennis racket
bottle
wine glass
cup
fork
knife
spoon
bowl
banana
apple
sandwich
orange
broccoli
carrot
hot dog
pizza
donut
cake
chair
couch
potted plant
bed
dining table
toilet
tv
laptop
mouse
remote
keyboard
cell phone
microwave
oven
toaster
sink
refrigerator
book
clock
vase
scissors
teddy bear
hair drier
toothbrush
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import sys
import numpy as np
from paddle_serving_client import Client
from paddle_serving_app.reader import *
import cv2
preprocess = DetectionSequential([
DetectionFile2Image(),
DetectionResize((800, 1333), True, interpolation=2),
DetectionNormalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225], True),
DetectionTranspose((2,0,1)),
DetectionPadStride(32)
])
postprocess = RCNNPostprocess("label_list.txt", "output")
client = Client()
client.load_client_config("serving_client/serving_client_conf.prototxt")
client.connect(['127.0.0.1:9292'])
im, im_info = preprocess(sys.argv[1])
fetch_map = client.predict(
feed={
"image": im,
"im_shape": np.array(list(im.shape[1:])).reshape(-1),
"scale_factor": im_info['scale_factor'],
},
fetch=["save_infer_model/scale_0.tmp_1"],
batch=False)
print(fetch_map)
fetch_map["image"] = sys.argv[1]
postprocess(fetch_map)
## CTR Prediction Service
([简体中文](./README_CN.md)|English)
### Download the Criteo Dataset
```
sh get_data.sh
```
### Download the Inference Model
```
wget https://paddle-serving.bj.bcebos.com/criteo_ctr_example/criteo_ctr_demo_model.tar.gz
tar xf criteo_ctr_demo_model.tar.gz
mv models/ctr_client_conf .
mv models/ctr_serving_model .
```
The directories `ctr_serving_model` and `ctr_client_conf` will appear.
### Start RPC Inference Service
```
python3 -m paddle_serving_server.serve --model ctr_serving_model/ --port 9292 #CPU RPC Service
python3 -m paddle_serving_server.serve --model ctr_serving_model/ --port 9292 --gpu_ids 0 #RPC Service on GPU 0
```
### RPC Infer
```
python3 test_client.py ctr_client_conf/serving_client_conf.prototxt raw_data/part-0
```
The latency will be displayed at the end.
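As a reference, a minimal programmatic RPC client might look like the following sketch (the feed/fetch names and the CriteoDataset usage follow benchmark.py in this directory; the endpoint matches the service started above):
```python
from paddle_serving_client import Client
import criteo_reader as criteo

client = Client()
client.load_client_config("ctr_client_conf/serving_client_conf.prototxt")
client.connect(["127.0.0.1:9292"])

dataset = criteo.CriteoDataset()
dataset.setup(1000001)                      # sparse feature hashing space
reader = dataset.infer_reader(["raw_data/part-0"], 1, 100)

data = next(reader())                       # first sample of the first batch
feed_dict = {"sparse_{}".format(i - 1): data[0][i] for i in range(1, 27)}
fetch_map = client.predict(feed=feed_dict, fetch=["prob"])
print(fetch_map)
```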
## CTR Prediction Service
(Simplified Chinese|[English](./README.md))
### Get the Sample Data
```
sh get_data.sh
```
### Download the Model
```
wget https://paddle-serving.bj.bcebos.com/criteo_ctr_example/criteo_ctr_demo_model.tar.gz
tar xf criteo_ctr_demo_model.tar.gz
mv models/ctr_client_conf .
mv models/ctr_serving_model .
```
The `ctr_serving_model` and `ctr_client_conf` folders will appear in the current directory.
### Start the RPC Inference Service
```
python3 -m paddle_serving_server.serve --model ctr_serving_model/ --port 9292 #launch the CPU inference service
python3 -m paddle_serving_server.serve --model ctr_serving_model/ --port 9292 --gpu_ids 0 #launch the inference service on GPU 0
```
### Run Prediction
```
python3 test_client.py ctr_client_conf/serving_client_conf.prototxt raw_data/part-0
```
When prediction finishes, the elapsed time of the prediction process is printed.
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
import argparse
def parse_args():
parser = argparse.ArgumentParser(description="PaddlePaddle CTR example")
parser.add_argument(
'--train_data_path',
type=str,
default='./data/raw/train.txt',
help="The path of training dataset")
parser.add_argument(
'--sparse_only',
type=bool,
default=False,
help="Whether we use sparse features only")
parser.add_argument(
'--test_data_path',
type=str,
default='./data/raw/valid.txt',
help="The path of testing dataset")
parser.add_argument(
'--batch_size',
type=int,
default=1000,
help="The size of mini-batch (default:1000)")
parser.add_argument(
'--embedding_size',
type=int,
default=10,
help="The size for embedding layer (default:10)")
parser.add_argument(
'--num_passes',
type=int,
default=10,
help="The number of passes to train (default: 10)")
parser.add_argument(
'--model_output_dir',
type=str,
default='models',
help='The path for model to store (default: models)')
parser.add_argument(
'--sparse_feature_dim',
type=int,
default=1000001,
help='sparse feature hashing space for index processing')
parser.add_argument(
'--is_local',
type=int,
default=1,
help='Local train or distributed train (default: 1)')
parser.add_argument(
'--cloud_train',
type=int,
default=0,
help='Local train or distributed train on paddlecloud (default: 0)')
parser.add_argument(
'--async_mode',
action='store_true',
default=False,
help='Whether start pserver in async mode to support ASGD')
parser.add_argument(
'--no_split_var',
action='store_true',
default=False,
help='Whether split variables into blocks when update_method is pserver')
parser.add_argument(
'--role',
type=str,
default='pserver', # trainer or pserver
        help='The role of the current node, trainer or pserver (default: pserver)')
parser.add_argument(
'--endpoints',
type=str,
default='127.0.0.1:6000',
help='The pserver endpoints, like: 127.0.0.1:6000,127.0.0.1:6001')
parser.add_argument(
'--current_endpoint',
type=str,
default='127.0.0.1:6000',
        help='The current pserver endpoint (default: 127.0.0.1:6000)')
parser.add_argument(
'--trainer_id',
type=int,
default=0,
        help='The id of the current trainer (default: 0)')
parser.add_argument(
'--trainers',
type=int,
default=1,
        help='The number of trainers (default: 1)')
return parser.parse_args()
# -*- coding: utf-8 -*-
#
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
from __future__ import unicode_literals, absolute_import
import os
import sys
import time
from paddle_serving_client import Client
from paddle_serving_client.utils import MultiThreadRunner
from paddle_serving_client.utils import benchmark_args
import requests
import json
import criteo_reader as criteo
args = benchmark_args()
def single_func(idx, resource):
batch = 1
buf_size = 100
dataset = criteo.CriteoDataset()
dataset.setup(1000001)
test_filelists = [
"./raw_data/part-%d" % x for x in range(len(os.listdir("./raw_data")))
]
reader = dataset.infer_reader(test_filelists[len(test_filelists) - 40:],
batch, buf_size)
if args.request == "rpc":
fetch = ["prob"]
client = Client()
client.load_client_config(args.model)
client.connect([resource["endpoint"][idx % len(resource["endpoint"])]])
start = time.time()
for i in range(1000):
if args.batch_size == 1:
                data = next(reader())
feed_dict = {}
for i in range(1, 27):
feed_dict["sparse_{}".format(i - 1)] = data[0][i]
result = client.predict(feed=feed_dict, fetch=fetch)
else:
print("unsupport batch size {}".format(args.batch_size))
elif args.request == "http":
raise ("Not support http service.")
end = time.time()
return [[end - start]]
if __name__ == '__main__':
multi_thread_runner = MultiThreadRunner()
endpoint_list = ["127.0.0.1:9292"]
#endpoint_list = endpoint_list + endpoint_list + endpoint_list
result = multi_thread_runner.run(single_func, args.thread,
{"endpoint": endpoint_list})
#result = single_func(0, {"endpoint": endpoint_list})
avg_cost = 0
for i in range(args.thread):
avg_cost += result[0][i]
avg_cost = avg_cost / args.thread
print("average total cost {} s.".format(avg_cost))
rm profile_log
for thread_num in 1 2 4 8 16
do
$PYTHONROOT/bin/python benchmark.py --thread $thread_num --model ctr_client_conf/serving_client_conf.prototxt --request rpc > profile 2>&1
echo "========================================"
echo "batch size : $batch_size" >> profile_log
$PYTHONROOT/bin/python ../util/show_profile.py profile $thread_num >> profile_log
tail -n 1 profile >> profile_log
done
# -*- coding: utf-8 -*-
#
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
from __future__ import unicode_literals, absolute_import
import os
import sys
import time
from paddle_serving_client import Client
from paddle_serving_client.utils import MultiThreadRunner
from paddle_serving_client.utils import benchmark_args
import requests
import json
import criteo_reader as criteo
args = benchmark_args()
def single_func(idx, resource):
batch = 1
buf_size = 100
dataset = criteo.CriteoDataset()
dataset.setup(1000001)
test_filelists = [
"./raw_data/part-%d" % x for x in range(len(os.listdir("./raw_data")))
]
reader = dataset.infer_reader(test_filelists[len(test_filelists) - 40:],
batch, buf_size)
if args.request == "rpc":
fetch = ["prob"]
client = Client()
client.load_client_config(args.model)
client.connect([resource["endpoint"][idx % len(resource["endpoint"])]])
start = time.time()
for i in range(1000):
if args.batch_size >= 1:
feed_batch = []
for bi in range(args.batch_size):
feed_dict = {}
                    data = next(reader())
for i in range(1, 27):
feed_dict["sparse_{}".format(i - 1)] = data[0][i]
feed_batch.append(feed_dict)
result = client.predict(feed=feed_batch, fetch=fetch)
else:
print("unsupport batch size {}".format(args.batch_size))
elif args.request == "http":
raise ("no batch predict for http")
end = time.time()
return [[end - start]]
if __name__ == '__main__':
multi_thread_runner = MultiThreadRunner()
endpoint_list = ["127.0.0.1:9292"]
#endpoint_list = endpoint_list + endpoint_list + endpoint_list
result = multi_thread_runner.run(single_func, args.thread,
{"endpoint": endpoint_list})
#result = single_func(0, {"endpoint": endpoint_list})
avg_cost = 0
for i in range(args.thread):
avg_cost += result[0][i]
avg_cost = avg_cost / args.thread
print("average total cost {} s.".format(avg_cost))
rm profile_log
for thread_num in 1 2 4 8 16
do
for batch_size in 1 2 4 8 16 32 64 128 256 512
do
$PYTHONROOT/bin/python benchmark_batch.py --thread $thread_num --batch_size $batch_size --model serving_client_conf/serving_client_conf.prototxt --request rpc > profile 2>&1
echo "========================================"
echo "batch size : $batch_size" >> profile_log
$PYTHONROOT/bin/python ../util/show_profile.py profile $thread_num >> profile_log
tail -n 1 profile >> profile_log
done
done
rm -rf *pyc kvdb raw_data ctr_client_conf ctr_serving_model ctr_data.tar.gz *~
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/data/ctr_prediction/ctr_data.tar.gz
tar -zxvf ctr_data.tar.gz
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
from __future__ import print_function
from args import parse_args
import os
import paddle.fluid as fluid
import sys
from network_conf import dnn_model
dense_feature_dim = 13
def train():
args = parse_args()
sparse_only = args.sparse_only
if not os.path.isdir(args.model_output_dir):
os.mkdir(args.model_output_dir)
dense_input = fluid.layers.data(
name="dense_input", shape=[dense_feature_dim], dtype='float32')
sparse_input_ids = [
fluid.layers.data(
name="C" + str(i), shape=[1], lod_level=1, dtype="int64")
for i in range(1, 27)
]
label = fluid.layers.data(name='label', shape=[1], dtype='int64')
#nn_input = None if sparse_only else dense_input
nn_input = dense_input
predict_y, loss, auc_var, batch_auc_var = dnn_model(
nn_input, sparse_input_ids, label, args.embedding_size,
args.sparse_feature_dim)
optimizer = fluid.optimizer.SGD(learning_rate=1e-4)
optimizer.minimize(loss)
exe = fluid.Executor(fluid.CPUPlace())
exe.run(fluid.default_startup_program())
dataset = fluid.DatasetFactory().create_dataset("InMemoryDataset")
dataset.set_use_var([dense_input] + sparse_input_ids + [label])
python_executable = "python"
pipe_command = "{} criteo_reader.py {}".format(python_executable,
args.sparse_feature_dim)
dataset.set_pipe_command(pipe_command)
dataset.set_batch_size(128)
thread_num = 10
dataset.set_thread(thread_num)
whole_filelist = [
"raw_data/part-%d" % x for x in range(len(os.listdir("raw_data")))
]
dataset.set_filelist(whole_filelist[:thread_num])
dataset.load_into_memory()
epochs = 1
for i in range(epochs):
exe.train_from_dataset(
program=fluid.default_main_program(), dataset=dataset, debug=True)
print("epoch {} finished".format(i))
import paddle_serving_client.io as server_io
feed_var_dict = {}
for i, sparse in enumerate(sparse_input_ids):
feed_var_dict["sparse_{}".format(i)] = sparse
fetch_var_dict = {"prob": predict_y}
server_io.save_model("ctr_serving_model", "ctr_client_conf", feed_var_dict,
fetch_var_dict, fluid.default_main_program())
if __name__ == '__main__':
train()
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
import paddle.fluid as fluid
import math
def dnn_model(dense_input, sparse_inputs, label, embedding_size,
sparse_feature_dim):
def embedding_layer(input):
emb = fluid.layers.embedding(
input=input,
is_sparse=True,
is_distributed=False,
size=[sparse_feature_dim, embedding_size],
param_attr=fluid.ParamAttr(
name="SparseFeatFactors",
initializer=fluid.initializer.Uniform()))
return fluid.layers.sequence_pool(input=emb, pool_type='sum')
def mlp_input_tensor(emb_sums, dense_tensor):
if isinstance(dense_tensor, fluid.Variable):
return fluid.layers.concat(emb_sums, axis=1)
else:
return fluid.layers.concat(emb_sums + [dense_tensor], axis=1)
def mlp(mlp_input):
fc1 = fluid.layers.fc(input=mlp_input,
size=400,
act='relu',
param_attr=fluid.ParamAttr(
initializer=fluid.initializer.Normal(
scale=1 / math.sqrt(mlp_input.shape[1]))))
fc2 = fluid.layers.fc(input=fc1,
size=400,
act='relu',
param_attr=fluid.ParamAttr(
initializer=fluid.initializer.Normal(
scale=1 / math.sqrt(fc1.shape[1]))))
fc3 = fluid.layers.fc(input=fc2,
size=400,
act='relu',
param_attr=fluid.ParamAttr(
initializer=fluid.initializer.Normal(
scale=1 / math.sqrt(fc2.shape[1]))))
pre = fluid.layers.fc(input=fc3,
size=2,
act='softmax',
param_attr=fluid.ParamAttr(
initializer=fluid.initializer.Normal(
scale=1 / math.sqrt(fc3.shape[1]))))
return pre
emb_sums = list(map(embedding_layer, sparse_inputs))
mlp_in = mlp_input_tensor(emb_sums, dense_input)
predict = mlp(mlp_in)
cost = fluid.layers.cross_entropy(input=predict, label=label)
avg_cost = fluid.layers.reduce_sum(cost)
accuracy = fluid.layers.accuracy(input=predict, label=label)
auc_var, batch_auc_var, auc_states = \
fluid.layers.auc(input=predict, label=label, num_thresholds=2 ** 12, slide_steps=20)
return predict, avg_cost, auc_var, batch_auc_var
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
from paddle_serving_client import Client
import sys
import os
import time
from paddle_serving_client.metric import auc
import numpy as np
import sys
class CriteoReader(object):
def __init__(self, sparse_feature_dim):
self.cont_min_ = [0, -3, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
self.cont_max_ = [
20, 600, 100, 50, 64000, 500, 100, 50, 500, 10, 10, 10, 50
]
self.cont_diff_ = [
20, 603, 100, 50, 64000, 500, 100, 50, 500, 10, 10, 10, 50
]
self.hash_dim_ = sparse_feature_dim
# here, training data are lines with line_index < train_idx_
self.train_idx_ = 41256555
self.continuous_range_ = range(1, 14)
self.categorical_range_ = range(14, 40)
def process_line(self, line):
features = line.rstrip('\n').split('\t')
dense_feature = []
sparse_feature = []
for idx in self.continuous_range_:
if features[idx] == '':
dense_feature.append(0.0)
else:
dense_feature.append((float(features[idx]) - self.cont_min_[idx - 1]) / \
self.cont_diff_[idx - 1])
for idx in self.categorical_range_:
sparse_feature.append(
[hash(str(idx) + features[idx]) % self.hash_dim_])
return sparse_feature
py_version = sys.version_info[0]
client = Client()
client.load_client_config(sys.argv[1])
client.connect(["127.0.0.1:9292"])
reader = CriteoReader(1000001)
batch = 1
buf_size = 100
label_list = []
prob_list = []
start = time.time()
f = open(sys.argv[2], 'r')
for ei in range(10):
data = reader.process_line(f.readline())
feed_dict = {}
for i in range(1, 27):
feed_dict["sparse_{}".format(i - 1)] = np.array(data[i-1]).reshape(-1)
feed_dict["sparse_{}.lod".format(i - 1)] = [0, len(data[i-1])]
fetch_map = client.predict(feed=feed_dict, fetch=["prob"])
print(fetch_map)
end = time.time()
f.close()
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
import os
import sys
from paddle_serving_server import OpMaker
from paddle_serving_server import OpSeqMaker
from paddle_serving_server import Server
op_maker = OpMaker()
read_op = op_maker.create('general_reader')
general_infer_op = op_maker.create('general_infer')
response_op = op_maker.create('general_response')
op_seq_maker = OpSeqMaker()
op_seq_maker.add_op(read_op)
op_seq_maker.add_op(general_infer_op)
op_seq_maker.add_op(response_op)
server = Server()
server.set_op_sequence(op_seq_maker.get_op_sequence())
server.load_model_config(sys.argv[1])
server.prepare_server(workdir="work_dir1", port=9292, device="cpu")
server.run_server()
## Criteo CTR with Sparse Parameter Indexing Service
([简体中文](./README_CN.md)|English)
### Get Sample Dataset
Go to the directory `python/examples/criteo_ctr_with_cube`.
```
sh get_data.sh
```
### Download Model and Sparse Parameter Sequence Files
```
wget https://paddle-serving.bj.bcebos.com/unittest/ctr_cube_unittest.tar.gz
tar xf ctr_cube_unittest.tar.gz
mv models/ctr_client_conf ./
mv models/ctr_serving_model_kv ./
mv models/data ./cube/
```
After decompression, the model is in ./ctr_serving_model_kv and the client config is in ./ctr_client_conf.
### Start Sparse Parameter Indexing Service
```
wget https://paddle-serving.bj.bcebos.com/others/cube_app.tar.gz
tar xf cube_app.tar.gz
mv cube_app/cube* ./cube/
sh cube_prepare.sh &
```
Here, the sparse parameters of the model are served by Cube, the sparse parameter indexing service.
### Start the RPC Predictor (4 serving threads, configurable in test_server.py)
```
python3 test_server.py ctr_serving_model_kv
```
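For reference, the serving thread count comes from this line in test_server.py (the full script appears later in this example):
```python
server.set_num_threads(4)  # number of serving threads used by the RPC predictor
```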
### Run Prediction
```
python3 test_client.py ctr_client_conf/serving_client_conf.prototxt ./raw_data
```
### Benchmark
CPU: Intel(R) Xeon(R) CPU 6148 @ 2.40GHz
Model: [Criteo CTR](https://github.com/PaddlePaddle/Serving/blob/develop/python/examples/criteo_ctr_with_cube/network_conf.py)
Server core/thread num: 4/8
Run
```
bash benchmark.sh
```
Each client thread sends 1000 batches.
| client thread num | prepro (ms) | client infer (ms) | op0 (ms) | op1 (ms) | op2 (ms) | postpro (ms) | avg_latency (ms) | qps |
| ------------------ | ------ | ------------ | ------ | ----- | ------ | ------- | ----- | ----- |
| 1 | 0.035 | 1.596 | 0.021 | 0.518 | 0.0024 | 0.0025 | 6.774 | 147.7 |
| 2 | 0.034 | 1.780 | 0.027 | 0.463 | 0.0020 | 0.0023 | 6.931 | 288.3 |
| 4 | 0.038 | 2.954 | 0.025 | 0.455 | 0.0019 | 0.0027 | 8.378 | 477.5 |
| 8 | 0.044 | 8.230 | 0.028 | 0.464 | 0.0023 | 0.0034 | 14.191 | 563.8 |
| 16 | 0.048 | 21.037 | 0.028 | 0.455 | 0.0025 | 0.0041 | 27.236 | 587.5 |
The average latency of each thread:
![avg cost](../../../doc/images/criteo-cube-benchmark-avgcost.png)
The QPS of each thread:
![qps](../../../doc/images/criteo-cube-benchmark-qps.png)
## 带稀疏参数索引服务的CTR预测服务
(简体中文|[English](./README.md))
### 获取样例数据
进入目录 `python/examples/criteo_ctr_with_cube`
```
sh get_data.sh
```
### 下载模型和稀疏参数序列文件
```
wget https://paddle-serving.bj.bcebos.com/unittest/ctr_cube_unittest.tar.gz
tar xf ctr_cube_unittest.tar.gz
mv models/ctr_client_conf ./
mv models/ctr_serving_model_kv ./
mv models/data ./cube/
```
解压后,当前目录下会有ctr_serving_model_kv和ctr_client_conf文件夹。
### 启动稀疏参数索引服务
```
wget https://paddle-serving.bj.bcebos.com/others/cube_app.tar.gz
tar xf cube_app.tar.gz
mv cube_app/cube* ./cube/
sh cube_prepare.sh &
```
此处,模型当中的稀疏参数会被存放在稀疏参数索引服务Cube当中。
### 启动RPC预测服务,服务端线程数为4(可在test_server.py配置)
```
python3 test_server.py ctr_serving_model_kv
```
### 执行预测
```
python3 test_client.py ctr_client_conf/serving_client_conf.prototxt ./raw_data
```
### Benchmark
设备 :Intel(R) Xeon(R) CPU 6148 @ 2.40GHz
模型 :[Criteo CTR](https://github.com/PaddlePaddle/Serving/blob/develop/python/examples/criteo_ctr_with_cube/network_conf.py)
server core/thread num : 4/8
执行
```
bash benchmark.sh
```
客户端每个线程会发送1000个batch
| client thread num | prepro | client infer | op0 | op1 | op2 | postpro | avg_latency | qps |
| ------------------ | ------ | ------------ | ------ | ----- | ------ | ------- | ----- | ----- |
| 1 | 0.035 | 1.596 | 0.021 | 0.518 | 0.0024 | 0.0025 | 6.774 | 147.7 |
| 2 | 0.034 | 1.780 | 0.027 | 0.463 | 0.0020 | 0.0023 | 6.931 | 288.3 |
| 4 | 0.038 | 2.954 | 0.025 | 0.455 | 0.0019 | 0.0027 | 8.378 | 477.5 |
| 8 | 0.044 | 8.230 | 0.028 | 0.464 | 0.0023 | 0.0034 | 14.191 | 563.8 |
| 16 | 0.048 | 21.037 | 0.028 | 0.455 | 0.0025 | 0.0041 | 27.236 | 587.5 |
平均每个线程耗时图如下
![avg cost](../../../doc/images/criteo-cube-benchmark-avgcost.png)
每个线程QPS耗时如下
![qps](../../../doc/images/criteo-cube-benchmark-qps.png)
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
import sys
import paddle.fluid.incubate.data_generator as dg
class CriteoDataset(dg.MultiSlotDataGenerator):
def setup(self, sparse_feature_dim):
self.cont_min_ = [0, -3, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
self.cont_max_ = [
20, 600, 100, 50, 64000, 500, 100, 50, 500, 10, 10, 10, 50
]
self.cont_diff_ = [
20, 603, 100, 50, 64000, 500, 100, 50, 500, 10, 10, 10, 50
]
self.hash_dim_ = sparse_feature_dim
# here, training data are lines with line_index < train_idx_
self.train_idx_ = 41256555
self.continuous_range_ = range(1, 14)
self.categorical_range_ = range(14, 40)
def _process_line(self, line):
features = line.rstrip('\n').split('\t')
dense_feature = []
sparse_feature = []
for idx in self.continuous_range_:
if features[idx] == '':
dense_feature.append(0.0)
else:
dense_feature.append((float(features[idx]) - self.cont_min_[idx - 1]) / \
self.cont_diff_[idx - 1])
for idx in self.categorical_range_:
sparse_feature.append(
[hash(str(idx) + features[idx]) % self.hash_dim_])
return dense_feature, sparse_feature, [int(features[0])]
def infer_reader(self, filelist, batch, buf_size):
def local_iter():
for fname in filelist:
with open(fname.strip(), "r") as fin:
for line in fin:
dense_feature, sparse_feature, label = self._process_line(
line)
#yield dense_feature, sparse_feature, label
yield [dense_feature] + sparse_feature + [label]
import paddle
batch_iter = paddle.batch(
paddle.reader.shuffle(
local_iter, buf_size=buf_size),
batch_size=batch)
return batch_iter
def generate_sample(self, line):
def data_iter():
dense_feature, sparse_feature, label = self._process_line(line)
feature_name = ["dense_input"]
for idx in self.categorical_range_:
feature_name.append("C" + str(idx - 13))
feature_name.append("label")
yield zip(feature_name, [dense_feature] + sparse_feature + [label])
return data_iter
if __name__ == "__main__":
criteo_dataset = CriteoDataset()
criteo_dataset.setup(int(sys.argv[1]))
criteo_dataset.run_from_stdin()
[{
"dict_name": "test_dict",
"shard": 1,
"dup": 1,
"timeout": 200,
"retry": 3,
"backup_request": 100,
"type": "ipport_list",
"load_balancer": "rr",
"nodes": [{
"ipport_list": "list://127.0.0.1:8027"
}]
}]
--port=8027
--dict_split=1
--in_mem=true
--log_dir=./log/
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
#! /bin/bash
mkdir -p cube_model
mkdir -p cube/data
./cube/cube-builder -dict_name=test_dict -job_mode=base -last_version=0 -cur_version=0 -depend_version=0 -input_path=./cube_model -output_path=${PWD}/cube/data -shard_num=1 -only_build=false
cd cube && ./cube
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/data/ctr_prediction/ctr_data.tar.gz
tar -zxvf ctr_data.tar.gz
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
from __future__ import print_function
from args import parse_args
import os
import paddle.fluid as fluid
import paddle
import sys
from network_conf import dnn_model
dense_feature_dim = 13
paddle.enable_static()
def train():
args = parse_args()
sparse_only = args.sparse_only
if not os.path.isdir(args.model_output_dir):
os.mkdir(args.model_output_dir)
dense_input = fluid.layers.data(
name="dense_input", shape=[dense_feature_dim], dtype='float32')
sparse_input_ids = [
fluid.layers.data(
name="C" + str(i), shape=[1], lod_level=1, dtype="int64")
for i in range(1, 27)
]
label = fluid.layers.data(name='label', shape=[1], dtype='int64')
#nn_input = None if sparse_only else dense_input
nn_input = dense_input
predict_y, loss, auc_var, batch_auc_var, infer_vars = dnn_model(
nn_input, sparse_input_ids, label, args.embedding_size,
args.sparse_feature_dim)
optimizer = fluid.optimizer.SGD(learning_rate=1e-4)
optimizer.minimize(loss)
exe = fluid.Executor(fluid.CPUPlace())
exe.run(fluid.default_startup_program())
dataset = fluid.DatasetFactory().create_dataset("InMemoryDataset")
dataset.set_use_var([dense_input] + sparse_input_ids + [label])
python_executable = "python3.6"
pipe_command = "{} criteo_reader.py {}".format(python_executable,
args.sparse_feature_dim)
dataset.set_pipe_command(pipe_command)
dataset.set_batch_size(128)
thread_num = 10
dataset.set_thread(thread_num)
whole_filelist = [
"raw_data/part-%d" % x for x in range(len(os.listdir("raw_data")))
]
print(whole_filelist)
dataset.set_filelist(whole_filelist[:100])
dataset.load_into_memory()
fluid.layers.Print(auc_var)
epochs = 1
for i in range(epochs):
exe.train_from_dataset(
program=fluid.default_main_program(), dataset=dataset, debug=True)
print("epoch {} finished".format(i))
import paddle_serving_client.io as server_io
feed_var_dict = {}
feed_var_dict['dense_input'] = dense_input
for i, sparse in enumerate(sparse_input_ids):
feed_var_dict["embedding_{}.tmp_0".format(i)] = sparse
fetch_var_dict = {"prob": predict_y}
feed_kv_dict = {}
feed_kv_dict['dense_input'] = dense_input
for i, emb in enumerate(infer_vars):
feed_kv_dict["embedding_{}.tmp_0".format(i)] = emb
fetch_var_dict = {"prob": predict_y}
server_io.save_model("ctr_serving_model", "ctr_client_conf", feed_var_dict,
fetch_var_dict, fluid.default_main_program())
server_io.save_model("ctr_serving_model_kv", "ctr_client_conf_kv",
feed_kv_dict, fetch_var_dict,
fluid.default_main_program())
if __name__ == '__main__':
train()
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
import paddle.fluid as fluid
import math
def dnn_model(dense_input, sparse_inputs, label, embedding_size,
sparse_feature_dim):
def embedding_layer(input):
emb = fluid.layers.embedding(
input=input,
is_sparse=True,
is_distributed=False,
size=[sparse_feature_dim, embedding_size],
param_attr=fluid.ParamAttr(
name="SparseFeatFactors",
initializer=fluid.initializer.Uniform()))
x = fluid.layers.sequence_pool(input=emb, pool_type='sum')
return emb, x
def mlp_input_tensor(emb_sums, dense_tensor):
#if isinstance(dense_tensor, fluid.Variable):
# return fluid.layers.concat(emb_sums, axis=1)
#else:
return fluid.layers.concat(emb_sums + [dense_tensor], axis=1)
def mlp(mlp_input):
fc1 = fluid.layers.fc(input=mlp_input,
size=400,
act='relu',
param_attr=fluid.ParamAttr(
initializer=fluid.initializer.Normal(
scale=1 / math.sqrt(mlp_input.shape[1]))))
fc2 = fluid.layers.fc(input=fc1,
size=400,
act='relu',
param_attr=fluid.ParamAttr(
initializer=fluid.initializer.Normal(
scale=1 / math.sqrt(fc1.shape[1]))))
fc3 = fluid.layers.fc(input=fc2,
size=400,
act='relu',
param_attr=fluid.ParamAttr(
initializer=fluid.initializer.Normal(
scale=1 / math.sqrt(fc2.shape[1]))))
pre = fluid.layers.fc(input=fc3,
size=2,
act='softmax',
param_attr=fluid.ParamAttr(
initializer=fluid.initializer.Normal(
scale=1 / math.sqrt(fc3.shape[1]))))
return pre
emb_pair_sums = list(map(embedding_layer, sparse_inputs))
emb_sums = [x[1] for x in emb_pair_sums]
infer_vars = [x[0] for x in emb_pair_sums]
mlp_in = mlp_input_tensor(emb_sums, dense_input)
predict = mlp(mlp_in)
cost = fluid.layers.cross_entropy(input=predict, label=label)
avg_cost = fluid.layers.reduce_sum(cost)
accuracy = fluid.layers.accuracy(input=predict, label=label)
auc_var, batch_auc_var, auc_states = \
fluid.layers.auc(input=predict, label=label, num_thresholds=2 ** 12, slide_steps=20)
return predict, avg_cost, auc_var, batch_auc_var, infer_vars
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
from paddle_serving_client import Client
import sys
import os
import criteo_reader as criteo
import time
from paddle_serving_client.metric import auc
import numpy as np
py_version = sys.version_info[0]
client = Client()
client.load_client_config(sys.argv[1])
client.connect(["127.0.0.1:9292"])
batch = 1
buf_size = 100
dataset = criteo.CriteoDataset()
dataset.setup(1000001)
test_filelists = ["{}/part-0".format(sys.argv[2])]
reader = dataset.infer_reader(test_filelists, batch, buf_size)
label_list = []
prob_list = []
start = time.time()
for ei in range(100):
if py_version == 2:
data = reader().next()
else:
data = reader().__next__()
feed_dict = {}
feed_dict['dense_input'] = np.array(data[0][0]).reshape(1, len(data[0][0]))
for i in range(1, 27):
feed_dict["embedding_{}.tmp_0".format(i - 1)] = np.array(data[0][i]).reshape(len(data[0][i]))
feed_dict["embedding_{}.tmp_0.lod".format(i - 1)] = [0, len(data[0][i])]
fetch_map = client.predict(feed=feed_dict, fetch=["prob"], batch=True)
print(fetch_map)
prob_list.append(fetch_map['prob'][0][1])
label_list.append(data[0][-1][0])
end = time.time()
print(end - start)
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
import os
import sys
from paddle_serving_server import OpMaker
from paddle_serving_server import OpSeqMaker
from paddle_serving_server import Server
op_maker = OpMaker()
read_op = op_maker.create('general_reader')
general_dist_kv_infer_op = op_maker.create('general_dist_kv_infer')
response_op = op_maker.create('general_response')
op_seq_maker = OpSeqMaker()
op_seq_maker.add_op(read_op)
op_seq_maker.add_op(general_dist_kv_infer_op)
op_seq_maker.add_op(response_op)
server = Server()
server.set_op_sequence(op_seq_maker.get_op_sequence())
server.set_num_threads(4)
server.load_model_config(sys.argv[1])
server.prepare_server(
workdir="work_dir1",
port=9292,
device="cpu",
cube_conf="./cube/conf/cube.conf")
server.run_server()
# Image Segmentation
## Get Model
```
python3 -m paddle_serving_app.package --get_model deeplabv3
tar -xzvf deeplabv3.tar.gz
```
## RPC Service
### Start Service
```
python3 -m paddle_serving_server.serve --model deeplabv3_server --gpu_ids 0 --port 9494
```
### Client Prediction
```
python3 deeplabv3_client.py
```
# 图像分割
## 获取模型
```
python3 -m paddle_serving_app.package --get_model deeplabv3
tar -xzvf deeplabv3.tar.gz
```
## RPC 服务
### 启动服务端
```
python3 -m paddle_serving_server.serve --model deeplabv3_server --gpu_ids 0 --port 9494
```
### 客户端预测
```
python3 deeplabv3_client.py
```
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
from paddle_serving_client import Client
from paddle_serving_app.reader import Sequential, File2Image, Resize, Transpose, BGR2RGB, SegPostprocess
import sys
import cv2
client = Client()
client.load_client_config("deeplabv3_client/serving_client_conf.prototxt")
client.connect(["127.0.0.1:9494"])
preprocess = Sequential(
[File2Image(), Resize(
(512, 512), interpolation=cv2.INTER_LINEAR)])
postprocess = SegPostprocess(2)
filename = "N0060.jpg"
im = preprocess(filename)
fetch_map = client.predict(feed={"image": im}, fetch=["output"])
fetch_map["filename"] = filename
postprocess(fetch_map)
# Serve models from Paddle Detection
(English|[简体中文](./README_CN.md))
### Introduction
PaddleDetection is PaddlePaddle's object detection development kit, designed to help developers complete the whole workflow of building, training, optimizing, and deploying detection models faster and better. For details, see [Github](https://github.com/PaddlePaddle/PaddleDetection/tree/master)
This document describes how to deploy PaddleDetection's dynamic graph models on Paddle Serving.
PaddleDetection provides a large [Model Zoo](https://github.com/PaddlePaddle/PaddleDetection/blob/master/docs/MODEL_ZOO_cn.md); combined with the export tools, these models can be served by Paddle Serving. For the export tutorial, please refer to [Paddle Detection Export Model Tutorial (Simplified Chinese)](https://github.com/PaddlePaddle/PaddleDetection/blob/master/deploy/EXPORT_MODEL.md).
### Serving example
This folder gives several examples of deploying PaddleDetection models with Paddle Serving.
All of the examples below support TensorRT.
- [Faster RCNN](./faster_rcnn_r50_fpn_1x_coco)
- [PPYOLO](./ppyolo_r50vd_dcn_1x_coco)
- [TTFNet](./ttfnet_darknet53_1x_coco)
- [YOLOv3](./yolov3_darknet53_270e_coco)
- [HRNet](./faster_rcnn_hrnetv2p_w18_1x)
- [Fcos](./fcos_dcn_r50_fpn_1x_coco)
- [SSD](./ssd_vgg16_300_240e_voc/)
## 使用Paddle Detection模型
([English](./README.md)|简体中文)
### 简介
PaddleDetection飞桨目标检测开发套件,旨在帮助开发者更快更好地完成检测模型的组建、训练、优化及部署等全开发流程。详情参见[Github](https://github.com/PaddlePaddle/PaddleDetection/tree/master)
本文主要是介绍Paddle Detection的动态图模型在Serving上的部署。
### 导出模型
Paddle Detection提供了大量的[模型库](https://github.com/PaddlePaddle/PaddleDetection/blob/master/docs/MODEL_ZOO_cn.md), 这些模型库配合导出工具都可以得到可以用于Paddle Serving的模型。导出教程参见[Paddle Detection模型导出教程](https://github.com/PaddlePaddle/PaddleDetection/blob/master/deploy/EXPORT_MODEL.md)
### Serving示例
本文件夹下给出了多个PaddleDetection模型用于Serving的范例
- [Faster RCNN](./faster_rcnn_r50_fpn_1x_coco)
- [PPYOLO](./ppyolo_r50vd_dcn_1x_coco)
- [TTFNet](./ttfnet_darknet53_1x_coco)
- [YOLOv3](./yolov3_darknet53_270e_coco)
- [HRNet](./faster_rcnn_hrnetv2p_w18_1x)
- [Fcos](./fcos_dcn_r50_fpn_1x_coco)
- [SSD](./ssd_vgg16_300_240e_voc/)
# Faster RCNN HRNet model on Paddle Serving
([简体中文](./README_CN.md)|English)
### Get The Faster RCNN HRNet Model
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/faster_rcnn_hrnetv2p_w18_1x.tar.gz
```
### Start the service
```
tar xf faster_rcnn_hrnetv2p_w18_1x.tar.gz
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
This model supports TensorRT. If you want faster inference, enable `--use_trt`, but you also need to do some extra work: set the min/max/optimal shapes of the variable-shape inputs for the TensorRT subgraph.
Please refer to https://github.com/PaddlePaddle/Paddle-Inference-Demo/blob/master/c%2B%2B/paddle-trt/trt_dynamic_shape_test.cc#L40
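As a hedged illustration only, the sketch below shows roughly what the linked demo configures, expressed with the Paddle Inference Python API rather than C++; the tensor name `image`, the shape ranges, and the model/params paths are assumptions, and the serving launcher above does not consume this script directly.
```python
# Hedged sketch: register TensorRT dynamic-shape ranges with the Paddle Inference API.
# Tensor name "image", the shape ranges, and the file paths are illustrative assumptions.
import paddle.inference as paddle_infer

config = paddle_infer.Config("serving_server/__model__", "serving_server/__params__")
config.enable_use_gpu(2000, 0)  # 2000 MB initial GPU memory pool, GPU card 0
config.enable_tensorrt_engine(
    workspace_size=1 << 30,
    max_batch_size=1,
    min_subgraph_size=3,
    precision_mode=paddle_infer.PrecisionType.Float32,
    use_static=False,
    use_calib_mode=False)
# min / max / optimal shapes for every variable-shape input of the TRT subgraph
config.set_trt_dynamic_shape_info(
    {"image": [1, 3, 320, 320]},    # minimum shape
    {"image": [1, 3, 1344, 1344]},  # maximum shape
    {"image": [1, 3, 800, 1344]})   # optimal shape
predictor = paddle_infer.create_predictor(config)
```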
### Prediction
```
python3 test_client.py 000000570688.jpg
```
# 使用Paddle Serving部署Faster RCNN HRNet模型
(简体中文|[English](./README.md))
## 获得Faster RCNN HRNet模型
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/faster_rcnn_hrnetv2p_w18_1x.tar.gz
```
### 启动服务
```
tar xf faster_rcnn_hrnetv2p_w18_1x.tar.gz
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
该模型支持TensorRT,如果想要更快的预测速度,可以开启`--use_trt`选项,但此时需要额外设置子图的TRT变长最大最小最优shape.
请参考https://github.com/PaddlePaddle/Paddle-Inference-Demo/blob/master/c%2B%2B/paddle-trt/trt_dynamic_shape_test.cc#L40
### 执行预测
```
python3 test_client.py 000000570688.jpg
```
person
bicycle
car
motorcycle
airplane
bus
train
truck
boat
traffic light
fire hydrant
stop sign
parking meter
bench
bird
cat
dog
horse
sheep
cow
elephant
bear
zebra
giraffe
backpack
umbrella
handbag
tie
suitcase
frisbee
skis
snowboard
sports ball
kite
baseball bat
baseball glove
skateboard
surfboard
tennis racket
bottle
wine glass
cup
fork
knife
spoon
bowl
banana
apple
sandwich
orange
broccoli
carrot
hot dog
pizza
donut
cake
chair
couch
potted plant
bed
dining table
toilet
tv
laptop
mouse
remote
keyboard
cell phone
microwave
oven
toaster
sink
refrigerator
book
clock
vase
scissors
teddy bear
hair drier
toothbrush
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import sys
import numpy as np
from paddle_serving_client import Client
from paddle_serving_app.reader import *
import cv2
preprocess = DetectionSequential([
DetectionFile2Image(),
DetectionResize((800, 1333), True, interpolation=2),
DetectionNormalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225], True),
DetectionTranspose((2,0,1)),
DetectionPadStride(32)
])
postprocess = RCNNPostprocess("label_list.txt", "output")
client = Client()
client.load_client_config("serving_client/serving_client_conf.prototxt")
client.connect(['127.0.0.1:9494'])
im, im_info = preprocess(sys.argv[1])
fetch_map = client.predict(
feed={
"image": im,
"im_shape": np.array(list(im.shape[1:])).reshape(-1),
"scale_factor": im_info['scale_factor'],
},
fetch=["save_infer_model/scale_0.tmp_1"],
batch=False)
print(fetch_map)
fetch_map["image"] = sys.argv[1]
postprocess(fetch_map)
# Faster RCNN model on Paddle Serving
([简体中文](./README_CN.md)|English)
### Get The Faster RCNN Model
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/faster_rcnn_r50_fpn_1x_coco.tar
```
### Start the service
```
tar xf faster_rcnn_r50_fpn_1x_coco.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
This model supports TensorRT. If you want faster inference, enable `--use_trt`, but you also need to do some extra work: set the min/max/optimal shapes of the variable-shape inputs for the TensorRT subgraph.
Please refer to https://github.com/PaddlePaddle/Paddle-Inference-Demo/blob/master/c%2B%2B/paddle-trt/trt_dynamic_shape_test.cc#L40
### Perform prediction
```
python3 test_client.py 000000570688.jpg
```
## 3. Result analysis
<p align="center">
<br>
<img src='000000570688.jpg' >
<br>
<p>
This is the input picture.

<p align="center">
<br>
<img src='000000570688_bbox.jpg' >
<br>
<p>
This is the picture with the bounding boxes drawn on it. You can see that the client has done the post-processing for the picture. In addition, output/bbox.json contains the index and coordinate information of each box.
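As a hedged illustration of consuming that output, the snippet below reads the boxes back; the exact JSON layout written by RCNNPostprocess (field names `category_id`, `bbox`, `score`) is an assumption here, so adjust it to the actual file if it differs.
```python
# Hedged sketch: assumes output/bbox.json is a JSON list of detections, each carrying
# "category_id", "bbox" and "score" fields. These field names are assumptions.
import json

with open("output/bbox.json") as f:
    detections = json.load(f)

for det in detections:
    if det.get("score", 0.0) < 0.5:  # keep only reasonably confident boxes
        continue
    print("class {} score {:.3f} bbox {}".format(
        det.get("category_id"), det.get("score"), det.get("bbox")))
```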
# 使用Paddle Serving部署Faster RCNN模型
(简体中文|[English](./README.md))
## 获得Faster RCNN模型
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/faster_rcnn_r50_fpn_1x_coco.tar
```
### 启动服务
```
tar xf faster_rcnn_r50_fpn_1x_coco.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
该模型支持TensorRT,如果想要更快的预测速度,可以开启`--use_trt`选项,但此时需要额外设置子图的TRT变长最大最小最优shape.
请参考https://github.com/PaddlePaddle/Paddle-Inference-Demo/blob/master/c%2B%2B/paddle-trt/trt_dynamic_shape_test.cc#L40
### 执行预测
```
python3 test_client.py 000000570688.jpg
```
## 3. 结果分析
<p align="center">
<br>
<img src='000000570688.jpg' >
<br>
<p>
这是输入图片
<p align="center">
<br>
<img src='000000570688_bbox.jpg' >
<br>
<p>
这是实现添加了bbox之后的图片,可以看到客户端已经为图片做好了后处理,此外在output/bbox.json也有各个框的编号和坐标信息。
person
bicycle
car
motorcycle
airplane
bus
train
truck
boat
traffic light
fire hydrant
stop sign
parking meter
bench
bird
cat
dog
horse
sheep
cow
elephant
bear
zebra
giraffe
backpack
umbrella
handbag
tie
suitcase
frisbee
skis
snowboard
sports ball
kite
baseball bat
baseball glove
skateboard
surfboard
tennis racket
bottle
wine glass
cup
fork
knife
spoon
bowl
banana
apple
sandwich
orange
broccoli
carrot
hot dog
pizza
donut
cake
chair
couch
potted plant
bed
dining table
toilet
tv
laptop
mouse
remote
keyboard
cell phone
microwave
oven
toaster
sink
refrigerator
book
clock
vase
scissors
teddy bear
hair drier
toothbrush
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import sys
import numpy as np
from paddle_serving_client import Client
from paddle_serving_app.reader import *
import cv2
preprocess = DetectionSequential([
DetectionFile2Image(),
DetectionNormalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225], True),
DetectionResize(
(800, 1333), True, interpolation=cv2.INTER_LINEAR),
DetectionTranspose((2,0,1)),
DetectionPadStride(128)
])
postprocess = RCNNPostprocess("label_list.txt", "output")
client = Client()
client.load_client_config("serving_client/serving_client_conf.prototxt")
client.connect(['127.0.0.1:9494'])
im, im_info = preprocess(sys.argv[1])
fetch_map = client.predict(
feed={
"image": im,
"im_shape": np.array(list(im.shape[1:])).reshape(-1),
"scale_factor": im_info['scale_factor'],
},
fetch=["save_infer_model/scale_0.tmp_1"],
batch=False)
fetch_map["image"] = sys.argv[1]
postprocess(fetch_map)
# FCOS model on Paddle Serving
([简体中文](./README_CN.md)|English)
### Get Model
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/fcos_dcn_r50_fpn_1x_coco.tar
```
### Start the service
```
tar xf fcos_dcn_r50_fpn_1x_coco.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
This model supports TensorRT. If you want faster inference, enable `--use_trt`.
### Perform prediction
```
python3 test_client.py 000000014439.jpg
```
# 使用Paddle Serving部署FCOS模型
(简体中文|[English](./README.md))
## 获得模型
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/fcos_dcn_r50_fpn_1x_coco.tar
```
### 启动服务
```
tar xf fcos_dcn_r50_fpn_1x_coco.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
该模型支持TensorRT,如果想要更快的预测速度,可以开启`--use_trt`选项。
### 执行预测
```
python3 test_client.py 000000014439.jpg
```
person
bicycle
car
motorcycle
airplane
bus
train
truck
boat
traffic light
fire hydrant
stop sign
parking meter
bench
bird
cat
dog
horse
sheep
cow
elephant
bear
zebra
giraffe
backpack
umbrella
handbag
tie
suitcase
frisbee
skis
snowboard
sports ball
kite
baseball bat
baseball glove
skateboard
surfboard
tennis racket
bottle
wine glass
cup
fork
knife
spoon
bowl
banana
apple
sandwich
orange
broccoli
carrot
hot dog
pizza
donut
cake
chair
couch
potted plant
bed
dining table
toilet
tv
laptop
mouse
remote
keyboard
cell phone
microwave
oven
toaster
sink
refrigerator
book
clock
vase
scissors
teddy bear
hair drier
toothbrush
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import sys
import numpy as np
from paddle_serving_client import Client
from paddle_serving_app.reader import *
import cv2
preprocess = DetectionSequential([
DetectionFile2Image(),
DetectionNormalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225], True),
DetectionResize(
(800, 1333), True, interpolation=cv2.INTER_LINEAR),
DetectionTranspose((2,0,1)),
DetectionPadStride(128)
])
postprocess = RCNNPostprocess("label_list.txt", "output")
client = Client()
client.load_client_config("serving_client/serving_client_conf.prototxt")
client.connect(['127.0.0.1:9494'])
im, im_info = preprocess(sys.argv[1])
fetch_map = client.predict(
feed={
"image": im,
"scale_factor": im_info['scale_factor'],
},
fetch=["save_infer_model/scale_0.tmp_1"],
batch=False)
print(fetch_map)
fetch_map["image"] = sys.argv[1]
postprocess(fetch_map)
# PP-YOLO model on Paddle Serving
([简体中文](./README_CN.md)|English)
### Get The Model
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/ppyolo_r50vd_dcn_1x_coco.tar
```
### Start the service
```
tar xf ppyolo_r50vd_dcn_1x_coco.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
This model supports TensorRT. If you want faster inference, enable `--use_trt`.
### Perform prediction
```
python3 test_client.py 000000570688.jpg
```
# 使用Paddle Serving部署PP-YOLO模型
(简体中文|[English](./README.md))
## 获得模型
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/ppyolo_r50vd_dcn_1x_coco.tar
```
### 启动服务
```
tar xf ppyolo_r50vd_dcn_1x_coco.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
该模型支持TensorRT,如果想要更快的预测速度,可以开启`--use_trt`选项。
### 执行预测
```
python3 test_client.py 000000570688.jpg
```
person
bicycle
car
motorcycle
airplane
bus
train
truck
boat
traffic light
fire hydrant
stop sign
parking meter
bench
bird
cat
dog
horse
sheep
cow
elephant
bear
zebra
giraffe
backpack
umbrella
handbag
tie
suitcase
frisbee
skis
snowboard
sports ball
kite
baseball bat
baseball glove
skateboard
surfboard
tennis racket
bottle
wine glass
cup
fork
knife
spoon
bowl
banana
apple
sandwich
orange
broccoli
carrot
hot dog
pizza
donut
cake
chair
couch
potted plant
bed
dining table
toilet
tv
laptop
mouse
remote
keyboard
cell phone
microwave
oven
toaster
sink
refrigerator
book
clock
vase
scissors
teddy bear
hair drier
toothbrush
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import sys
import numpy as np
from paddle_serving_client import Client
from paddle_serving_app.reader import *
import cv2
preprocess = DetectionSequential([
DetectionFile2Image(),
DetectionNormalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225], True),
DetectionResize(
(608, 608), False, interpolation=2),
DetectionTranspose((2,0,1))
])
postprocess = RCNNPostprocess("label_list.txt", "output")
client = Client()
client.load_client_config("serving_client/serving_client_conf.prototxt")
client.connect(['127.0.0.1:9494'])
im, im_info = preprocess(sys.argv[1])
fetch_map = client.predict(
feed={
"image": im,
"im_shape": np.array(list(im.shape[1:])).reshape(-1),
"scale_factor": im_info['scale_factor'],
},
fetch=["save_infer_model/scale_0.tmp_1"],
batch=False)
fetch_map["image"] = sys.argv[1]
postprocess(fetch_map)
# SSD model on Paddle Serving
([简体中文](./README_CN.md)|English)
### Get Model
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/ssd_vgg16_300_240e_voc.tar
```
### Start the service
```
tar xf ssd_vgg16_300_240e_voc.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
This model supports TensorRT. If you want faster inference, enable `--use_trt`.
### Perform prediction
```
python3 test_client.py 000000014439.jpg
```
# 使用Paddle Serving部署SSD模型
(简体中文|[English](./README.md))
## 获得模型
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/ssd_vgg16_300_240e_voc.tar
```
### 启动服务
```
tar xf ssd_vgg16_300_240e_voc.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
该模型支持TensorRT,如果想要更快的预测速度,可以开启`--use_trt`选项。
### 执行预测
```
python3 test_client.py 000000014439.jpg
```
aeroplane
bicycle
bird
boat
bottle
bus
car
cat
chair
cow
diningtable
dog
horse
motorbike
person
pottedplant
sheep
sofa
train
tvmonitor
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import sys
import numpy as np
from paddle_serving_client import Client
from paddle_serving_app.reader import *
import cv2
preprocess = DetectionSequential([
DetectionFile2Image(),
DetectionResize(
(300, 300), False, interpolation=cv2.INTER_LINEAR),
DetectionNormalize([104.0, 117.0, 123.0], [1.0, 1.0, 1.0], False),
DetectionTranspose((2,0,1)),
])
postprocess = RCNNPostprocess("label_list.txt", "output")
client = Client()
client.load_client_config("serving_client/serving_client_conf.prototxt")
client.connect(['127.0.0.1:9494'])
im, im_info = preprocess(sys.argv[1])
fetch_map = client.predict(
feed={
"image": im,
"im_shape": np.array(list(im.shape[1:])).reshape(-1),
"scale_factor": im_info['scale_factor'],
},
fetch=["save_infer_model/scale_0.tmp_1"],
batch=False)
print(fetch_map)
fetch_map["image"] = sys.argv[1]
postprocess(fetch_map)
# TTF-Net model on Paddle Serving
([简体中文](./README_CN.md)|English)
### Get Model
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/ttfnet_darknet53_1x_coco.tar
```
### Start the service
```
tar xf ttfnet_darknet53_1x_coco.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
This model supports TensorRT. If you want faster inference, enable `--use_trt`.
### Perform prediction
```
python3 test_client.py 000000570688.jpg
```
# 使用Paddle Serving部署TTF-Net模型
(简体中文|[English](./README.md))
## 获得模型
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/ttfnet_darknet53_1x_coco.tar
```
### 启动服务
```
tar xf ttfnet_darknet53_1x_coco.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
该模型支持TensorRT,如果想要更快的预测速度,可以开启`--use_trt`选项。
### 执行预测
```
python3 test_client.py 000000570688.jpg
```
person
bicycle
car
motorcycle
airplane
bus
train
truck
boat
traffic light
fire hydrant
stop sign
parking meter
bench
bird
cat
dog
horse
sheep
cow
elephant
bear
zebra
giraffe
backpack
umbrella
handbag
tie
suitcase
frisbee
skis
snowboard
sports ball
kite
baseball bat
baseball glove
skateboard
surfboard
tennis racket
bottle
wine glass
cup
fork
knife
spoon
bowl
banana
apple
sandwich
orange
broccoli
carrot
hot dog
pizza
donut
cake
chair
couch
potted plant
bed
dining table
toilet
tv
laptop
mouse
remote
keyboard
cell phone
microwave
oven
toaster
sink
refrigerator
book
clock
vase
scissors
teddy bear
hair drier
toothbrush
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import sys
import numpy as np
from paddle_serving_client import Client
from paddle_serving_app.reader import *
import cv2
preprocess = DetectionSequential([
DetectionFile2Image(),
DetectionResize(
(512, 512), False, interpolation=cv2.INTER_LINEAR),
DetectionNormalize([123.675, 116.28, 103.53], [58.395, 57.12, 57.375], False),
DetectionTranspose((2,0,1))
])
postprocess = RCNNPostprocess("label_list.txt", "output")
client = Client()
client.load_client_config("serving_client/serving_client_conf.prototxt")
client.connect(['127.0.0.1:9494'])
im, im_info = preprocess(sys.argv[1])
fetch_map = client.predict(
feed={
"image": im,
"im_shape": np.array(list(im.shape[1:])).reshape(-1),
"scale_factor": im_info['scale_factor'],
},
fetch=["save_infer_model/scale_0.tmp_1"],
batch=False)
print(fetch_map)
# YOLOv3 model on Paddle Serving
([简体中文](./README_CN.md)|English)
### Get Model
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/yolov3_darknet53_270e_coco.tar
```
### Start the service
```
tar xf yolov3_darknet53_270e_coco.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
This model supports TensorRT. If you want faster inference, enable `--use_trt`.
### Perform prediction
```
python3 test_client.py 000000570688.jpg
```
# 使用Paddle Serving部署YOLOv3模型
(简体中文|[English](./README.md))
## 获得模型
```
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/yolov3_darknet53_270e_coco.tar
```
### 启动服务
```
tar xf yolov3_darknet53_270e_coco.tar
python3 -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
该模型支持TensorRT,如果想要更快的预测速度,可以开启`--use_trt`选项。
### 执行预测
```
python3 test_client.py 000000570688.jpg
```
person
bicycle
car
motorcycle
airplane
bus
train
truck
boat
traffic light
fire hydrant
stop sign
parking meter
bench
bird
cat
dog
horse
sheep
cow
elephant
bear
zebra
giraffe
backpack
umbrella
handbag
tie
suitcase
frisbee
skis
snowboard
sports ball
kite
baseball bat
baseball glove
skateboard
surfboard
tennis racket
bottle
wine glass
cup
fork
knife
spoon
bowl
banana
apple
sandwich
orange
broccoli
carrot
hot dog
pizza
donut
cake
chair
couch
potted plant
bed
dining table
toilet
tv
laptop
mouse
remote
keyboard
cell phone
microwave
oven
toaster
sink
refrigerator
book
clock
vase
scissors
teddy bear
hair drier
toothbrush
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import sys
import numpy as np
from paddle_serving_client import Client
from paddle_serving_app.reader import *
import cv2
preprocess = DetectionSequential([
DetectionFile2Image(),
DetectionResize(
(608, 608), False, interpolation=2),
DetectionNormalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225], True),
DetectionTranspose((2,0,1)),
])
postprocess = RCNNPostprocess("label_list.txt", "output")
client = Client()
client.load_client_config("serving_client/serving_client_conf.prototxt")
client.connect(['127.0.0.1:9494'])
im, im_info = preprocess(sys.argv[1])
fetch_map = client.predict(
feed={
"image": im,
"im_shape": np.array(list(im.shape[1:])).reshape(-1),
"scale_factor": im_info['scale_factor'],
},
fetch=["save_infer_model/scale_0.tmp_1"],
batch=False)
fetch_map["image"] = sys.argv[1]
postprocess(fetch_map)
# Encryption Model Prediction
([简体中文](README_CN.md)|English)
## Get Origin Model
This example uses the model file of the fit_a_line example as the origin model.
```
sh get_data.sh
```
## Encrypt Model
The `paddlepaddle` package is used in this example; you may need to install it first (`pip3 install paddlepaddle`).
Run [python3 encrypt.py](./encrypt.py) to encrypt the model.
[//file]:#encrypt.py
``` python
def serving_encryption():
inference_model_to_serving(
dirname="./uci_housing_model",
params_filename=None,
serving_server="encrypt_server",
serving_client="encrypt_client",
encryption=True)
```
`dirname` is the folder where the model is located. If the parameters are stored as separate files, there is no need to specify `params_filename`; if they are combined into a single `__params__` file, set `params_filename="__params__"`.
The key is stored in the `key` file; the encrypted model files and the server-side configuration are stored in the `encrypt_server` directory.
The client-side configuration files are stored in the `encrypt_client` directory.
**Notice:** When encrypted prediction is used, the model configuration and parameter folders loaded by the server and the client should be `encrypt_server/` and `encrypt_client/` respectively.
## Start Encryption Service
CPU Service
```
python3 -m paddle_serving_server.serve --model encrypt_server/ --port 9393 --use_encryption_model
```
GPU Service
```
python3 -m paddle_serving_server.serve --model encrypt_server/ --port 9393 --use_encryption_model --gpu_ids 0
```
## Prediction
```
python3 test_client.py encrypt_client/serving_client_conf.prototxt
```
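`test_client.py` keeps the key on the client side: it loads the `key` file generated by `encrypt.py` and connects with encryption enabled before predicting. A condensed sketch of that flow (the dummy 13-dimensional `x` below replaces the UCI housing test reader used in the actual script):
```python
# Condensed sketch of the encrypted-prediction client flow used by test_client.py:
# the key stays on the client; the server decrypts the model with it on connect.
from paddle_serving_client import Client

client = Client()
client.load_client_config("encrypt_client/serving_client_conf.prototxt")
client.use_key("./key")                              # key file written by encrypt.py
client.connect(["127.0.0.1:9393"], encryption=True)  # encrypted handshake
fetch_list = client.get_fetch_names()
# dummy 13-dimensional input in place of the UCI housing test reader
fetch_map = client.predict(feed={"x": [0.0] * 13}, fetch=fetch_list)
print(fetch_map)
```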
# 加密模型预测
(简体中文|[English](README.md))
## 获取明文模型
示例中使用fit_a_line示例的模型文件作为明文模型
```
sh get_data.sh
```
## 模型加密
本示例中使用了`paddlepaddle`包中的模块,需要进行下载(`pip3 install paddlepaddle`)。
运行[python3 encrypt.py](./encrypt.py)进行模型加密
[//file]:#encrypt.py
``` python
def serving_encryption():
inference_model_to_serving(
dirname="./uci_housing_model",
params_filename=None,
serving_server="encrypt_server",
serving_client="encrypt_client",
encryption=True)
```
其中dirname为模型所在的文件夹路径
当参数为离散参数时,无须指定params_filename,当参数为__params__时,需指定`params_filename="__params__"`
密钥保存在`key`文件中,加密模型文件以及server端配置文件保存在`encrypt_server`目录下,client端配置文件保存在`encrypt_client`目录下。
**注意:** 当使用加密预测时,服务端和客户端启动加载的模型配置和参数文件夹是encrypt_server/和encrypt_client/
## 启动加密预测服务
CPU预测服务
```
python3 -m paddle_serving_server.serve --model encrypt_server/ --port 9393 --use_encryption_model
```
GPU预测服务
```
python3 -m paddle_serving_server.serve --model encrypt_server/ --port 9393 --use_encryption_model --gpu_ids 0
```
## 预测
```
python3 test_client.py encrypt_client/serving_client_conf.prototxt
```
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
from paddle_serving_client.io import inference_model_to_serving
def serving_encryption():
inference_model_to_serving(
dirname="./uci_housing_model",
params_filename=None,
serving_server="encrypt_server",
serving_client="encrypt_client",
encryption=True)
if __name__ == "__main__":
serving_encryption()
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/uci_housing_example/encrypt.tar.gz
tar -xzf encrypt.tar.gz
wget --no-check-certificate https://paddle-serving.bj.bcebos.com/uci_housing.tar.gz
tar -xzf uci_housing.tar.gz
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
from paddle_serving_client import Client
import sys
client = Client()
client.load_client_config(sys.argv[1])
client.use_key("./key")
client.connect(["0.0.0.0:9393"], encryption=True)
fetch_list = client.get_fetch_names()
import paddle
test_reader = paddle.batch(
paddle.reader.shuffle(
paddle.dataset.uci_housing.test(), buf_size=500),
batch_size=1)
for data in test_reader():
fetch_map = client.predict(feed={"x": data[0][0]}, fetch=fetch_list)
print(fetch_map)
# Fit a line prediction example
([简体中文](./README_CN.md)|English)
## Get data
```shell
sh get_data.sh
```
## RPC service
### Start server
```shell
python3 -m paddle_serving_server.serve --model uci_housing_model --thread 10 --port 9393
```
## Client prediction
### RPC Client
The `paddlepaddle` package is used in `test_client.py`; you may need to install it first (`pip3 install paddlepaddle`).
``` shell
python3 test_client.py uci_housing_client/serving_client_conf.prototxt
```
### Http Client
``` shell
python3 test_httpclient.py uci_housing_client/serving_client_conf.prototxt
```
## Benchmark
``` shell
bash benchmark.sh uci_housing_model uci_housing_client
```
The benchmark log file is named `profile_log_uci_housing_model`.
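Since `benchmark.sh` appends lines such as `model_name:`, `batch_size:`, `CPU_UTILIZATION:`, `MAX_GPU_MEMORY:` and `GPU_UTILIZATION:` to that log, a small sketch like the one below can pull them back out; any other details of the log layout are assumptions.
```python
# Hedged sketch: print the metric lines that benchmark.sh writes into the profile log.
metrics = ("model_name:", "batch_size:", "CPU_UTILIZATION:", "MAX_GPU_MEMORY:",
           "GPU_UTILIZATION:")
with open("profile_log_uci_housing_model") as f:
    for line in f:
        if line.startswith(metrics):
            print(line.strip())
```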
# 线性回归预测服务示例
(简体中文|[English](./README.md))
## 获取数据
```shell
sh get_data.sh
```
## 开启服务端(支持BRPC-Client/GRPC Client/Http-Client)
```shell
python3 -m paddle_serving_server.serve --model uci_housing_model --thread 10 --port 9393
```
## 客户端预测
### BRPC-Client
`test_client.py`中使用了`paddlepaddle`包,需要进行下载(`pip3 install paddlepaddle`)。
``` shell
python3 test_client.py uci_housing_client/serving_client_conf.prototxt
```
### GRPC-Client/Http-Client
``` shell
python3 test_httpclient.py uci_housing_client/serving_client_conf.prototxt
```
## 性能测试
``` shell
bash benchmark.sh uci_housing_model uci_housing_client
```
性能测试的日志文件为profile_log_uci_housing_model
如需修改性能测试用例的参数,请修改benchmark.sh中的配置信息。
注意:uci_housing_model和uci_housing_client路径后不要加'/'符号,示例需要在GPU机器上运行。
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
from paddle_serving_client import Client
from paddle_serving_client.utils import MultiThreadRunner
from paddle_serving_client.utils import benchmark_args, show_latency
import time
import paddle
import sys
import requests
args = benchmark_args()
def single_func(idx, resource):
train_reader = paddle.batch(
paddle.reader.shuffle(
paddle.dataset.uci_housing.train(), buf_size=500),
batch_size=1)
total_number = sum(1 for _ in train_reader())
latency_list = []
if args.request == "rpc":
client = Client()
client.load_client_config(args.model)
client.connect([args.endpoint])
start = time.time()
for data in train_reader():
l_start = time.time()
fetch_map = client.predict(feed={"x": data[0][0]}, fetch=["price"])
l_end = time.time()
latency_list.append(l_end * 1000 - l_start * 1000)
end = time.time()
return [[end - start], latency_list, [total_number]]
elif args.request == "http":
train_reader = paddle.batch(
paddle.reader.shuffle(
paddle.dataset.uci_housing.train(), buf_size=500),
batch_size=1)
start = time.time()
for data in train_reader():
l_start = time.time()
r = requests.post(
'http://{}/uci/prediction'.format(args.endpoint),
data={"x": data[0]})
l_end = time.time()
latency_list.append(l_end * 1000 - l_start * 1000)
end = time.time()
return [[end - start], latency_list, [total_number]]
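# Drive the benchmark: run single_func on --thread worker threads and aggregate the timings.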
start = time.time()
multi_thread_runner = MultiThreadRunner()
result = multi_thread_runner.run(single_func, args.thread, {})
end = time.time()
total_cost = end - start
avg_cost = 0
for i in range(args.thread):
avg_cost += result[0][i]
avg_cost = avg_cost / args.thread
print("total cost: {}s".format(total_cost))
print("each thread cost: {}s. ".format(avg_cost))
print("qps: {}samples/s".format(args.batch_size * args.thread / total_cost))
show_latency(result[1])
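`benchmark.py` is normally driven by `benchmark.sh` below, but it can also be invoked directly; the flags shown mirror those used in the script, and `--endpoint` is assumed to be provided by `benchmark_args`, so point it at the port the server was started on:
```shell
python3 benchmark.py --thread 4 --batch_size 1 \
    --model uci_housing_client/serving_client_conf.prototxt \
    --request rpc --endpoint 127.0.0.1:9292
```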
rm profile_log*
export CUDA_VISIBLE_DEVICES=0,1
export FLAGS_profile_server=1
export FLAGS_profile_client=1
export FLAGS_serving_latency=1
gpu_id=0
#save cpu and gpu utilization log
if [ -d utilization ];then
rm -rf utilization
else
mkdir utilization
fi
#start server
$PYTHONROOT/bin/python3 -m paddle_serving_server.serve --model $1 --port 9292 --thread 4 --gpu_ids 0,1 --mem_optim --ir_optim > elog 2>&1 &
sleep 5
#warm up
$PYTHONROOT/bin/python3 benchmark.py --thread 4 --batch_size 1 --model $2/serving_client_conf.prototxt --request rpc > profile 2>&1
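#generate a small psutil-based sampler that appends CPU utilization to cpu.txt every 0.1s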
echo -e "import psutil\nimport time\nwhile True:\n\tcpu_res = psutil.cpu_percent()\n\twith open('cpu.txt', 'a+') as f:\n\t\tf.write(f'{cpu_res}\\\n')\n\ttime.sleep(0.1)" > cpu.py
for thread_num in 1 4 8 16
do
for batch_size in 1 4 16 64
do
job_bt=`date '+%Y%m%d%H%M%S'`
nvidia-smi --id=0 --query-compute-apps=used_memory --format=csv -lms 100 > gpu_memory_use.log 2>&1 &
nvidia-smi --id=0 --query-gpu=utilization.gpu --format=csv -lms 100 > gpu_utilization.log 2>&1 &
rm -rf cpu.txt
$PYTHONROOT/bin/python3 cpu.py &
gpu_memory_pid=$!
$PYTHONROOT/bin/python3 benchmark.py --thread $thread_num --batch_size $batch_size --model $2/serving_client_conf.prototxt --request rpc > profile 2>&1
kill `ps -ef|grep used_memory|awk '{print $2}'` > /dev/null
kill `ps -ef|grep utilization.gpu|awk '{print $2}'` > /dev/null
kill `ps -ef|grep cpu.py|awk '{print $2}'` > /dev/null
echo "model_name:" $1
echo "thread_num:" $thread_num
echo "batch_size:" $batch_size
echo "=================Done===================="
echo "model_name:$1" >> profile_log_$1
echo "batch_size:$batch_size" >> profile_log_$1
job_et=`date '+%Y%m%d%H%M%S'`
awk 'BEGIN {max = 0} {if(NR>1){if ($1 > max) max=$1}} END {print "CPU_UTILIZATION:", max}' cpu.txt >> profile_log_$1
awk 'BEGIN {max = 0} {if(NR>1){if ($1 > max) max=$1}} END {print "MAX_GPU_MEMORY:", max}' gpu_memory_use.log >> profile_log_$1
awk 'BEGIN {max = 0} {if(NR>1){if ($1 > max) max=$1}} END {print "GPU_UTILIZATION:", max}' gpu_utilization.log >> profile_log_$1
rm -rf gpu_memory_use.log gpu_utilization.log
$PYTHONROOT/bin/python3 ../util/show_profile.py profile $thread_num >> profile_log_$1
tail -n 8 profile >> profile_log_$1
echo "" >> profile_log_$1
done
done
#Divided log
awk 'BEGIN{RS="\n\n"}{i++}{print > "uci_log_"i}' profile_log_$1
mkdir uci_log && mv uci_log_* uci_log
ps -ef|grep 'serving'|grep -v grep|cut -c 9-15 | xargs kill -9
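After a run, the per-configuration metrics written by the `awk` commands above can be pulled out of the combined log in one pass, for example:
```shell
grep -E "model_name|batch_size|CPU_UTILIZATION|MAX_GPU_MEMORY|GPU_UTILIZATION" profile_log_uci_housing_model
```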