From 21032ebc51b0142c5c3ce8410c93bf5556c2b1a2 Mon Sep 17 00:00:00 2001 From: Jiawei Wang Date: Tue, 31 Mar 2020 11:25:51 +0800 Subject: [PATCH] Update BERT_10_MINS.md --- doc/BERT_10_MINS.md | 20 ++++++++++++++++++++ 1 file changed, 20 insertions(+) diff --git a/doc/BERT_10_MINS.md b/doc/BERT_10_MINS.md index f02a8529..e5faed74 100644 --- a/doc/BERT_10_MINS.md +++ b/doc/BERT_10_MINS.md @@ -8,6 +8,7 @@ The goal of Bert-As-Service is to give a sentence, and the service can represent Paddle Serving supports various models trained based on Paddle, and saves the serviceable model by specifying the input and output variables of the model. For convenience, we can load a trained bert Chinese model from paddlehub and save a deployable service with two lines of code. The server and client configurations are placed in the `bert_seq20_model` and` bert_seq20_client` folders, respectively. +[//file]:#bert_10.py ``` python import paddlehub as hub model_name = "bert_chinese_L-12_H-768_A-12" @@ -27,6 +28,7 @@ serving_io.save_model("bert_seq20_model", "bert_seq20_client", #### Step2: Launch Service +[//file]:#server.sh ``` shell python -m paddle_serving_server_gpu.serve --model bert_seq20_model --thread 10 --port 9292 --gpu_ids 0 ``` @@ -43,6 +45,7 @@ Paddle Serving has many built-in corresponding data preprocessing logics. For th Install paddle_serving_app +[//file]:#pip_app.sh ```shell pip install paddle_serving_app ``` @@ -51,6 +54,7 @@ pip install paddle_serving_app the script of client side bert_client.py is as follow: +[//file]:#bert_client.py ``` python import os import sys @@ -71,6 +75,7 @@ for line in sys.stdin: run +[//file]:#bert_10_cli.sh ```shell cat data.txt | python bert_client.py ``` @@ -82,3 +87,18 @@ read samples from data.txt, print results at the standard output. We tested the performance of Bert-As-Service based on Padde Serving based on V100 and compared it with the Bert-As-Service based on Tensorflow. From the perspective of user configuration, we used the same batch size and concurrent number for stress testing. The overall throughput performance data obtained under 4 V100s is as follows. ![4v100_bert_as_service_benchmark](4v100_bert_as_service_benchmark.png) + + -- GitLab