PaddlePaddle / book
Commit 5d93ad3c
Authored Sep 13, 2017 by Mimee
Committed by GitHub on Sep 13, 2017
Update README.md
Some language changes
Parent: d598fe25
Showing 1 changed file with 16 additions and 16 deletions

serve/README.md (+16 -16)
 # Inference Server Example

-The inference server can be used to inference any model trained by
+The inference server can be used to perform inference on any model trained on
 PaddlePaddle. It provides an HTTP endpoint.

 ## Run

 The inference server reads a trained model (a topology file and a
-parameter file) and serves HTTP request at port `8000`.
+parameter file) and serves HTTP requests at port `8000`.

 We will first show how to obtain the PaddlePaddle model, and then how
 to start the server.
...
...
@@ -14,12 +14,12 @@ to start the server.
 We will use Docker to run the demo, if you are not familiar with
 Docker, please check out this
-[tutorial](https://github.com/PaddlePaddle/Paddle/wiki/TLDR-for-new-docker-user).
+[TLDR](https://github.com/PaddlePaddle/Paddle/wiki/TLDR-for-new-docker-user).
 ### Obtain the PaddlePaddle Model

-Neural network model in PaddlePaddle contains two parts, the
-parameter, and the topology.
+A neural network model in PaddlePaddle contains two parts: the
+**parameter** and the **topology**.

 A PaddlePaddle training script contains the neural network topology,
 which is represented by layers. For example,
...
...
@@ -45,7 +45,7 @@ PaddlePaddle stores the topology and parameter separately.
 1. To serialize a topology, we need to create a topology instance
    explicitly by the outputs of the neural network. Then, invoke
-   `serialize_for_inference` method. The example code is
+   `serialize_for_inference` method.

 ```python
 # Save the inference topology to protobuf.
...
...
@@ -55,17 +55,17 @@ PaddlePaddle stores the topology and parameter separately.
 ```

 2. To save a parameter, we need to invoke `to_tar` method in Parameter
-   class. The example code is,
+   class.

 ```python
 with open('param.tar', 'w') as f:
     params.to_tar(f)
 ```
-After we serialize the parameter and topology to two files, we could
-use that two files to set up an inference server.
+After serializing the parameter and topology into two files, we could
+use them to set up an inference server.

-For a working example, please see [here](https://github.com/reyoung/paddle_mnist_v2_demo/blob/master/train.py).
+For a working example, please see [train.py](https://github.com/reyoung/paddle_mnist_v2_demo/blob/master/train.py).

 ### Start the Server
...
...
@@ -79,12 +79,12 @@ docker run --name paddle_serve -v `pwd`:/data -d -p 8000:80 -e WITH_GPU=0 paddle
 ```

 The above command will mount the current working directory to the
-`/data` directory inside the docker container. The inference server
+`/data/` directory inside the docker container. The inference server
 will load the model topology and parameters that we just created from
 there.

-To run the inference server with GPU support, please install [nvidia-docker](https://github.com/NVIDIA/nvidia-docker)
+To run the inference server with GPU support, please make sure you have [nvidia-docker](https://github.com/NVIDIA/nvidia-docker)
 first, and run:

 ```bash
...
...
@@ -103,7 +103,7 @@ add `Content-Type` request header as `Content-Type: application/json`.
 The request json object is a single json dictionary object, whose key
 is the layer name of input data. The type of the corresponding value
 is decided by the data type. For most cases the corresponding value
-will be a list of floats. For completeness we will list all data types
+will be a list of floats. For completeness, we will list all data types
 below:

 There are twelve data types supported by PaddlePaddle:
...
...
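As a rough illustration of the request format described in this hunk, the sketch below builds (but does not send) a JSON POST request using only the Python standard library. The layer name `img`, the input values, and the URL `http://localhost:8000/` are assumptions chosen for the example. The real key must match the input layer name of the served topology.

```python
import json
import urllib.request


def build_inference_request(url, inputs):
    """Build a POST request whose JSON body maps input layer names to values."""
    body = json.dumps(inputs).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical input layer "img" holding a short list of floats.
request = build_inference_request(
    "http://localhost:8000/", {"img": [0.0, 0.5, 1.0]}
)
# To actually send it, a running server is required:
#   urllib.request.urlopen(request).read()
```

Keeping request construction separate from sending makes it possible to inspect the body and headers without a running server.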
@@ -188,8 +188,8 @@ The response is a json object, too. The example of return data are:
 }
 ```

-The `code` and `message` represent the status of the request. The
-`data` are the outputs of the neural network; they could be a
+Here, `code` and `message` represent the status of the request.
+`data` corresponds to the outputs of the neural network; they could be a
 probability of each class, could be the IDs of output sentence, and so
 on.
...
...
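The response fields described above could be handled on the client side roughly as follows. This is a minimal sketch, not the server's own client code: it assumes that a `code` of `0` signals success, and the sample reply, the `InferenceError` class, and the function name are hypothetical.

```python
import json


class InferenceError(Exception):
    """Raised when the server reports a failed request (hypothetical helper)."""


def parse_inference_response(raw_body, ok_code=0):
    """Return the `data` field, or raise if `code` differs from ok_code.

    Treating 0 as the success code is an assumption of this sketch.
    """
    reply = json.loads(raw_body)
    if reply.get("code") != ok_code:
        raise InferenceError(reply.get("message", "unknown error"))
    return reply["data"]


# Hypothetical reply for a three-class classifier.
sample = '{"code": 0, "message": "success", "data": [[0.1, 0.7, 0.2]]}'
probabilities = parse_inference_response(sample)[0]
# Index of the most probable class.
predicted_class = max(range(len(probabilities)), key=probabilities.__getitem__)
```

When `data` holds per-class probabilities, taking the index of the largest value gives the predicted class. For sequence outputs, `data` would instead carry the IDs of the output sentence.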