Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
PaddlePaddle
Serving
提交
d5f1da66
S
Serving
项目概览
PaddlePaddle
/
Serving
大约 1 年 前同步成功
通知
185
Star
833
Fork
253
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
105
列表
看板
标记
里程碑
合并请求
10
Wiki
2
Wiki
分析
仓库
DevOps
项目成员
Pages
S
Serving
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
105
Issue
105
列表
看板
标记
里程碑
合并请求
10
合并请求
10
Pages
分析
分析
仓库分析
DevOps
Wiki
2
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
体验新版 GitCode,发现更多精彩内容 >>
未验证
提交
d5f1da66
编写于
4月 06, 2021
作者:
T
TeslaZhao
提交者:
GitHub
4月 06, 2021
浏览文件
操作
浏览文件
下载
差异文件
Merge branch 'develop' into grpc-fix
上级
505cf16b
ffe4eb02
变更
18
隐藏空白更改
内联
并排
Showing
18 changed file
with
101 addition
and
110 deletion
+101
-110
python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/README.md
.../examples/detection/faster_rcnn_hrnetv2p_w18_1x/README.md
+1
-1
python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/README_CN.md
...amples/detection/faster_rcnn_hrnetv2p_w18_1x/README_CN.md
+1
-1
python/examples/detection/fcos_dcn_r50_fpn_1x_coco/README.md
python/examples/detection/fcos_dcn_r50_fpn_1x_coco/README.md
+1
-2
python/examples/detection/fcos_dcn_r50_fpn_1x_coco/README_CN.md
.../examples/detection/fcos_dcn_r50_fpn_1x_coco/README_CN.md
+1
-2
python/examples/detection/ssd_vgg16_300_240e_voc/README.md
python/examples/detection/ssd_vgg16_300_240e_voc/README.md
+1
-2
python/examples/detection/ssd_vgg16_300_240e_voc/README_CN.md
...on/examples/detection/ssd_vgg16_300_240e_voc/README_CN.md
+1
-2
python/examples/ocr/ocr_debugger_server.py
python/examples/ocr/ocr_debugger_server.py
+1
-1
python/examples/ocr/rec_debugger_server.py
python/examples/ocr/rec_debugger_server.py
+2
-1
python/examples/pipeline/bert/pipeline_rpc_client.py
python/examples/pipeline/bert/pipeline_rpc_client.py
+25
-15
python/examples/pipeline/bert/web_service.py
python/examples/pipeline/bert/web_service.py
+4
-6
python/examples/pipeline/imagenet/resnet50_web_service.py
python/examples/pipeline/imagenet/resnet50_web_service.py
+1
-4
python/examples/pipeline/imdb_model_ensemble/test_pipeline_server.py
...ples/pipeline/imdb_model_ensemble/test_pipeline_server.py
+4
-7
python/examples/pipeline/ocr/web_service.py
python/examples/pipeline/ocr/web_service.py
+4
-7
python/examples/pipeline/simple_web_service/web_service.py
python/examples/pipeline/simple_web_service/web_service.py
+8
-7
python/examples/pipeline/simple_web_service/web_service_java.py
.../examples/pipeline/simple_web_service/web_service_java.py
+1
-4
python/examples/senta/senta_web_service.py
python/examples/senta/senta_web_service.py
+0
-4
python/examples/xpu/fit_a_line_xpu/test_server.py
python/examples/xpu/fit_a_line_xpu/test_server.py
+2
-1
python/paddle_serving_app/local_predict.py
python/paddle_serving_app/local_predict.py
+43
-43
未找到文件。
python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/README.md
浏览文件 @
d5f1da66
...
...
@@ -10,7 +10,7 @@ wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/
### Start the service
```
tar xf faster_rcnn_hrnetv2p_w18_1x.tar
python -m paddle_serving_server
_gpu
.serve --model serving_server --port 9494 --gpu_ids 0
python -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
This model support TensorRT, if you want a faster inference, please use
`--use_trt`
.
...
...
python/examples/detection/faster_rcnn_hrnetv2p_w18_1x/README_CN.md
浏览文件 @
d5f1da66
...
...
@@ -11,7 +11,7 @@ wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/
### 启动服务
```
tar xf faster_rcnn_hrnetv2p_w18_1x.tar
python -m paddle_serving_server
_gpu
.serve --model serving_server --port 9494 --gpu_ids 0
python -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
该模型支持TensorRT,如果想要更快的预测速度,可以开启
`--use_trt`
选项。
...
...
python/examples/detection/fcos_dcn_r50_fpn_1x_coco/README.md
浏览文件 @
d5f1da66
...
...
@@ -10,7 +10,7 @@ wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/
### Start the service
```
tar xf fcos_dcn_r50_fpn_1x_coco.tar
python -m paddle_serving_server
_gpu
.serve --model serving_server --port 9494 --gpu_ids 0
python -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
This model support TensorRT, if you want a faster inference, please use
`--use_trt`
.
...
...
@@ -18,4 +18,3 @@ This model support TensorRT, if you want a faster inference, please use `--use_t
```
python test_client.py 000000570688.jpg
```
python/examples/detection/fcos_dcn_r50_fpn_1x_coco/README_CN.md
浏览文件 @
d5f1da66
...
...
@@ -11,7 +11,7 @@ wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/
### 启动服务
```
tar xf fcos_dcn_r50_fpn_1x_coco.tar
python -m paddle_serving_server
_gpu
.serve --model serving_server --port 9494 --gpu_ids 0
python -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
该模型支持TensorRT,如果想要更快的预测速度,可以开启
`--use_trt`
选项。
...
...
@@ -20,4 +20,3 @@ python -m paddle_serving_server_gpu.serve --model serving_server --port 9494 --g
```
python test_client.py 000000570688.jpg
```
python/examples/detection/ssd_vgg16_300_240e_voc/README.md
浏览文件 @
d5f1da66
...
...
@@ -10,7 +10,7 @@ wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/
### Start the service
```
tar xf ssd_vgg16_300_240e_voc.tar
python -m paddle_serving_server
_gpu
.serve --model serving_server --port 9494 --gpu_ids 0
python -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
This model support TensorRT, if you want a faster inference, please use
`--use_trt`
.
...
...
@@ -18,4 +18,3 @@ This model support TensorRT, if you want a faster inference, please use `--use_t
```
python test_client.py 000000570688.jpg
```
python/examples/detection/ssd_vgg16_300_240e_voc/README_CN.md
浏览文件 @
d5f1da66
...
...
@@ -11,7 +11,7 @@ wget --no-check-certificate https://paddle-serving.bj.bcebos.com/pddet_demo/2.0/
### 启动服务
```
tar xf ssd_vgg16_300_240e_voc.tar
python -m paddle_serving_server
_gpu
.serve --model serving_server --port 9494 --gpu_ids 0
python -m paddle_serving_server.serve --model serving_server --port 9494 --gpu_ids 0
```
该模型支持TensorRT,如果想要更快的预测速度,可以开启
`--use_trt`
选项。
...
...
@@ -20,4 +20,3 @@ python -m paddle_serving_server_gpu.serve --model serving_server --port 9494 --g
```
python test_client.py 000000570688.jpg
```
python/examples/ocr/ocr_debugger_server.py
浏览文件 @
d5f1da66
...
...
@@ -107,7 +107,7 @@ ocr_service.prepare_server(workdir="workdir", port=9292)
ocr_service
.
init_det_debugger
(
det_model_config
=
"ocr_det_model"
)
if
sys
.
argv
[
1
]
==
'gpu'
:
ocr_service
.
set_gpus
(
"2"
)
ocr_service
.
run_debugger_service
(
gpu
=
True
)
ocr_service
.
run_debugger_service
(
gpu
=
True
)
elif
sys
.
argv
[
1
]
==
'cpu'
:
ocr_service
.
run_debugger_service
()
ocr_service
.
run_web_service
()
python/examples/ocr/rec_debugger_server.py
浏览文件 @
d5f1da66
...
...
@@ -71,7 +71,8 @@ ocr_service.load_model_config("ocr_rec_model")
if
sys
.
argv
[
1
]
==
'gpu'
:
ocr_service
.
set_gpus
(
"0"
)
ocr_service
.
init_rec
()
ocr_service
.
prepare_server
(
workdir
=
"workdir"
,
port
=
9292
,
device
=
"gpu"
,
gpuid
=
0
)
ocr_service
.
prepare_server
(
workdir
=
"workdir"
,
port
=
9292
,
device
=
"gpu"
,
gpuid
=
0
)
elif
sys
.
argv
[
1
]
==
'cpu'
:
ocr_service
.
init_rec
()
ocr_service
.
prepare_server
(
workdir
=
"workdir"
,
port
=
9292
,
device
=
"cpu"
)
...
...
python/examples/pipeline/bert/pipeline_rpc_client.py
浏览文件 @
d5f1da66
# Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import
sys
import
os
import
yaml
import
requests
import
time
import
json
try
:
from
paddle_serving_server_gpu.pipeline
import
PipelineClient
except
ImportError
:
from
paddle_serving_server.pipeline
import
PipelineClient
from
paddle_serving_server.pipeline
import
PipelineClient
import
numpy
as
np
client
=
PipelineClient
()
client
.
connect
([
'127.0.0.1:9998'
])
batch_size
=
101
with
open
(
"data-c.txt"
,
'r'
)
as
fin
:
lines
=
fin
.
readlines
()
start_idx
=
0
while
start_idx
<
len
(
lines
):
end_idx
=
min
(
len
(
lines
),
start_idx
+
batch_size
)
feed
=
{}
for
i
in
range
(
start_idx
,
end_idx
):
feed
[
str
(
i
-
start_idx
)]
=
lines
[
i
]
ret
=
client
.
predict
(
feed_dict
=
feed
,
fetch
=
[
"res"
])
print
(
ret
)
start_idx
+=
batch_size
lines
=
fin
.
readlines
()
start_idx
=
0
while
start_idx
<
len
(
lines
):
end_idx
=
min
(
len
(
lines
),
start_idx
+
batch_size
)
feed
=
{}
for
i
in
range
(
start_idx
,
end_idx
):
feed
[
str
(
i
-
start_idx
)]
=
lines
[
i
]
ret
=
client
.
predict
(
feed_dict
=
feed
,
fetch
=
[
"res"
])
print
(
ret
)
start_idx
+=
batch_size
python/examples/pipeline/bert/web_service.py
浏览文件 @
d5f1da66
...
...
@@ -11,10 +11,7 @@
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
try
:
from
paddle_serving_server_gpu.web_service
import
WebService
,
Op
except
ImportError
:
from
paddle_serving_server.web_service
import
WebService
,
Op
from
paddle_serving_server.web_service
import
WebService
,
Op
import
logging
import
numpy
as
np
import
sys
...
...
@@ -37,7 +34,8 @@ class BertOp(Op):
for
i
in
range
(
batch_size
):
feed_dict
=
self
.
reader
.
process
(
input_dict
[
str
(
i
)].
encode
(
"utf-8"
))
for
key
in
feed_dict
.
keys
():
feed_dict
[
key
]
=
np
.
array
(
feed_dict
[
key
]).
reshape
((
1
,
len
(
feed_dict
[
key
]),
1
))
feed_dict
[
key
]
=
np
.
array
(
feed_dict
[
key
]).
reshape
(
(
1
,
len
(
feed_dict
[
key
]),
1
))
feed_res
.
append
(
feed_dict
)
feed_dict
=
{}
for
key
in
feed_res
[
0
].
keys
():
...
...
@@ -57,5 +55,5 @@ class BertService(WebService):
bert_service
=
BertService
(
name
=
"bert"
)
bert_service
.
prepare_pipeline_config
(
"config
2
.yml"
)
bert_service
.
prepare_pipeline_config
(
"config.yml"
)
bert_service
.
run_service
()
python/examples/pipeline/imagenet/resnet50_web_service.py
浏览文件 @
d5f1da66
...
...
@@ -13,10 +13,7 @@
# limitations under the License.
import
sys
from
paddle_serving_app.reader
import
Sequential
,
URL2Image
,
Resize
,
CenterCrop
,
RGB2BGR
,
Transpose
,
Div
,
Normalize
,
Base64ToImage
try
:
from
paddle_serving_server.web_service
import
WebService
,
Op
except
ImportError
:
from
paddle_serving_server.web_service
import
WebService
,
Op
from
paddle_serving_server.web_service
import
WebService
,
Op
import
logging
import
numpy
as
np
import
base64
,
cv2
...
...
python/examples/pipeline/imdb_model_ensemble/test_pipeline_server.py
浏览文件 @
d5f1da66
...
...
@@ -12,17 +12,14 @@
# See the License for the specific language governing permissions and
# limitations under the License.
# pylint: disable=doc-string-missing
import
numpy
as
np
from
paddle_serving_app.reader.imdb_reader
import
IMDBDataset
import
logging
from
paddle_serving_server.web_service
import
WebService
from
paddle_serving_server.pipeline
import
Op
,
RequestOp
,
ResponseOp
from
paddle_serving_server.pipeline
import
PipelineServer
from
paddle_serving_server.pipeline.proto
import
pipeline_service_pb2
from
paddle_serving_server.pipeline.channel
import
ChannelDataErrcode
import
numpy
as
np
from
paddle_serving_app.reader.imdb_reader
import
IMDBDataset
import
logging
try
:
from
paddle_serving_server.web_service
import
WebService
except
ImportError
:
from
paddle_serving_server.web_service
import
WebService
_LOGGER
=
logging
.
getLogger
()
user_handler
=
logging
.
StreamHandler
()
...
...
python/examples/pipeline/ocr/web_service.py
浏览文件 @
d5f1da66
...
...
@@ -11,10 +11,7 @@
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
try
:
from
paddle_serving_server_gpu.web_service
import
WebService
,
Op
except
ImportError
:
from
paddle_serving_server.web_service
import
WebService
,
Op
from
paddle_serving_server.web_service
import
WebService
,
Op
import
logging
import
numpy
as
np
import
cv2
...
...
@@ -48,7 +45,7 @@ class DetOp(Op):
imgs
=
[]
for
key
in
input_dict
.
keys
():
data
=
base64
.
b64decode
(
input_dict
[
key
].
encode
(
'utf8'
))
data
=
np
.
from
string
(
data
,
np
.
uint8
)
data
=
np
.
from
buffer
(
data
,
np
.
uint8
)
self
.
im
=
cv2
.
imdecode
(
data
,
cv2
.
IMREAD_COLOR
)
self
.
ori_h
,
self
.
ori_w
,
_
=
self
.
im
.
shape
det_img
=
self
.
det_preprocess
(
self
.
im
)
...
...
@@ -57,7 +54,7 @@ class DetOp(Op):
return
{
"image"
:
np
.
concatenate
(
imgs
,
axis
=
0
)},
False
,
None
,
""
def
postprocess
(
self
,
input_dicts
,
fetch_dict
,
log_id
):
# print(fetch_dict)
# print(fetch_dict)
det_out
=
fetch_dict
[
"concat_1.tmp_0"
]
ratio_list
=
[
float
(
self
.
new_h
)
/
self
.
ori_h
,
float
(
self
.
new_w
)
/
self
.
ori_w
...
...
@@ -114,5 +111,5 @@ class OcrService(WebService):
uci_service
=
OcrService
(
name
=
"ocr"
)
uci_service
.
prepare_pipeline_config
(
"config
2
.yml"
)
uci_service
.
prepare_pipeline_config
(
"config.yml"
)
uci_service
.
run_service
()
python/examples/pipeline/simple_web_service/web_service.py
浏览文件 @
d5f1da66
...
...
@@ -11,10 +11,8 @@
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
try
:
from
paddle_serving_server.web_service
import
WebService
,
Op
except
ImportError
:
from
paddle_serving_server.web_service
import
WebService
,
Op
from
paddle_serving_server.web_service
import
WebService
,
Op
import
logging
import
numpy
as
np
import
sys
...
...
@@ -34,8 +32,11 @@ class UciOp(Op):
x_value
=
input_dict
[
"x"
].
split
(
self
.
batch_separator
)
x_lst
=
[]
for
x_val
in
x_value
:
x_lst
.
append
(
np
.
array
([
float
(
x
.
strip
())
for
x
in
x_val
.
split
(
self
.
separator
)]).
reshape
(
1
,
13
))
input_dict
[
"x"
]
=
np
.
concatenate
(
x_lst
,
axis
=
0
)
x_lst
.
append
(
np
.
array
([
float
(
x
.
strip
())
for
x
in
x_val
.
split
(
self
.
separator
)
]).
reshape
(
1
,
13
))
input_dict
[
"x"
]
=
np
.
concatenate
(
x_lst
,
axis
=
0
)
proc_dict
=
{}
return
input_dict
,
False
,
None
,
""
...
...
@@ -53,5 +54,5 @@ class UciService(WebService):
uci_service
=
UciService
(
name
=
"uci"
)
uci_service
.
prepare_pipeline_config
(
"config
2
.yml"
)
uci_service
.
prepare_pipeline_config
(
"config.yml"
)
uci_service
.
run_service
()
python/examples/pipeline/simple_web_service/web_service_java.py
浏览文件 @
d5f1da66
...
...
@@ -11,10 +11,7 @@
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
try
:
from
paddle_serving_server.web_service
import
WebService
,
Op
except
ImportError
:
from
paddle_serving_server.web_service
import
WebService
,
Op
from
paddle_serving_server.web_service
import
WebService
,
Op
import
logging
import
numpy
as
np
from
numpy
import
array
...
...
python/examples/senta/senta_web_service.py
浏览文件 @
d5f1da66
...
...
@@ -13,13 +13,9 @@
# See the License for the specific language governing permissions and
# limitations under the License.
from
paddle_serving_server.web_service
import
WebService
from
paddle_serving_client
import
Client
from
paddle_serving_app.reader
import
LACReader
,
SentaReader
import
os
import
sys
import
numpy
as
np
#senta_web_service.py
from
paddle_serving_server.web_service
import
WebService
from
paddle_serving_client
import
Client
from
paddle_serving_app.reader
import
LACReader
,
SentaReader
...
...
python/examples/xpu/fit_a_line_xpu/test_server.py
浏览文件 @
d5f1da66
...
...
@@ -31,6 +31,7 @@ class UciService(WebService):
uci_service
=
UciService
(
name
=
"uci"
)
uci_service
.
load_model_config
(
"uci_housing_model"
)
uci_service
.
prepare_server
(
workdir
=
"workdir"
,
port
=
9393
,
use_lite
=
True
,
use_xpu
=
True
,
ir_optim
=
True
)
uci_service
.
prepare_server
(
workdir
=
"workdir"
,
port
=
9393
,
use_lite
=
True
,
use_xpu
=
True
,
ir_optim
=
True
)
uci_service
.
run_rpc_service
()
uci_service
.
run_web_service
()
python/paddle_serving_app/local_predict.py
浏览文件 @
d5f1da66
...
...
@@ -19,16 +19,12 @@ import os
import
google.protobuf.text_format
import
numpy
as
np
import
argparse
import
paddle.fluid
as
fluid
import
paddle.inference
as
inference
from
.proto
import
general_model_config_pb2
as
m_config
from
paddle.fluid.core
import
PaddleTensor
from
paddle.fluid.core
import
AnalysisConfig
from
paddle.fluid.core
import
create_paddle_predictor
import
paddle.inference
as
paddle_infer
import
logging
logging
.
basicConfig
(
format
=
"%(asctime)s - %(levelname)s - %(message)s"
)
logger
=
logging
.
getLogger
(
"
fluid
"
)
logger
=
logging
.
getLogger
(
"
LocalPredictor
"
)
logger
.
setLevel
(
logging
.
INFO
)
...
...
@@ -62,7 +58,7 @@ class LocalPredictor(object):
use_xpu
=
False
,
use_feed_fetch_ops
=
False
):
"""
Load model config
and set the engine config for the paddle predictor
Load model config
s and create the paddle predictor by Paddle Inference API.
Args:
model_path: model config path.
...
...
@@ -83,14 +79,18 @@ class LocalPredictor(object):
model_conf
=
google
.
protobuf
.
text_format
.
Merge
(
str
(
f
.
read
()),
model_conf
)
if
os
.
path
.
exists
(
os
.
path
.
join
(
model_path
,
"__params__"
)):
config
=
AnalysisConfig
(
os
.
path
.
join
(
model_path
,
"__model__"
),
os
.
path
.
join
(
model_path
,
"__params__"
))
config
=
paddle_infer
.
Config
(
os
.
path
.
join
(
model_path
,
"__model__"
),
os
.
path
.
join
(
model_path
,
"__params__"
))
else
:
config
=
AnalysisConfig
(
model_path
)
logger
.
info
(
"load_model_config params: model_path:{}, use_gpu:{},
\
config
=
paddle_infer
.
Config
(
model_path
)
logger
.
info
(
"LocalPredictor load_model_config params: model_path:{}, use_gpu:{},
\
gpu_id:{}, use_profile:{}, thread_num:{}, mem_optim:{}, ir_optim:{},
\
use_trt:{}, use_lite:{}, use_xpu: {}, use_feed_fetch_ops:{}"
.
format
(
model_path
,
use_gpu
,
gpu_id
,
use_profile
,
thread_num
,
mem_optim
,
ir_optim
,
use_trt
,
use_lite
,
use_xpu
,
use_feed_fetch_ops
))
model_path
,
use_gpu
,
gpu_id
,
use_profile
,
thread_num
,
mem_optim
,
ir_optim
,
use_trt
,
use_lite
,
use_xpu
,
use_feed_fetch_ops
))
self
.
feed_names_
=
[
var
.
alias_name
for
var
in
model_conf
.
feed_var
]
self
.
fetch_names_
=
[
var
.
alias_name
for
var
in
model_conf
.
fetch_var
]
...
...
@@ -129,7 +129,7 @@ class LocalPredictor(object):
if
use_lite
:
config
.
enable_lite_engine
(
precision_mode
=
inference
.
PrecisionType
.
Float32
,
precision_mode
=
paddle_infer
.
PrecisionType
.
Float32
,
zero_copy
=
True
,
passes_filter
=
[],
ops_filter
=
[])
...
...
@@ -138,11 +138,11 @@ class LocalPredictor(object):
# 2MB l3 cache
config
.
enable_xpu
(
8
*
1024
*
1024
)
self
.
predictor
=
create_paddl
e_predictor
(
config
)
self
.
predictor
=
paddle_infer
.
creat
e_predictor
(
config
)
def
predict
(
self
,
feed
=
None
,
fetch
=
None
,
batch
=
False
,
log_id
=
0
):
"""
Predict locally
Run model inference by Paddle Inference API.
Args:
feed: feed var
...
...
@@ -155,14 +155,16 @@ class LocalPredictor(object):
fetch_map: dict
"""
if
feed
is
None
or
fetch
is
None
:
raise
ValueError
(
"You should specify feed and fetch for prediction"
)
raise
ValueError
(
"You should specify feed and fetch for prediction.
\
log_id:{}"
.
format
(
log_id
))
fetch_list
=
[]
if
isinstance
(
fetch
,
str
):
fetch_list
=
[
fetch
]
elif
isinstance
(
fetch
,
list
):
fetch_list
=
fetch
else
:
raise
ValueError
(
"Fetch only accepts string and list of string"
)
raise
ValueError
(
"Fetch only accepts string and list of string.
\
log_id:{}"
.
format
(
log_id
))
feed_batch
=
[]
if
isinstance
(
feed
,
dict
):
...
...
@@ -170,27 +172,21 @@ class LocalPredictor(object):
elif
isinstance
(
feed
,
list
):
feed_batch
=
feed
else
:
raise
ValueError
(
"Feed only accepts dict and list of dict"
)
int_slot_batch
=
[]
float_slot_batch
=
[]
int_feed_names
=
[]
float_feed_names
=
[]
int_shape
=
[]
float_shape
=
[]
fetch_names
=
[]
counter
=
0
batch_size
=
len
(
feed_batch
)
raise
ValueError
(
"Feed only accepts dict and list of dict.
\
log_id:{}"
.
format
(
log_id
))
fetch_names
=
[]
# Filter invalid fetch names
for
key
in
fetch_list
:
if
key
in
self
.
fetch_names_
:
fetch_names
.
append
(
key
)
if
len
(
fetch_names
)
==
0
:
raise
ValueError
(
"Fetch names should not be empty or out of saved fetch list.
"
)
return
{}
"Fetch names should not be empty or out of saved fetch list.
\
log_id:{}"
.
format
(
log_id
))
# Assemble the input data of paddle predictor
input_names
=
self
.
predictor
.
get_input_names
()
for
name
in
input_names
:
if
isinstance
(
feed
[
name
],
list
):
...
...
@@ -204,27 +200,31 @@ class LocalPredictor(object):
feed
[
name
]
=
feed
[
name
].
astype
(
"int32"
)
else
:
raise
ValueError
(
"local predictor receives wrong data type"
)
input_tensor
=
self
.
predictor
.
get_input_tensor
(
name
)
input_tensor
_handle
=
self
.
predictor
.
get_input_handle
(
name
)
if
"{}.lod"
.
format
(
name
)
in
feed
:
input_tensor
.
set_lod
([
feed
[
"{}.lod"
.
format
(
name
)]])
input_tensor
_handle
.
set_lod
([
feed
[
"{}.lod"
.
format
(
name
)]])
if
batch
==
False
:
input_tensor
.
copy_from_cpu
(
feed
[
name
][
np
.
newaxis
,
:])
input_tensor
_handle
.
copy_from_cpu
(
feed
[
name
][
np
.
newaxis
,
:])
else
:
input_tensor
.
copy_from_cpu
(
feed
[
name
])
output_tensors
=
[]
input_tensor
_handle
.
copy_from_cpu
(
feed
[
name
])
output_tensor
_handle
s
=
[]
output_names
=
self
.
predictor
.
get_output_names
()
for
output_name
in
output_names
:
output_tensor
=
self
.
predictor
.
get_output_tensor
(
output_name
)
output_tensors
.
append
(
output_tensor
)
output_tensor_handle
=
self
.
predictor
.
get_output_handle
(
output_name
)
output_tensor_handles
.
append
(
output_tensor_handle
)
# Run inference
self
.
predictor
.
run
()
# Assemble output data of predict results
outputs
=
[]
self
.
predictor
.
zero_copy_run
()
for
output_tensor
in
output_tensors
:
output
=
output_tensor
.
copy_to_cpu
()
for
output_tensor_handle
in
output_tensor_handles
:
output
=
output_tensor_handle
.
copy_to_cpu
()
outputs
.
append
(
output
)
fetch_map
=
{}
for
i
,
name
in
enumerate
(
fetch
):
fetch_map
[
name
]
=
outputs
[
i
]
if
len
(
output_tensors
[
i
].
lod
())
>
0
:
fetch_map
[
name
+
".lod"
]
=
np
.
array
(
output_tensor
s
[
i
].
lod
()[
0
]).
astype
(
'int32'
)
if
len
(
output_tensor
_handle
s
[
i
].
lod
())
>
0
:
fetch_map
[
name
+
".lod"
]
=
np
.
array
(
output_tensor
_handles
[
i
]
.
lod
()[
0
]).
astype
(
'int32'
)
return
fetch_map
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录