Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
PaddlePaddle
PaddleOCR
提交
dd0f8c1d
P
PaddleOCR
项目概览
PaddlePaddle
/
PaddleOCR
大约 1 年 前同步成功
通知
1528
Star
32962
Fork
6643
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
108
列表
看板
标记
里程碑
合并请求
7
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
P
PaddleOCR
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
108
Issue
108
列表
看板
标记
里程碑
合并请求
7
合并请求
7
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
提交
dd0f8c1d
编写于
12月 08, 2020
作者:
T
tink2123
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
update for multi-language
上级
8a5566c9
变更
7
隐藏空白更改
内联
并排
Showing
7 changed file
with
15 addition
and
11 deletion
+15
-11
configs/rec/multi_language/rec_en_number_lite_train.yml
configs/rec/multi_language/rec_en_number_lite_train.yml
+1
-1
configs/rec/multi_language/rec_french_lite_train.yml
configs/rec/multi_language/rec_french_lite_train.yml
+2
-2
configs/rec/multi_language/rec_german_lite_train.yml
configs/rec/multi_language/rec_german_lite_train.yml
+1
-1
configs/rec/multi_language/rec_japan_lite_train.yml
configs/rec/multi_language/rec_japan_lite_train.yml
+1
-1
configs/rec/multi_language/rec_korean_lite_train.yml
configs/rec/multi_language/rec_korean_lite_train.yml
+1
-1
ppocr/data/imaug/label_ops.py
ppocr/data/imaug/label_ops.py
+4
-2
ppocr/postprocess/rec_postprocess.py
ppocr/postprocess/rec_postprocess.py
+5
-3
未找到文件。
configs/rec/multi_language/rec_en_number_lite_train.yml
浏览文件 @
dd0f8c1d
...
@@ -15,7 +15,7 @@ Global:
...
@@ -15,7 +15,7 @@ Global:
use_visualdl
:
False
use_visualdl
:
False
infer_img
:
infer_img
:
# for data or label process
# for data or label process
character_dict_path
:
ppocr/utils/ic15_dict.txt
character_dict_path
:
ppocr/utils/
dict/
ic15_dict.txt
character_type
:
ch
character_type
:
ch
max_text_length
:
25
max_text_length
:
25
infer_mode
:
False
infer_mode
:
False
...
...
configs/rec/multi_language/rec_french_lite_train.yml
浏览文件 @
dd0f8c1d
...
@@ -15,7 +15,7 @@ Global:
...
@@ -15,7 +15,7 @@ Global:
use_visualdl
:
False
use_visualdl
:
False
infer_img
:
infer_img
:
# for data or label process
# for data or label process
character_dict_path
:
ppocr/utils/french_dict.txt
character_dict_path
:
ppocr/utils/
dict/
french_dict.txt
character_type
:
french
character_type
:
french
max_text_length
:
25
max_text_length
:
25
infer_mode
:
False
infer_mode
:
False
...
@@ -85,7 +85,7 @@ Eval:
...
@@ -85,7 +85,7 @@ Eval:
dataset
:
dataset
:
name
:
SimpleDataSet
name
:
SimpleDataSet
data_dir
:
./train_data/
data_dir
:
./train_data/
label_file_list
:
[
"
./train_data/
eval
_list.txt"
]
label_file_list
:
[
"
./train_data/
train
_list.txt"
]
transforms
:
transforms
:
-
DecodeImage
:
# load image
-
DecodeImage
:
# load image
img_mode
:
BGR
img_mode
:
BGR
...
...
configs/rec/multi_language/rec_german_lite_train.yml
浏览文件 @
dd0f8c1d
...
@@ -15,7 +15,7 @@ Global:
...
@@ -15,7 +15,7 @@ Global:
use_visualdl
:
False
use_visualdl
:
False
infer_img
:
infer_img
:
# for data or label process
# for data or label process
character_dict_path
:
ppocr/utils/german_dict.txt
character_dict_path
:
ppocr/utils/
dict/
german_dict.txt
character_type
:
german
character_type
:
german
max_text_length
:
25
max_text_length
:
25
infer_mode
:
False
infer_mode
:
False
...
...
configs/rec/multi_language/rec_japan_lite_train.yml
浏览文件 @
dd0f8c1d
...
@@ -15,7 +15,7 @@ Global:
...
@@ -15,7 +15,7 @@ Global:
use_visualdl
:
False
use_visualdl
:
False
infer_img
:
infer_img
:
# for data or label process
# for data or label process
character_dict_path
:
ppocr/utils/japan_dict.txt
character_dict_path
:
ppocr/utils/
dict/
japan_dict.txt
character_type
:
japan
character_type
:
japan
max_text_length
:
25
max_text_length
:
25
infer_mode
:
False
infer_mode
:
False
...
...
configs/rec/multi_language/rec_korean_lite_train.yml
浏览文件 @
dd0f8c1d
...
@@ -15,7 +15,7 @@ Global:
...
@@ -15,7 +15,7 @@ Global:
use_visualdl
:
False
use_visualdl
:
False
infer_img
:
infer_img
:
# for data or label process
# for data or label process
character_dict_path
:
ppocr/utils/korean_dict.txt
character_dict_path
:
ppocr/utils/
dict/
korean_dict.txt
character_type
:
korean
character_type
:
korean
max_text_length
:
25
max_text_length
:
25
infer_mode
:
False
infer_mode
:
False
...
...
ppocr/data/imaug/label_ops.py
浏览文件 @
dd0f8c1d
...
@@ -79,7 +79,9 @@ class BaseRecLabelEncode(object):
...
@@ -79,7 +79,9 @@ class BaseRecLabelEncode(object):
character_dict_path
=
None
,
character_dict_path
=
None
,
character_type
=
'ch'
,
character_type
=
'ch'
,
use_space_char
=
False
):
use_space_char
=
False
):
support_character_type
=
[
'ch'
,
'en'
,
'en_sensitive'
]
support_character_type
=
[
'ch'
,
'en'
,
'en_sensitive'
,
'french'
,
'german'
,
'japan'
,
'french'
]
assert
character_type
in
support_character_type
,
"Only {} are supported now but get {}"
.
format
(
assert
character_type
in
support_character_type
,
"Only {} are supported now but get {}"
.
format
(
support_character_type
,
self
.
character_str
)
support_character_type
,
self
.
character_str
)
...
@@ -87,7 +89,7 @@ class BaseRecLabelEncode(object):
...
@@ -87,7 +89,7 @@ class BaseRecLabelEncode(object):
if
character_type
==
"en"
:
if
character_type
==
"en"
:
self
.
character_str
=
"0123456789abcdefghijklmnopqrstuvwxyz"
self
.
character_str
=
"0123456789abcdefghijklmnopqrstuvwxyz"
dict_character
=
list
(
self
.
character_str
)
dict_character
=
list
(
self
.
character_str
)
elif
character_type
==
"ch"
:
elif
character_type
in
[
"ch"
,
"french"
,
"german"
,
"japan"
,
"french"
]
:
self
.
character_str
=
""
self
.
character_str
=
""
assert
character_dict_path
is
not
None
,
"character_dict_path should not be None when character_type is ch"
assert
character_dict_path
is
not
None
,
"character_dict_path should not be None when character_type is ch"
with
open
(
character_dict_path
,
"rb"
)
as
fin
:
with
open
(
character_dict_path
,
"rb"
)
as
fin
:
...
...
ppocr/postprocess/rec_postprocess.py
浏览文件 @
dd0f8c1d
...
@@ -23,14 +23,16 @@ class BaseRecLabelDecode(object):
...
@@ -23,14 +23,16 @@ class BaseRecLabelDecode(object):
character_dict_path
=
None
,
character_dict_path
=
None
,
character_type
=
'ch'
,
character_type
=
'ch'
,
use_space_char
=
False
):
use_space_char
=
False
):
support_character_type
=
[
'ch'
,
'en'
,
'en_sensitive'
]
support_character_type
=
[
'ch'
,
'en'
,
'en_sensitive'
,
'french'
,
'german'
,
'japan'
,
'french'
]
assert
character_type
in
support_character_type
,
"Only {} are supported now but get {}"
.
format
(
assert
character_type
in
support_character_type
,
"Only {} are supported now but get {}"
.
format
(
support_character_type
,
self
.
character_str
)
support_character_type
,
self
.
character_str
)
if
character_type
==
"en"
:
if
character_type
==
"en"
:
self
.
character_str
=
"0123456789abcdefghijklmnopqrstuvwxyz"
self
.
character_str
=
"0123456789abcdefghijklmnopqrstuvwxyz"
dict_character
=
list
(
self
.
character_str
)
dict_character
=
list
(
self
.
character_str
)
elif
character_type
==
"ch"
:
elif
character_type
in
[
"ch"
,
"french"
,
"german"
,
"japan"
,
"french"
]
:
self
.
character_str
=
""
self
.
character_str
=
""
assert
character_dict_path
is
not
None
,
"character_dict_path should not be None when character_type is ch"
assert
character_dict_path
is
not
None
,
"character_dict_path should not be None when character_type is ch"
with
open
(
character_dict_path
,
"rb"
)
as
fin
:
with
open
(
character_dict_path
,
"rb"
)
as
fin
:
...
@@ -150,4 +152,4 @@ class AttnLabelDecode(BaseRecLabelDecode):
...
@@ -150,4 +152,4 @@ class AttnLabelDecode(BaseRecLabelDecode):
else
:
else
:
assert
False
,
"unsupport type %s in get_beg_end_flag_idx"
\
assert
False
,
"unsupport type %s in get_beg_end_flag_idx"
\
%
beg_or_end
%
beg_or_end
return
idx
return
idx
\ No newline at end of file
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录