Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
PaddlePaddle
PaddleOCR
提交
01732bd9
P
PaddleOCR
项目概览
PaddlePaddle
/
PaddleOCR
大约 1 年 前同步成功
通知
1528
Star
32962
Fork
6643
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
108
列表
看板
标记
里程碑
合并请求
7
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
P
PaddleOCR
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
108
Issue
108
列表
看板
标记
里程碑
合并请求
7
合并请求
7
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
提交
01732bd9
编写于
4月 12, 2021
作者:
T
tink2123
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
cherry-pick en multi-doc
上级
91980595
变更
1
隐藏空白更改
内联
并排
Showing
1 changed file
with
43 addition
and
23 deletion
+43
-23
doc/doc_en/multi_languages_en.md
doc/doc_en/multi_languages_en.md
+43
-23
未找到文件。
doc/doc_en/multi_languages_en.md
浏览文件 @
01732bd9
...
...
@@ -2,24 +2,44 @@
**Recent Update**
-2021.4.9 supports the detection and recognition of 80 languages
-2021.4.9 supports
**lightweight high-precision**
English model detection and recognition
-
[
1 Installation
](
#Install
)
-
[
1.1 paddle installation
](
#paddleinstallation
)
-
[
1.2 paddleocr package installation
](
#paddleocr_package_install
)
-
[
2 Quick Use
](
#Quick_Use
)
-
[
2.1 Command line operation
](
#Command_line_operation
)
-
[
2.1.1 Prediction of the whole image
](
#bash_detection+recognition
)
-
[
2.1.2 Recognition
](
#bash_Recognition
)
-
[
2.1.3 Detection
](
#bash_detection
)
-
[
2.2 python script running
](
#python_Script_running
)
-
[
2.2.1 Whole image prediction
](
#python_detection+recognition
)
-
[
2.2.2 Recognition
](
#python_Recognition
)
-
[
2.2.3 Detection
](
#python_detection
)
-
[
3 Custom Training
](
#Custom_Training
)
-
[
4 Supported languages and abbreviations
](
#language_abbreviations
)
-
2021.4.9 supports the detection and recognition of 80 languages
-
2021.4.9 supports
**lightweight high-precision**
English model detection and recognition
PaddleOCR aims to create a rich, leading, and practical OCR tool library, which not only provides
Chinese and English models in general scenarios, but also provides models specifically trained
in English scenarios. And multilingual models covering
[
80 languages
](
#language_abbreviations
)
.
Among them, the English model supports the detection and recognition of uppercase and lowercase
letters and common punctuation, and the recognition of space characters is optimized:
<div
align=
"center"
>
<img
src=
"../imgs_results/multi_lang/en_1.jpg"
width=
"400"
height=
"600"
>
</div>
The multilingual models cover Latin, Arabic, Traditional Chinese, Korean, Japanese, etc.:
<div
align=
"center"
>
<img
src=
"../imgs_results/multi_lang/japan_2.jpg"
width=
"600"
height=
"300"
>
<img
src=
"../imgs_results/multi_lang/french_0.jpg"
width=
"300"
height=
"300"
>
</div>
This document will briefly introduce how to use the multilingual model.
-
[
1 Installation
](
#Install
)
-
[
1.1 paddle installation
](
#paddleinstallation
)
-
[
1.2 paddleocr package installation
](
#paddleocr_package_install
)
-
[
2 Quick Use
](
#Quick_Use
)
-
[
2.1 Command line operation
](
#Command_line_operation
)
-
[
2.1.1 Prediction of the whole image
](
#bash_detection+recognition
)
-
[
2.1.2 Recognition
](
#bash_Recognition
)
-
[
2.1.3 Detection
](
#bash_detection
)
-
[
2.2 python script running
](
#python_Script_running
)
-
[
2.2.1 Whole image prediction
](
#python_detection+recognition
)
-
[
2.2.2 Recognition
](
#python_Recognition
)
-
[
2.2.3 Detection
](
#python_detection
)
-
[
3 Custom Training
](
#Custom_Training
)
-
[
4 Supported languages and abbreviations
](
#language_abbreviations
)
<a
name=
"Install"
></a>
## 1 Installation
...
...
@@ -69,7 +89,7 @@ The specific supported [language] (#language_abbreviations) can be viewed in the
paddleocr
--image_dir
doc/imgs/japan_2.jpg
--lang
=
japan
```
![](
https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/release/2.
0
/doc/imgs/japan_2.jpg
)
![](
https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/release/2.
1
/doc/imgs/japan_2.jpg
)
The result is a list, each item contains a text box, text and recognition confidence
```
text
...
...
@@ -86,7 +106,7 @@ The result is a list, each item contains a text box, text and recognition confid
paddleocr
--image_dir
doc/imgs_words/japan/1.jpg
--det
false
--lang
=
japan
```
![](
https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/release/2.
0
/doc/imgs_words/japan/1.jpg
)
![](
https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/release/2.
1
/doc/imgs_words/japan/1.jpg
)
The result is a tuple, which returns the recognition result and recognition confidence
...
...
@@ -139,7 +159,7 @@ im_show.save('result.jpg')
```
Visualization of results:
![](
https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/release/2.
0
/doc/imgs_results/korean.jpg
)
![](
https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/release/2.
1
/doc/imgs_results/korean.jpg
)
*
Recognition
...
...
@@ -153,7 +173,7 @@ for line in result:
print(line)
```
![](
https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/release/2.
0
/doc/imgs_words/german/1.jpg
)
![](
https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/release/2.
1
/doc/imgs_words/german/1.jpg
)
The result is a tuple, which only contains the recognition result and recognition confidence
...
...
@@ -188,7 +208,7 @@ The result is a list, each item contains only text boxes
```
Visualization of results:
![](
https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/release/2.
0
/doc/imgs_results/whl/12_det.jpg
)
![](
https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/release/2.
1
/doc/imgs_results/whl/12_det.jpg
)
ppocr also supports direction classification. For more usage methods, please refer to:
[
whl package instructions
](
https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.0/doc/doc_ch/whl.md
)
.
...
...
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录