# PP-Structure Model list

- [PP-Structure Model list](#pp-structure-model-list)
  - [1. Layout Analysis](#1-layout-analysis)
  - [2. OCR and Table Recognition](#2-ocr-and-table-recognition)
    - [2.1 OCR](#21-ocr)
    - [2.2 Table Recognition](#22-table-recognition)
  - [3. VQA](#3-vqa)
  - [4. KIE](#4-kie)


<a name="1"></a>
## 1. Layout Analysis

|model name| description                                                                                                                                             |download|label_map|
| --- |---------------------------------------------------------------------------------------------------------------------------------------------------------| --- | --- |
| ppyolov2_r50vd_dcn_365e_publaynet | Layout analysis model trained on the PubLayNet dataset; it recognizes 5 types of regions: **text, title, table, picture and list** | [inference model](https://paddle-model-ecology.bj.bcebos.com/model/layout-parser/ppyolov2_r50vd_dcn_365e_publaynet.tar) / [trained model](https://paddle-model-ecology.bj.bcebos.com/model/layout-parser/ppyolov2_r50vd_dcn_365e_publaynet_pretrained.pdparams) |{0: "Text", 1: "Title", 2: "List", 3:"Table", 4:"Figure"}|
| ppyolov2_r50vd_dcn_365e_tableBank_word | Layout analysis model trained on the TableBank Word dataset; it detects tables only | [inference model](https://paddle-model-ecology.bj.bcebos.com/model/layout-parser/ppyolov2_r50vd_dcn_365e_tableBank_word.tar) | {0:"Table"}|
| ppyolov2_r50vd_dcn_365e_tableBank_latex | Layout analysis model trained on the TableBank Latex dataset; it detects tables only | [inference model](https://paddle-model-ecology.bj.bcebos.com/model/layout-parser/ppyolov2_r50vd_dcn_365e_tableBank_latex.tar) | {0:"Table"}|
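
The `label_map` column maps the integer class IDs produced by a layout model to region names. The following is a minimal sketch of how such a map might be applied. The `(class_id, score, box)` tuple layout and the `name_regions` helper are assumptions made for illustration and are not part of PP-Structure; only the map itself is taken from the table.

```python
# label_map copied from the ppyolov2_r50vd_dcn_365e_publaynet row of the table.
PUBLAYNET_LABEL_MAP = {0: "Text", 1: "Title", 2: "List", 3: "Table", 4: "Figure"}


def name_regions(detections, label_map):
    """Attach a readable label to each detection.

    `detections` is a hypothetical list of (class_id, score, box) tuples;
    the real output format depends on the detector you run.
    """
    named = []
    for class_id, score, box in detections:
        named.append(
            {
                # Fall back to "Unknown" for IDs missing from the map.
                "label": label_map.get(class_id, "Unknown"),
                "score": score,
                "box": box,
            }
        )
    return named


# Made-up detections, used only to exercise the helper.
sample = [
    (0, 0.98, [50, 40, 500, 120]),
    (3, 0.91, [50, 150, 500, 400]),
]
regions = name_regions(sample, PUBLAYNET_LABEL_MAP)
print([r["label"] for r in regions])
```

For the two TableBank models the same helper would be called with `{0: "Table"}`.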

<a name="2"></a>
## 2. OCR and Table Recognition

<a name="21"></a>
### 2.1 OCR

|model name| description | inference model size |download|
| --- |---|---| --- |
|en_ppocr_mobile_v2.0_table_det| Text detection model of English table scenes trained on PubTabNet dataset | 4.7M                |[inference model](https://paddleocr.bj.bcebos.com/dygraph_v2.0/table/en_ppocr_mobile_v2.0_table_det_infer.tar) / [trained model](https://paddleocr.bj.bcebos.com/dygraph_v2.1/table/en_ppocr_mobile_v2.0_table_det_train.tar) |
|en_ppocr_mobile_v2.0_table_rec| Text recognition model of English table scenes trained on PubTabNet dataset | 6.9M                |[inference model](https://paddleocr.bj.bcebos.com/dygraph_v2.0/table/en_ppocr_mobile_v2.0_table_rec_infer.tar) / [trained model](https://paddleocr.bj.bcebos.com/dygraph_v2.1/table/en_ppocr_mobile_v2.0_table_rec_train.tar) |

To use other OCR models, download one from the [PP-OCR model list](../../doc/doc_ch/models_list.md) or use a model you trained yourself, then set the `det_model_dir` and `rec_model_dir` fields to the corresponding model directories.
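
As a rough sketch of that workflow, the snippet below downloads the two inference archives listed in the table, unpacks them, and collects the resulting paths under the `det_model_dir` and `rec_model_dir` keys. The helper names, the `inference` root directory, and the assumption that each archive unpacks into a folder named after the archive are illustrative only; check the extracted layout before relying on it.

```python
import tarfile
import urllib.request
from pathlib import Path

# URLs copied from the OCR table in this section.
DET_URL = "https://paddleocr.bj.bcebos.com/dygraph_v2.0/table/en_ppocr_mobile_v2.0_table_det_infer.tar"
REC_URL = "https://paddleocr.bj.bcebos.com/dygraph_v2.0/table/en_ppocr_mobile_v2.0_table_rec_infer.tar"


def local_model_dir(url, root="inference"):
    """Return the directory we assume the archive unpacks into."""
    name = url.rsplit("/", 1)[-1]
    if name.endswith(".tar"):
        name = name[: -len(".tar")]
    return Path(root) / name


def fetch_model(url, root="inference"):
    """Download and extract one archive (hypothetical helper, needs network)."""
    root_path = Path(root)
    root_path.mkdir(parents=True, exist_ok=True)
    archive = root_path / url.rsplit("/", 1)[-1]
    if not archive.exists():
        urllib.request.urlretrieve(url, str(archive))
    with tarfile.open(archive) as tar:
        tar.extractall(root_path)
    return local_model_dir(url, root)


def ocr_model_settings(det_url, rec_url, root="inference"):
    """Build the two fields named in the text from the archive URLs."""
    return {
        "det_model_dir": local_model_dir(det_url, root).as_posix(),
        "rec_model_dir": local_model_dir(rec_url, root).as_posix(),
    }


# Only computes paths; call fetch_model(DET_URL) and fetch_model(REC_URL)
# yourself when you actually want to download the files.
settings = ocr_model_settings(DET_URL, REC_URL)
print(settings)
```

The path computation is kept separate from the download so the settings can be built and inspected without network access.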

<a name="22"></a>
### 2.2 Table Recognition

|model| description                                                                 |inference model size|download|
| --- |-----------------------------------------------------------------------------| --- | --- |
|en_ppocr_mobile_v2.0_table_structure| Table structure model for English table scenes trained on PubTabNet dataset |18.6M|[inference model](https://paddleocr.bj.bcebos.com/dygraph_v2.0/table/en_ppocr_mobile_v2.0_table_structure_infer.tar) / [trained model](https://paddleocr.bj.bcebos.com/dygraph_v2.1/table/en_ppocr_mobile_v2.0_table_structure_train.tar) |

<a name="3"></a>
## 3. VQA

|model| description                                                    |inference model size|download|
| --- |----------------------------------------------------------------| --- | --- |
|ser_LayoutXLM_xfun_zh| SER model trained on the XFUN Chinese dataset, based on LayoutXLM |1.4G|[inference model](https://paddleocr.bj.bcebos.com/pplayout/ser_LayoutXLM_xfun_zh_infer.tar) / [trained model](https://paddleocr.bj.bcebos.com/pplayout/ser_LayoutXLM_xfun_zh.tar) |
|re_LayoutXLM_xfun_zh| RE model trained on the XFUN Chinese dataset, based on LayoutXLM |1.4G|inference model coming soon / [trained model](https://paddleocr.bj.bcebos.com/pplayout/re_LayoutXLM_xfun_zh.tar) |
|ser_LayoutLMv2_xfun_zh| SER model trained on the XFUN Chinese dataset, based on LayoutLMv2 |778M|[inference model](https://paddleocr.bj.bcebos.com/pplayout/ser_LayoutLMv2_xfun_zh_infer.tar) / [trained model](https://paddleocr.bj.bcebos.com/pplayout/ser_LayoutLMv2_xfun_zh.tar) |
|re_LayoutLMv2_xfun_zh| RE model trained on the XFUN Chinese dataset, based on LayoutLMv2 |765M|inference model coming soon / [trained model](https://paddleocr.bj.bcebos.com/pplayout/re_LayoutLMv2_xfun_zh.tar) |
|ser_LayoutLM_xfun_zh| SER model trained on the XFUN Chinese dataset, based on LayoutLM |430M|[inference model](https://paddleocr.bj.bcebos.com/pplayout/ser_LayoutLM_xfun_zh_infer.tar) / [trained model](https://paddleocr.bj.bcebos.com/pplayout/ser_LayoutLM_xfun_zh.tar) |

<a name="4"></a>
## 4. KIE

|model|description|model size|download|
| --- | --- | --- | --- |
|SDMGR|Key information extraction model|78M|inference model coming soon / [trained model](https://paddleocr.bj.bcebos.com/dygraph_v2.1/kie/kie_vgg16.tar)|