diff --git a/README.md b/README.md
index f468da74d82443c6804931d432dead17c5b0aecf..202362c60911f6c510ec16c59f6a1f934baa65d3 100644
--- a/README.md
+++ b/README.md
@@ -25,30 +25,26 @@ PaddleOCR aims to create multilingual, awesome, leading, and practical OCR tools
-## Recent updates
+## 📣 Recent updates
 - **🔥2022.8.24 Release PaddleOCR [release/2.6](https://github.com/PaddlePaddle/PaddleOCR/tree/release/2.6)**
   - Release [PP-Structurev2](./ppstructure/),with functions and performance fully upgraded, adapted to Chinese scenes, and new support for [Layout Recovery](./ppstructure/recovery) and **one line command to convert PDF to Word**;
   - [Layout Analysis](./ppstructure/layout) optimization: model storage reduced by 95%, while speed increased by 11 times, and the average CPU time-cost is only 41ms;
   - [Table Recognition](./ppstructure/table) optimization: 3 optimization strategies are designed, and the model accuracy is improved by 6% under comparable time consumption;
   - [Key Information Extraction](./ppstructure/kie) optimization:a visual-independent model structure is designed, the accuracy of semantic entity recognition is increased by 2.8%, and the accuracy of relation extraction is increased by 9.1%.
-
 - **🔥2022.8 Release [OCR scene application collection](./applications/README_en.md)**
   - Release **9 vertical models** such as digital tube, LCD screen, license plate, handwriting recognition model, high-precision SVTR model, etc, covering the main OCR vertical applications in general, manufacturing, finance, and transportation industries.
-
 - **2022.8 Add implementation of [8 cutting-edge algorithms](doc/doc_en/algorithm_overview_en.md)**
   - Text Detection: [FCENet](doc/doc_en/algorithm_det_fcenet_en.md), [DB++](doc/doc_en/algorithm_det_db_en.md)
   - Text Recognition: [ViTSTR](doc/doc_en/algorithm_rec_vitstr_en.md), [ABINet](doc/doc_en/algorithm_rec_abinet_en.md), [VisionLAN](doc/doc_en/algorithm_rec_visionlan_en.md), [SPIN](doc/doc_en/algorithm_rec_spin_en.md), [RobustScanner](doc/doc_en/algorithm_rec_robustscanner_en.md)
   - Table Recognition: [TableMaster](doc/doc_en/algorithm_table_master_en.md)
-
 - **2022.5.9 Release PaddleOCR [release/2.5](https://github.com/PaddlePaddle/PaddleOCR/tree/release/2.5)**
   - Release [PP-OCRv3](./doc/doc_en/ppocr_introduction_en.md#pp-ocrv3): With comparable speed, the effect of Chinese scene is further improved by 5% compared with PP-OCRv2, the effect of English scene is improved by 11%, and the average recognition accuracy of 80 language multilingual models is improved by more than 5%.
   - Release [PPOCRLabelv2](./PPOCRLabel): Add the annotation function for table recognition task, key information extraction task and irregular text image.
   - Release interactive e-book [*"Dive into OCR"*](./doc/doc_en/ocr_book_en.md), covers the cutting-edge theory and code practice of OCR full stack technology.
-
 - [more](./doc/doc_en/update_en.md)
-## Features
+## 🌟 Features
 PaddleOCR support a variety of cutting-edge algorithms related to OCR, and developed industrial featured models/solution [PP-OCR](./doc/doc_en/ppocr_introduction_en.md) and [PP-Structure](./ppstructure/README.md) on this basis, and get through the whole process of data production, model training, compression, inference and deployment.
@@ -59,7 +55,7 @@ PaddleOCR support a variety of cutting-edge algorithms related to OCR, and devel
 > It is recommended to start with the “quick experience” in the document tutorial
-## Quick Experience
+## ⚡ Quick Experience
 - Web online experience for the ultra-lightweight OCR: [Online Experience](https://www.paddlepaddle.org.cn/hub/scene/ocr)
 - Mobile DEMO experience (based on EasyEdge and Paddle-Lite, supports iOS and Android systems): [Sign in to the website to obtain the QR code for installing the App](https://ai.baidu.com/easyedge/app/openSource?from=paddlelite)
@@ -67,22 +63,24 @@ PaddleOCR support a variety of cutting-edge algorithms related to OCR, and devel
-## E-book: *Dive Into OCR*
-- [Dive Into OCR 📚](./doc/doc_en/ocr_book_en.md)
-
+## 📚 E-book: *Dive Into OCR*
+- [Dive Into OCR ](./doc/doc_en/ocr_book_en.md)
-## Community
-- **Join us**👬: Scan the QR code below with your Wechat, you can join the official technical discussion group. Looking forward to your participation.
+## 👫 Community
+- For international developers, we regard [PaddleOCR Discussions](https://github.com/PaddlePaddle/PaddleOCR/discussions) as our international community platform. All ideas and questions can be discussed here in English.
+
+- For Chinese develops, Scan the QR code below with your Wechat, you can join the official technical discussion group. For richer community content, please refer to [中文README](README_ch.md), looking forward to your participation.
- +
+
-## PP-OCR Series Model List(Update on September 8th)
+## 🛠️ PP-OCR Series Model List(Update on September 8th)
 | Model introduction | Model name | Recommended scene | Detection model | Direction classifier | Recognition model |
 | ------------------------------------------------------------ | ---------------------------- | ----------------- | ------------------------------------------------------------ | ------------------------------------------------------------ | ------------------------------------------------------------ |
@@ -97,7 +95,7 @@ PaddleOCR support a variety of cutting-edge algorithms related to OCR, and devel
 - For a new language request, please refer to [Guideline for new language_requests](#language_requests).
 - For structural document analysis models, please refer to [PP-Structure models](./ppstructure/docs/models_list_en.md).
-## Tutorials
+## 📖 Tutorials
 - [Environment Preparation](./doc/doc_en/environment_en.md)
 - [PP-OCR 🔥](./doc/doc_en/ppocr_introduction_en.md)
     - [Quick Start](./doc/doc_en/quickstart_en.md)
@@ -158,7 +156,7 @@ PaddleOCR support a variety of cutting-edge algorithms related to OCR, and devel
-## Visualization [more](./doc/doc_en/visualization_en.md)
+## 👀 Visualization [more](./doc/doc_en/visualization_en.md)
 PP-OCRv3 Chinese model
@@ -197,7 +195,7 @@ PaddleOCR support a variety of cutting-edge algorithms related to OCR, and devel
- +
@@ -222,7 +220,7 @@ PaddleOCR support a variety of cutting-edge algorithms related to OCR, and devel
-## Guideline for New Language Requests
+## 🇺🇳 Guideline for New Language Requests
 If you want to request a new language support, a PR with 1 following files are needed:
@@ -235,5 +233,5 @@ More details, please refer to [Multilingual OCR Development Plan](https://github
-## License
+## 📄 License
 This project is released under Apache 2.0 license
diff --git a/README_ch.md b/README_ch.md
index 34c5837beb4747513b8c867d3d3ce56cbca4bf75..2b7f3a39b7277d40d37589ad5ed3af77fd7dd98b 100755
--- a/README_ch.md
+++ b/README_ch.md
@@ -25,22 +25,34 @@ PaddleOCR旨在打造一套丰富、领先、且实用的OCR工具库,助力
-## 近期更新
+## 📣 近期更新
+
+- 💼 **2022.9.5 飞桨智慧金融行业系列直播课**
+
+  - PaddleOCR发布四大范例:印章弯曲文本检测与识别、扫描版合同关键信息抽取、通用卡证结构化信息提取、中文表格识别与属性分析
+
+  - 9月6日起每周二、周四19点直播,扫码免费加入微信群获取直播链接,与行业专家深度交流
+
+
+ +
 - **🔥2022.8.24 发布 PaddleOCR [release/2.6](https://github.com/PaddlePaddle/PaddleOCR/tree/release/2.6)**
+
   - 发布[PP-Structurev2](./ppstructure/README_ch.md),系统功能性能全面升级,适配中文场景,新增支持[版面复原](./ppstructure/recovery/README_ch.md),支持**一行命令完成PDF转Word**;
   - [版面分析](./ppstructure/layout/README_ch.md)模型优化:模型存储减少95%,速度提升11倍,平均CPU耗时仅需41ms;
   - [表格识别](./ppstructure/table/README_ch.md)模型优化:设计3大优化策略,预测耗时不变情况下,模型精度提升6%;
   - [关键信息抽取](./ppstructure/kie/README_ch.md)模型优化:设计视觉无关模型结构,语义实体识别精度提升2.8%,关系抽取精度提升9.1%。
-
+
 - **🔥2022.8 发布 [OCR场景应用集合](./applications)**
+
   - 包含数码管、液晶屏、车牌、高精度SVTR模型、手写体识别等**9个垂类模型**,覆盖通用,制造、金融、交通行业的主要OCR垂类应用。
-
+
 - **2022.8 新增实现[8种前沿算法](doc/doc_ch/algorithm_overview.md)**
   - 文本检测:[FCENet](doc/doc_ch/algorithm_det_fcenet.md), [DB++](doc/doc_ch/algorithm_det_db.md)
   - 文本识别:[ViTSTR](doc/doc_ch/algorithm_rec_vitstr.md), [ABINet](doc/doc_ch/algorithm_rec_abinet.md), [VisionLAN](doc/doc_ch/algorithm_rec_visionlan.md), [SPIN](doc/doc_ch/algorithm_rec_spin.md), [RobustScanner](doc/doc_ch/algorithm_rec_robustscanner.md)
   - 表格识别:[TableMaster](doc/doc_ch/algorithm_table_master.md)
-
+
 - **2022.5.9 发布 PaddleOCR [release/2.5](https://github.com/PaddlePaddle/PaddleOCR/tree/release/2.5)**
   - 发布[PP-OCRv3](./doc/doc_ch/ppocr_introduction.md#pp-ocrv3),速度可比情况下,中文场景效果相比于PP-OCRv2再提升5%,英文场景提升11%,80语种多语言模型平均识别准确率提升5%以上;
   - 发布半自动标注工具[PPOCRLabelv2](./PPOCRLabel):新增表格文字图像、图像关键信息抽取任务和不规则文字图像的标注功能;
@@ -49,7 +61,7 @@ PaddleOCR旨在打造一套丰富、领先、且实用的OCR工具库,助力
 > [更多](./doc/doc_ch/update.md)
-## 特性
+## 🌟 特性
 支持多种OCR相关前沿算法,在此基础上打造产业级特色模型[PP-OCR](./doc/doc_ch/ppocr_introduction.md)和[PP-Structure](./ppstructure/README_ch.md),并打通数据生产、模型训练、压缩、预测部署全流程。
@@ -60,25 +72,25 @@ PaddleOCR旨在打造一套丰富、领先、且实用的OCR工具库,助力
 > 上述内容的使用方法建议从文档教程中的快速开始体验
-## 快速开始
+## ⚡ 快速开始
 - 在线网站体验:超轻量PP-OCR mobile模型体验地址:https://www.paddlepaddle.org.cn/hub/scene/ocr
 - 移动端demo体验:[安装包DEMO下载地址](https://ai.baidu.com/easyedge/app/openSource?from=paddlelite)(基于EasyEdge和Paddle-Lite, 支持iOS和Android系统)
 - 一行命令快速使用:[快速开始(中英文/多语言/文档分析)](./doc/doc_ch/quickstart.md)
-## 《动手学OCR》电子书
-- [《动手学OCR》电子书📚](./doc/doc_ch/ocr_book.md)
+## 📚《动手学OCR》电子书
+- [《动手学OCR》电子书](./doc/doc_ch/ocr_book.md)
-## 开源社区
-- **项目合作📑:** 如果您是企业开发者且有明确的OCR垂类应用需求,填写[问卷](https://paddle.wjx.cn/vj/QwF7GKw.aspx)后可免费与官方团队展开不同层次的合作。
-- **加入社区👬:** 微信扫描二维码并填写问卷之后,加入交流群领取福利
+## 👫 开源社区
+- **📑项目合作:** 如果您是企业开发者且有明确的OCR垂类应用需求,填写[问卷](https://paddle.wjx.cn/vj/QwF7GKw.aspx)后可免费与官方团队展开不同层次的合作。
+- **👫加入社区:** 微信扫描二维码并填写问卷之后,加入交流群领取福利
   - **获取PaddleOCR最新发版解说《OCR超强技术详解与产业应用实战》系列直播课回放链接**
   - **10G重磅OCR学习大礼包:**《动手学OCR》电子书,配套讲解视频和notebook项目;66篇OCR相关顶会前沿论文打包放送,包括CVPR、AAAI、IJCAI、ICCV等;PaddleOCR历次发版直播课视频;OCR社区优秀开发者项目分享视频。
-- **社区项目**🏅️:[社区项目](./doc/doc_ch/thirdparty.md)文档中包含了社区用户**使用PaddleOCR开发的各种工具、应用**以及**为PaddleOCR贡献的功能、优化的文档与代码**等,是官方为社区开发者打造的荣誉墙,也是帮助优质项目宣传的广播站。
-- **社区常规赛**🎁:社区常规赛是面向OCR开发者的积分赛事,覆盖文档、代码、模型和应用四大类型,以季度为单位评选并发放奖励,赛题详情与报名方法可参考[链接](https://github.com/PaddlePaddle/PaddleOCR/issues/4982)。
+- **🏅️社区项目**:[社区项目](./doc/doc_ch/thirdparty.md)文档中包含了社区用户**使用PaddleOCR开发的各种工具、应用**以及**为PaddleOCR贡献的功能、优化的文档与代码**等,是官方为社区开发者打造的荣誉墙,也是帮助优质项目宣传的广播站。
+- **🎁社区常规赛**:社区常规赛是面向OCR开发者的积分赛事,覆盖文档、代码、模型和应用四大类型,以季度为单位评选并发放奖励,赛题详情与报名方法可参考[链接](https://github.com/PaddlePaddle/PaddleOCR/issues/4982)。
@@ -87,7 +99,7 @@ PaddleOCR旨在打造一套丰富、领先、且实用的OCR工具库,助力
-## PP-OCR系列模型列表(更新中)
+## 🛠️ PP-OCR系列模型列表(更新中)
 | 模型简介 | 模型名称 | 推荐场景 | 检测模型 | 方向分类器 | 识别模型 |
 | ------------------------------------- | ----------------------- | --------------- | ------------------------------------------------------------ | ------------------------------------------------------------ | ------------------------------------------------------------ |
@@ -108,7 +120,7 @@ PaddleOCR旨在打造一套丰富、领先、且实用的OCR工具库,助力
-## 文档教程
+## 📖 文档教程
 - [运行环境准备](./doc/doc_ch/environment.md)
 - [PP-OCR文本检测识别🔥](./doc/doc_ch/ppocr_introduction.md)
@@ -174,7 +186,7 @@ PaddleOCR旨在打造一套丰富、领先、且实用的OCR工具库,助力
-## 效果展示 [more](./doc/doc_ch/visualization.md)
+## 👀 效果展示 [more](./doc/doc_ch/visualization.md)
 PP-OCRv3 中文模型
@@ -225,11 +237,11 @@ PaddleOCR旨在打造一套丰富、领先、且实用的OCR工具库,助力
- +
- + - RE(关系提取)
@@ -242,7 +254,7 @@ PaddleOCR旨在打造一套丰富、领先、且实用的OCR工具库,助力
- +