Release Note

  • Release PP-Structurev2,with functions and performance fully upgraded, adapted to Chinese scenes, and new support for Layout Recovery and one line command to convert PDF to Word;
  • Layout Analysis optimization: model storage reduced by 95%, while speed increased by 11 times, and the average CPU time-cost is only 41ms;
  • Table Recognition optimization: 3 optimization strategies are designed, and the model accuracy is improved by 6% under comparable time consumption;
  • Key Information Extraction optimization:a visual-independent model structure is designed, the accuracy of semantic entity recognition is increased by 2.8%, and the accuracy of relation extraction is increased by 9.1%.

项目简介

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

🚀 Github 镜像仓库 🚀

源项目地址

https://github.com/PaddlePaddle/PaddleOCR

发行版本 6

PaddleOCRv2.6.0

全部发行版

贡献者 67

全部贡献者

开发语言

  • Python 79.1 %
  • C++ 17.6 %
  • Java 2.6 %
  • CMake 0.5 %
  • Makefile 0.2 %