From 0d6a48625e7bd1d648fc9a5023ce6950d3fbb77b Mon Sep 17 00:00:00 2001
From: WenmuZhou <572459439@qq.com>
Date: Wed, 27 Apr 2022 19:36:23 +0800
Subject: [PATCH] mv layout and doc vqa dataset to docs/dataset

---
 doc/doc_en/dataset/ocr_datasets_en.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/doc/doc_en/dataset/ocr_datasets_en.md b/doc/doc_en/dataset/ocr_datasets_en.md
index c05fb87d..0b9abd52 100644
--- a/doc/doc_en/dataset/ocr_datasets_en.md
+++ b/doc/doc_en/dataset/ocr_datasets_en.md
@@ -73,7 +73,7 @@ After decompressing the data set and downloading the annotation file, PaddleOCR/
 The text recognition algorithm in PaddleOCR supports two data formats:
 
  - `lmdb` is used to train data sets stored in lmdb format, use [lmdb_dataset.py](../../../ppocr/data/lmdb_dataset.py) to load;
- - `通用数据` is used to train data sets stored in text files, use [simple_dataset.py](../../../ppocr/data/simple_dataset.py) to load.
+ - `common dataset` is used to train data sets stored in text files, use [simple_dataset.py](../../../ppocr/data/simple_dataset.py) to load.
 
 If you want to use your own data for training, please refer to the following to organize your data.
 
-- 
GitLab