* Note 1: The language option must correspond to the language of the corpus. Currently, the tool supports only English, Simplified Chinese, and Korean.
* Note 2: Synth-Text is mainly used to generate images for OCR recognition models,
so the height of style images should be around 32 pixels. Images of other sizes may perform poorly.
* Note 3: You can modify `use_gpu` in `configs/config.yml` to determine whether to use GPU for prediction.
For example, input the following image and the corpus `PaddleOCR`.
...
@@ -122,7 +123,7 @@ In actual application scenarios, it is often necessary to synthesize pictures in
* `corpus_file`: Path to the corpus file. The corpus file should be a text file, which is split on line endings (`\n`). The corpus generator samples one line at a time.
Example of corpus file:
```
PaddleOCR
飞桨文字识别
```
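
The corpus handling described above can be pictured with a minimal Python sketch. It is an illustration only, not the tool's actual corpus generator: the function names `load_corpus` and `sample_line` are hypothetical, and skipping blank lines and sampling uniformly at random are assumptions made here.

```python
import random
from pathlib import Path


def load_corpus(corpus_file):
    """Read a corpus text file and return its non-empty lines.

    Hypothetical helper: the real tool may handle blank lines differently.
    """
    text = Path(corpus_file).read_text(encoding="utf-8")
    # Split on '\n' as described for `corpus_file`; strip() also removes a
    # trailing '\r' from Windows line endings, and blank lines are dropped.
    return [line.strip() for line in text.split("\n") if line.strip()]


def sample_line(lines, rng=random):
    """Pick one corpus line per call (uniform sampling is an assumption)."""
    return rng.choice(lines)
```

For instance, `sample_line(load_corpus("corpus.txt"))` would return a single line of the file, where `corpus.txt` is a placeholder path.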
...
@@ -139,9 +140,9 @@ We provide a general dataset containing Chinese, English and Korean (50,000 imag
2. Run the following command to start the synthesis task: