diff --git a/StyleText/README.md b/StyleText/README.md
index 648b12674d23a9f413317644cc198fd7fda24bc8..632f7b6daf21f5bde6a81a7f4e9cef0ca5abd8e7 100644
--- a/StyleText/README.md
+++ b/StyleText/README.md
@@ -69,12 +69,13 @@ fusion_generator:
 1. You can run `tools/synth_image` and generate the demo image, which is saved in the current folder.
 ```python
-python3 -m tools.synth_image -c configs/config.yml --style_image examples/style_images/2.jpg --text_corpus PaddleOCR --language en
+python3 tools/synth_image.py -c configs/config.yml --style_image examples/style_images/2.jpg --text_corpus PaddleOCR --language en
 ```
 * Note 1: The language options is correspond to the corpus. Currently, the tool only supports English, Simplified Chinese and Korean.
-* Note 2: Synth-Text is mainly used to generate images for OCR recognition models.
+* Note 2: Synth-Text is mainly used to generate images for OCR recognition models. So the height of style images should be around 32 pixels. Images in other sizes may behave poorly.
+* Note 3: You can modify `use_gpu` in `configs/config.yml` to determine whether to use GPU for prediction.
 For example, enter the following image and corpus `PaddleOCR`.
@@ -122,7 +123,7 @@ In actual application scenarios, it is often necessary to synthesize pictures in
 * `corpus_file`: Filepath of the corpus. Corpus file should be a text file which will be split by line-endings('\n'). Corpus generator samples one line each time.
-Example of corpus file:
+Example of corpus file:
 ```
 PaddleOCR
 飞桨文字识别
@@ -139,9 +140,9 @@ We provide a general dataset containing Chinese, English and Korean (50,000 imag
 2. You can run the following command to start synthesis task:
 ``` bash
- python -m tools.synth_dataset.py -c configs/dataset_config.yml
+ python3 tools/synth_dataset.py -c configs/dataset_config.yml
 ```
-We also provide example corpus and images in `examples` folder.
+We also provide example corpus and images in `examples` folder.
diff --git a/StyleText/README_ch.md b/StyleText/README_ch.md
index 0dd5822b1eac488099477d289dff83a99577b8c9..5b8a3ee0fef321ed9ccee7733a74645234c44a12 100644
--- a/StyleText/README_ch.md
+++ b/StyleText/README_ch.md
@@ -61,11 +61,12 @@ fusion_generator:
 输入一张风格图和一段文字语料,运行tools/synth_image,合成单张图片,结果图像保存在当前目录下:
 ```python
-python3 -m tools.synth_image -c configs/config.yml --style_image examples/style_images/2.jpg --text_corpus PaddleOCR --language en
+python3 tools/synth_image.py -c configs/config.yml --style_image examples/style_images/2.jpg --text_corpus PaddleOCR --language en
 ```
 * 注1:语言选项和语料相对应,目前该工具只支持英文、简体中文和韩语。
 * 注2:Style-Text生成的数据主要应用于OCR识别场景。基于当前PaddleOCR识别模型的设计,我们主要支持高度在32左右的风格图像。 如果输入图像尺寸相差过多,效果可能不佳。
+* 注3:可以通过修改配置文件中的`use_gpu`(true或者false)参数来决定是否使用GPU进行预测。
 例如,输入如下图片和语料"PaddleOCR":
@@ -127,7 +128,7 @@ python3 -m tools.synth_image -c configs/config.yml --style_image examples/style_
 2. 运行`tools/synth_dataset`合成数据:
 ``` bash
- python -m tools.synth_dataset -c configs/dataset_config.yml
+ python tools/synth_dataset.py -c configs/dataset_config.yml
 ```
 我们在examples目录下提供了样例图片和语料。
diff --git a/StyleText/engine/predictors.py b/StyleText/engine/predictors.py
index d9f4afe4a18bd1e0a96ac37aa0359f26434ddb3d..a1ba21f1b6cd084f9f95140d4227d600d4631715 100644
--- a/StyleText/engine/predictors.py
+++ b/StyleText/engine/predictors.py
@@ -28,6 +28,7 @@ class StyleTextRecPredictor(object):
         ], "Generator {} not supported.".format(algorithm)
         use_gpu = config["Global"]['use_gpu']
         check_gpu(use_gpu)
+        paddle.set_device('gpu' if use_gpu else 'cpu')
         self.logger = get_logger()
         self.generator = getattr(style_text_rec, algorithm)(config)
         self.height = config["Global"]["image_height"]
diff --git a/StyleText/tools/synth_dataset.py b/StyleText/tools/synth_dataset.py
index 4a0e6d5e1f701c49558cfe1ea1df61e9b4180a89..a75f7f393b6a0825bc9735e00a50c468f9b4a1ae 100644
--- a/StyleText/tools/synth_dataset.py
+++ b/StyleText/tools/synth_dataset.py
@@ -11,6 +11,14 @@
 # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 # See the License for the specific language governing permissions and
 # limitations under the License.
+
+import os
+import sys
+
+__dir__ = os.path.dirname(os.path.abspath(__file__))
+sys.path.append(__dir__)
+sys.path.append(os.path.abspath(os.path.join(__dir__, '..')))
+
 from engine.synthesisers import DatasetSynthesiser
diff --git a/StyleText/tools/synth_image.py b/StyleText/tools/synth_image.py
index 7b4827b825e4a28dd1fb2eba722d23e64e8ce0be..cbc3118675fd1d628bdfefee295acb4248128bd6 100644
--- a/StyleText/tools/synth_image.py
+++ b/StyleText/tools/synth_image.py
@@ -16,13 +16,13 @@
 import cv2
 import sys
 import glob
-from utils.config import ArgsParser
-from engine.synthesisers import ImageSynthesiser
-
 __dir__ = os.path.dirname(os.path.abspath(__file__))
 sys.path.append(__dir__)
 sys.path.append(os.path.abspath(os.path.join(__dir__, '..')))
+from utils.config import ArgsParser
+from engine.synthesisers import ImageSynthesiser
+
 def synth_image():
     args = ArgsParser().parse_args()