README.md 402 字节
Newer Older
H
Hui Zhang 已提交
1 2 3 4 5 6 7 8 9 10
# Speech Application based on PaddleSpeech

The directory containes many speech applications in multi scenarios.

* audio tagging  - tag audio label in vedio  
* metaverse  - 2D AR with TTS  
* speech recogintion - vidio understanding  
* speech translation - end to end speech translation  
* story talker - book reader based on OCR and TTS  
* style_fs2 - multi style control for FastSpeech2 model