English | [简体中文](README_ch.md) # PaddleSpeech
ASR Module Type | Dataset | Model Type | Link |
---|---|---|---|
Acoustic Model | Aishell | 2 Conv + 5 LSTM layers with only forward direction | Ds2 Online Aishell Model |
2 Conv + 3 bidirectional GRU layers | Ds2 Offline Aishell Model | ||
Encoder:Conformer, Decoder:Transformer, Decoding method: Attention + CTC | Conformer Offline Aishell Model | ||
Encoder:Conformer, Decoder:Transformer, Decoding method: Attention | Conformer Librispeech Model | ||
Librispeech | Encoder:Conformer, Decoder:Transformer, Decoding method: Attention | Conformer Librispeech Model | |
Encoder:Transformer, Decoder:Transformer, Decoding method: Attention | Transformer Librispeech Model | ||
Language Model | CommonCrawl(en.00) | English Language Model | English Language Model |
Baidu Internal Corpus | Mandarin Language Model Small | Mandarin Language Model Small | |
Mandarin Language Model Large | Mandarin Language Model Large |
TTS Module Type | Model Type | Dataset | Link |
---|---|---|---|
Text Frontend | chinese-fronted | ||
Acoustic Model | Tacotron2 | LJSpeech | tacotron2-vctk |
TransformerTTS | transformer-ljspeech | ||
SpeedySpeech | CSMSC | speedyspeech-csmsc | |
FastSpeech2 | AISHELL-3 | fastspeech2-aishell3 | |
VCTK | fastspeech2-vctk | ||
LJSpeech | fastspeech2-ljspeech | ||
CSMSC | fastspeech2-csmsc | ||
Vocoder | WaveFlow | LJSpeech | waveflow-ljspeech |
Parallel WaveGAN | LJSpeech | PWGAN-ljspeech | |
VCTK | PWGAN-vctk | ||
CSMSC | PWGAN-csmsc | ||
Voice Cloning | GE2E | AISHELL-3, etc. | ge2e |
GE2E + Tactron2 | AISHELL-3 | ge2e-tactron2-aishell3 |