提交 ac1b3016 编写于 作者: H huangyuxin

Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset

......@@ -21,5 +21,6 @@ python:
version: 3.7
install:
- requirements: docs/requirements.txt
- method: setuptools
path: .
system_packages: true
\ No newline at end of file
......@@ -691,7 +691,7 @@ PaddleSpeech 的 **语音合成** 主要包含三个模块:文本前端、声
- 非常感谢 [phecda-xu](https://github.com/phecda-xu)/[PaddleDubbing](https://github.com/phecda-xu/PaddleDubbing) 基于 PaddleSpeech 的 TTS 模型搭建带 GUI 操作界面的配音工具。
- 非常感谢 [jerryuhoo](https://github.com/jerryuhoo)/[VTuberTalk](https://github.com/jerryuhoo/VTuberTalk) 基于 PaddleSpeech 的 TTS GUI 界面和基于 ASR 制作数据集的相关代码。
- 非常感谢 [vpegasus](https://github.com/vpegasus)/[xuesebot](https://github.com/vpegasus/xuesebot) 基于 PaddleSpeech 的 ASR与TTS 设计的可听、说对话机器人
- 非常感谢 [vpegasus](https://github.com/vpegasus)/[xuesebot](https://github.com/vpegasus/xuesebot) 基于 PaddleSpeech 的 ASR 与 TTS 设计的可听、说对话机器人。
此外,PaddleSpeech 依赖于许多开源存储库。有关更多信息,请参阅 [references](./docs/source/reference.md)
......
......@@ -2,7 +2,7 @@ diskcache==5.2.1
dtaidistance==2.3.1
fastapi
librosa==0.8.0
numpy==1.21.0
numpy==1.22.0
pydantic
pymilvus==2.0.1
pymysql
......
([简体中文](./README_cn.md)|English)
# KWS (Keyword Spotting)
## Introduction
KWS(Keyword Spotting) is a technique to recognize keyword from a giving speech audio.
This demo is an implementation to recognize keyword from a specific audio file. It can be done by a single command or a few lines in python using `PaddleSpeech`.
## Usage
### 1. Installation
see [installation](https://github.com/PaddlePaddle/PaddleSpeech/blob/develop/docs/source/install.md).
You can choose one way from easy, meduim and hard to install paddlespeech.
### 2. Prepare Input File
The input of this demo should be a WAV file(`.wav`), and the sample rate must be the same as the model.
Here are sample files for this demo that can be downloaded:
```bash
wget -c https://paddlespeech.bj.bcebos.com/kws/hey_snips.wav https://paddlespeech.bj.bcebos.com/kws/non-keyword.wav
```
### 3. Usage
- Command Line(Recommended)
```bash
paddlespeech kws --input ./hey_snips.wav
paddlespeech kws --input ./non-keyword.wav
```
Usage:
```bash
paddlespeech kws --help
```
Arguments:
- `input`(required): Audio file to recognize.
- `threshold`:Score threshold for kws. Default: `0.8`.
- `model`: Model type of kws task. Default: `mdtc_heysnips`.
- `config`: Config of kws task. Use pretrained model when it is None. Default: `None`.
- `ckpt_path`: Model checkpoint. Use pretrained model when it is None. Default: `None`.
- `device`: Choose device to execute model inference. Default: default device of paddlepaddle in current environment.
- `verbose`: Show the log information.
Output:
```bash
# Input file: ./hey_snips.wav
Score: 1.000, Threshold: 0.8, Is keyword: True
# Input file: ./non-keyword.wav
Score: 0.000, Threshold: 0.8, Is keyword: False
```
- Python API
```python
import paddle
from paddlespeech.cli.kws import KWSExecutor
kws_executor = KWSExecutor()
result = kws_executor(
audio_file='./hey_snips.wav',
threshold=0.8,
model='mdtc_heysnips',
config=None,
ckpt_path=None,
device=paddle.get_device())
print('KWS Result: \n{}'.format(result))
```
Output:
```bash
KWS Result:
Score: 1.000, Threshold: 0.8, Is keyword: True
```
### 4.Pretrained Models
Here is a list of pretrained models released by PaddleSpeech that can be used by command and python API:
| Model | Language | Sample Rate
| :--- | :---: | :---: |
| mdtc_heysnips | en | 16k
(简体中文|[English](./README.md))
# 关键词识别
## 介绍
关键词识别是一项用于识别一段语音内是否包含特定的关键词。
这个 demo 是一个从给定音频文件识别特定关键词的实现,它可以通过使用 `PaddleSpeech` 的单个命令或 python 中的几行代码来实现。
## 使用方法
### 1. 安装
请看[安装文档](https://github.com/PaddlePaddle/PaddleSpeech/blob/develop/docs/source/install_cn.md)
你可以从 easy,medium,hard 三中方式中选择一种方式安装。
### 2. 准备输入
这个 demo 的输入应该是一个 WAV 文件(`.wav`),并且采样率必须与模型的采样率相同。
可以下载此 demo 的示例音频:
```bash
wget -c https://paddlespeech.bj.bcebos.com/kws/hey_snips.wav https://paddlespeech.bj.bcebos.com/kws/non-keyword.wav
```
### 3. 使用方法
- 命令行 (推荐使用)
```bash
paddlespeech kws --input ./hey_snips.wav
paddlespeech kws --input ./non-keyword.wav
```
使用方法:
```bash
paddlespeech kws --help
```
参数:
- `input`(必须输入):用于识别关键词的音频文件。
- `threshold`:用于判别是包含关键词的得分阈值,默认值:`0.8`
- `model`:KWS 任务的模型,默认值:`mdtc_heysnips`
- `config`:KWS 任务的参数文件,若不设置则使用预训练模型中的默认配置,默认值:`None`
- `ckpt_path`:模型参数文件,若不设置则下载预训练模型使用,默认值:`None`
- `device`:执行预测的设备,默认值:当前系统下 paddlepaddle 的默认 device。
- `verbose`: 如果使用,显示 logger 信息。
输出:
```bash
# 输入为 ./hey_snips.wav
Score: 1.000, Threshold: 0.8, Is keyword: True
# 输入为 ./non-keyword.wav
Score: 0.000, Threshold: 0.8, Is keyword: False
```
- Python API
```python
import paddle
from paddlespeech.cli.kws import KWSExecutor
kws_executor = KWSExecutor()
result = kws_executor(
audio_file='./hey_snips.wav',
threshold=0.8,
model='mdtc_heysnips',
config=None,
ckpt_path=None,
device=paddle.get_device())
print('KWS Result: \n{}'.format(result))
```
输出:
```bash
KWS Result:
Score: 1.000, Threshold: 0.8, Is keyword: True
```
### 4.预训练模型
以下是 PaddleSpeech 提供的可以被命令行和 python API 使用的预训练模型列表:
| 模型 | 语言 | 采样率
| :--- | :---: | :---: |
| mdtc_heysnips | en | 16k
#!/bin/bash
wget -c https://paddlespeech.bj.bcebos.com/kws/hey_snips.wav https://paddlespeech.bj.bcebos.com/kws/non-keyword.wav
# kws
paddlespeech kws --input ./hey_snips.wav
paddlespeech kws --input non-keyword.wav
......@@ -24,6 +24,12 @@ PaddleSpeechDemo是一个以PaddleSpeech的语音交互功能为主体开发的D
# 安装环境
cd speech_server
pip install -r requirements.txt
# 下载 ie 模型,针对地点进行微调,效果更好,不下载的话会使用其它版本,效果没有这个好
cd source
mkdir model
cd model
wget https://bj.bcebos.com/paddlenlp/applications/speech-cmd-analysis/finetune/model_state.pdparams
```
......@@ -61,59 +67,6 @@ yarn dev --port 8011
```
默认配置下,前端中配置的后台地址信息是localhost,确保后端服务器和打开页面的游览器在同一台机器上,不在一台机器的配置方式见下方的FAQ:【后端如果部署在其它机器或者别的端口如何修改】
## Docker启动
### 后端docker
后端docker使用[paddlepaddle官方docker](https://www.paddlepaddle.org.cn),这里演示CPU版本
```
# 拉取PaddleSpeech项目
cd PaddleSpeechServer
git clone https://github.com/PaddlePaddle/PaddleSpeech.git
# 拉取镜像
docker pull registry.baidubce.com/paddlepaddle/paddle:2.3.0
# 启动容器
docker run --name paddle -it -p 8010:8010 -v $PWD:/paddle registry.baidubce.com/paddlepaddle/paddle:2.3.0 /bin/bash
# 进入容器
cd /paddle
# 安装依赖
pip install -r requirements
# 启动服务
python main --port 8010
```
### 前端docker
前端docker直接使用[node官方的docker](https://hub.docker.com/_/node)即可
```shell
docker pull node
```
镜像中安装依赖
```shell
cd PaddleSpeechWebClient
# 映射外部8011端口
docker run -it -p 8011:8011 -v $PWD:/paddle node:latest bin/bash
# 进入容器中
cd /paddle
# 安装依赖
yarn install
# 启动前端
yarn dev --port 8011
```
## FAQ
#### Q: 如何安装node.js
......@@ -126,7 +79,7 @@ A:后端的配置地址有分散在两个文件中
修改第一个文件`PaddleSpeechWebClient/vite.config.js`
```json
```
server: {
host: "0.0.0.0",
proxy: {
......@@ -141,7 +94,7 @@ server: {
修改第二个文件`PaddleSpeechWebClient/src/api/API.js`(Websocket代理配置失败,所以需要在这个文件中修改)
```javascript
```
// websocket (这里改成后端所在的接口)
CHAT_SOCKET_RECORD: 'ws://localhost:8010/ws/asr/offlineStream', // ChatBot websocket 接口
ASR_SOCKET_RECORD: 'ws://localhost:8010/ws/asr/onlineStream', // Stream ASR 接口
......
......@@ -30,7 +30,7 @@ from src.robot import Robot
from src.WebsocketManeger import ConnectionManager
from src.SpeechBase.vpr import VPR
from paddlespeech.server.engine.asr.online.asr_engine import PaddleASRConnectionHanddler
from paddlespeech.server.engine.asr.online.python.asr_engine import PaddleASRConnectionHanddler
from paddlespeech.server.utils.audio_process import float2pcm
......
......@@ -145,29 +145,6 @@ class AudioMannger:
def resume(self):
self.is_pause = False
if __name__ == '__main__':
from robot import Robot
chatbot = Robot()
chatbot.init()
audio_manger = AudioMannger(chatbot)
file_list = [
"source/20220418145230qbenc.pcm",
]
for file in file_list:
with open(file, "rb") as f:
pcm_bin = f.read()
print(len(pcm_bin))
asr_ = audio_manger.stream_asr(pcm_bin=pcm_bin)
print(asr_)
print(audio_manger.end())
print(chatbot.speech2text("source/20220418145230zrxia.wav"))
\ No newline at end of file
......@@ -4,8 +4,8 @@ import paddle
import librosa
import soundfile
from paddlespeech.server.engine.asr.online.asr_engine import ASREngine
from paddlespeech.server.engine.asr.online.asr_engine import PaddleASRConnectionHanddler
from paddlespeech.server.engine.asr.online.python.asr_engine import ASREngine
from paddlespeech.server.engine.asr.online.python.asr_engine import PaddleASRConnectionHanddler
from paddlespeech.server.utils.config import get_config
def readWave(samples):
......@@ -59,29 +59,4 @@ class ASR:
self.connection_handler.reset()
return asr_results
if __name__ == '__main__':
config_path = r"../../PaddleSpeech/paddlespeech/server/conf/ws_conformer_application.yaml"
wav_path = r"../../source/demo/demo_16k.wav"
samples, sample_rate = soundfile.read(wav_path, dtype='int16')
asr = ASR(config_path=config_path)
end_result = asr.offlineASR(samples=samples, sample_rate=sample_rate)
print("端到端识别结果:", end_result)
for sub_wav in readWave(samples=samples):
# print(sub_wav)
message = sub_wav.tobytes()
offline_result = asr.onlineASR(message, is_finished=False)
print("流式识别结果: ", offline_result)
offline_result = asr.onlineASR(is_finished=True)
print("流式识别结果: ", offline_result)
\ No newline at end of file
......@@ -20,9 +20,4 @@ class NLP:
result = self.ie_model(text)
return result
if __name__ == '__main__':
ie_model_path = "../../source/model/"
nlp = NLP(ie_model_path=ie_model_path)
text = "今天早上我从大牛坊去百度科技园花了七百块钱"
print(nlp.ie(text))
\ No newline at end of file
......@@ -113,40 +113,4 @@ class DataBase(object):
b = base64.b64decode(vector_base64)
vc = np.frombuffer(b, dtype=dtype)
return vc
if __name__ == '__main__':
db_path = "../../source/db/vpr.sqlite"
db = DataBase(db_path)
# 准备数据
import numpy as np
vector = np.random.randn((192)).astype(np.float32).tobytes()
vector_base64 = base64.b64encode(vector).decode('utf8')
username = "sss"
wav_path = r"../../source/demo/demo_16k.wav"
# 插入数据
db.insert_one(username, vector_base64, wav_path)
# 查询数据
res_all = db.select_all()
print("res_all: ", res_all)
s_id = res_all[0]['id']
res_id = db.select_by_id(s_id)
print("res_id: ", res_id)
res_uername = db.select_by_username(username)
print("res_username: ", res_uername)
# base64还原
b = base64.b64decode(res_uername[0]['vector'])
vc = np.frombuffer(b, dtype=np.float32)
print(vc)
# 删除数据
db.drop_by_username(username)
res_all = db.select_all()
print("删除后 res_all: ", res_all)
db.drop_all()
\ No newline at end of file
......@@ -7,7 +7,8 @@
# 4. 流式推理
import base64
import math
import logging
import numpy as np
from paddlespeech.server.utils.onnx_infer import get_sess
from paddlespeech.t2s.frontend.zh_frontend import Frontend
......@@ -17,14 +18,14 @@ from paddlespeech.server.utils.config import get_config
from paddlespeech.server.engine.tts.online.onnx.tts_engine import TTSEngine
class TTS:
def __init__(self, config_path):
self.config = get_config(config_path)['tts_online-onnx']
self.config['voc_block'] = 36
self.engine = TTSEngine()
self.engine = TTSEngine()
self.engine.init(self.config)
self.engine.warm_up()
self.executor = self.engine.executor
#self.engine.warm_up()
# 前端初始化
self.frontend = Frontend(
......@@ -81,8 +82,105 @@ class TTS:
return wavs
def streamTTS(self, text):
for sub_wav_base64 in self.engine.run(sentence=text):
yield sub_wav_base64
get_tone_ids = False
merge_sentences = False
# front
input_ids = self.frontend.get_input_ids(
text,
merge_sentences=merge_sentences,
get_tone_ids=get_tone_ids)
phone_ids = input_ids["phone_ids"]
for i in range(len(phone_ids)):
part_phone_ids = phone_ids[i].numpy()
voc_chunk_id = 0
# fastspeech2_csmsc
if self.config.am == "fastspeech2_csmsc_onnx":
# am
mel = self.executor.am_sess.run(
output_names=None, input_feed={'text': part_phone_ids})
mel = mel[0]
# voc streaming
mel_chunks = get_chunks(mel, self.config.voc_block, self.config.voc_pad, "voc")
voc_chunk_num = len(mel_chunks)
for i, mel_chunk in enumerate(mel_chunks):
sub_wav = self.executor.voc_sess.run(
output_names=None, input_feed={'logmel': mel_chunk})
sub_wav = self.depadding(sub_wav[0], voc_chunk_num, i,
self.config.voc_block, self.config.voc_pad,
self.config.voc_upsample)
yield self.after_process(sub_wav)
# fastspeech2_cnndecoder_csmsc
elif self.config.am == "fastspeech2_cnndecoder_csmsc_onnx":
# am
orig_hs = self.executor.am_encoder_infer_sess.run(
None, input_feed={'text': part_phone_ids})
orig_hs = orig_hs[0]
# streaming voc chunk info
mel_len = orig_hs.shape[1]
voc_chunk_num = math.ceil(mel_len / self.config.voc_block)
start = 0
end = min(self.config.voc_block + self.config.voc_pad, mel_len)
# streaming am
hss = get_chunks(orig_hs, self.config.am_block, self.config.am_pad, "am")
am_chunk_num = len(hss)
for i, hs in enumerate(hss):
am_decoder_output = self.executor.am_decoder_sess.run(
None, input_feed={'xs': hs})
am_postnet_output = self.executor.am_postnet_sess.run(
None,
input_feed={
'xs': np.transpose(am_decoder_output[0], (0, 2, 1))
})
am_output_data = am_decoder_output + np.transpose(
am_postnet_output[0], (0, 2, 1))
normalized_mel = am_output_data[0][0]
sub_mel = denorm(normalized_mel, self.executor.am_mu,
self.executor.am_std)
sub_mel = self.depadding(sub_mel, am_chunk_num, i,
self.config.am_block, self.config.am_pad, 1)
if i == 0:
mel_streaming = sub_mel
else:
mel_streaming = np.concatenate(
(mel_streaming, sub_mel), axis=0)
# streaming voc
# 当流式AM推理的mel帧数大于流式voc推理的chunk size,开始进行流式voc 推理
while (mel_streaming.shape[0] >= end and
voc_chunk_id < voc_chunk_num):
voc_chunk = mel_streaming[start:end, :]
sub_wav = self.executor.voc_sess.run(
output_names=None, input_feed={'logmel': voc_chunk})
sub_wav = self.depadding(
sub_wav[0], voc_chunk_num, voc_chunk_id,
self.config.voc_block, self.config.voc_pad, self.config.voc_upsample)
yield self.after_process(sub_wav)
voc_chunk_id += 1
start = max(
0, voc_chunk_id * self.config.voc_block - self.config.voc_pad)
end = min(
(voc_chunk_id + 1) * self.config.voc_block + self.config.voc_pad,
mel_len)
else:
logging.error(
"Only support fastspeech2_csmsc or fastspeech2_cnndecoder_csmsc on streaming tts."
)
def streamTTSBytes(self, text):
for wav in self.engine.executor.infer(
......@@ -106,16 +204,6 @@ class TTS:
# 用 TVM 优化
pass
if __name__ == '__main__':
text = "啊哈哈哈哈哈哈啊哈哈哈哈哈哈啊哈哈哈哈哈哈啊哈哈哈哈哈哈啊哈哈哈哈哈哈"
config_path="../../PaddleSpeech/demos/streaming_tts_server/conf/tts_online_application.yaml"
tts = TTS(config_path)
for sub_wav in tts.streamTTS(text):
print("sub_wav_base64: ", len(sub_wav))
end_wav = tts.offlineTTS(text)
print(end_wav)
\ No newline at end of file
......@@ -116,37 +116,3 @@ class VPR:
# 清空 faiss
self.index_ip.reset()
if __name__ == '__main__':
db_path = "../../source/db/vpr.sqlite"
dim = 192
top_k = 5
vpr = VPR(db_path, dim, top_k)
# 准备测试数据
username = "sss"
wav_path = r"../../source/demo/demo_16k.wav"
# 注册声纹
vpr.vpr_enroll(username, wav_path)
# 获取数据
print(vpr.vpr_list())
# 识别声纹
recolist = vpr.vpr_recog(wav_path)
print(recolist)
# 通过 id 获取数据
idx = recolist[0][1]
print(vpr.vpr_data(idx))
# 删除声纹
vpr.vpr_del(username)
vpr.vpr_droptable()
\ No newline at end of file
from paddlespeech.cli import VectorExecutor
from paddlespeech.cli.vector import VectorExecutor
import numpy as np
import logging
......@@ -17,10 +17,4 @@ def get_audio_embedding(path):
logging.error(f"Error with embedding:{e}")
return None
if __name__ == '__main__':
audio_path = r"../../source/demo/demo_16k.wav"
emb = get_audio_embedding(audio_path)
print(emb.shape)
print(emb.dtype)
print(type(emb))
\ No newline at end of file
......@@ -35,7 +35,7 @@ class Robot:
# asr model初始化
self.asr_model(asr_init_path, model=self.asr_name,lang='zh',
sample_rate=16000)
sample_rate=16000, force_yes=True)
def speech2text(self, audio_file):
......@@ -67,27 +67,4 @@ class Robot:
result = self.nlp.ie(text)
return result
if __name__ == '__main__':
tts_config = "../PaddleSpeech/demos/streaming_tts_server/conf/tts_online_application.yaml"
asr_config = "../PaddleSpeech/demos/streaming_asr_server/conf/ws_conformer_application.yaml"
demo_wav = "../source/demo/demo_16k.wav"
ie_model_path = "../source/model"
tts_wav = "../source/demo/tts.wav"
text = "今天天气真不错"
ie_text = "今天晚上我从大牛坊出发去三里屯花了六十五块钱"
robot = Robot(asr_config, tts_config, asr_init_path=demo_wav)
res = robot.speech2text(demo_wav)
print(res)
res = robot.chat(text)
print(res)
print("tts offline")
robot.text2speech(res, tts_wav)
print("ie test")
res = robot.ie(ie_text)
print(res)
\ No newline at end of file
......@@ -5,3 +5,46 @@ sphinx
sphinx-autobuild
sphinx-markdown-tables
sphinx_rtd_theme
paddlepaddle>=2.2.2
editdistance
g2p_en
g2pM
h5py
inflect
jieba
jsonlines
kaldiio
librosa==0.8.1
loguru
matplotlib
nara_wpe
onnxruntime
pandas
paddlenlp
paddlespeech_feat
praatio==5.0.0
pypinyin
pypinyin-dict
python-dateutil
pyworld==0.2.12
resampy==0.2.2
sacrebleu
scipy
sentencepiece~=0.1.96
soundfile~=0.10
textgrid
timer
tqdm
typeguard
visualdl
webrtcvad
yacs~=0.1.8
prettytable
zhon
colorlog
pathos == 0.2.8
fastapi
websockets
keyboard
uvicorn
pattern_singleton
\ No newline at end of file
paddlespeech
============
.. toctree::
:maxdepth: 4
paddlespeech
paddlespeech.audio.backends package
===================================
.. automodule:: paddlespeech.audio.backends
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.audio.backends.soundfile_backend
paddlespeech.audio.backends.sox_backend
paddlespeech.audio.backends.soundfile\_backend module
=====================================================
.. automodule:: paddlespeech.audio.backends.soundfile_backend
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.backends.sox\_backend module
===============================================
.. automodule:: paddlespeech.audio.backends.sox_backend
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.compliance.kaldi module
==========================================
.. automodule:: paddlespeech.audio.compliance.kaldi
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.compliance.librosa module
============================================
.. automodule:: paddlespeech.audio.compliance.librosa
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.compliance package
=====================================
.. automodule:: paddlespeech.audio.compliance
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.audio.compliance.kaldi
paddlespeech.audio.compliance.librosa
paddlespeech.audio.datasets.dataset module
==========================================
.. automodule:: paddlespeech.audio.datasets.dataset
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.datasets.esc50 module
========================================
.. automodule:: paddlespeech.audio.datasets.esc50
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.datasets.gtzan module
========================================
.. automodule:: paddlespeech.audio.datasets.gtzan
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.datasets.hey\_snips module
=============================================
.. automodule:: paddlespeech.audio.datasets.hey_snips
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.datasets.rirs\_noises module
===============================================
.. automodule:: paddlespeech.audio.datasets.rirs_noises
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.datasets package
===================================
.. automodule:: paddlespeech.audio.datasets
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.audio.datasets.dataset
paddlespeech.audio.datasets.esc50
paddlespeech.audio.datasets.gtzan
paddlespeech.audio.datasets.hey_snips
paddlespeech.audio.datasets.rirs_noises
paddlespeech.audio.datasets.tess
paddlespeech.audio.datasets.urban_sound
paddlespeech.audio.datasets.voxceleb
paddlespeech.audio.datasets.tess module
=======================================
.. automodule:: paddlespeech.audio.datasets.tess
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.datasets.urban\_sound module
===============================================
.. automodule:: paddlespeech.audio.datasets.urban_sound
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.datasets.voxceleb module
===========================================
.. automodule:: paddlespeech.audio.datasets.voxceleb
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.features.layers module
=========================================
.. automodule:: paddlespeech.audio.features.layers
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.features package
===================================
.. automodule:: paddlespeech.audio.features
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.audio.features.layers
paddlespeech.audio.functional.functional module
===============================================
.. automodule:: paddlespeech.audio.functional.functional
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.functional package
=====================================
.. automodule:: paddlespeech.audio.functional
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.audio.functional.functional
paddlespeech.audio.functional.window
paddlespeech.audio.functional.window module
===========================================
.. automodule:: paddlespeech.audio.functional.window
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.io package
=============================
.. automodule:: paddlespeech.audio.io
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.metric.eer module
====================================
.. automodule:: paddlespeech.audio.metric.eer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.metric package
=================================
.. automodule:: paddlespeech.audio.metric
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.audio.metric.eer
paddlespeech.audio package
==========================
.. automodule:: paddlespeech.audio
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.audio.backends
paddlespeech.audio.compliance
paddlespeech.audio.datasets
paddlespeech.audio.features
paddlespeech.audio.functional
paddlespeech.audio.io
paddlespeech.audio.metric
paddlespeech.audio.sox_effects
paddlespeech.audio.utils
paddlespeech.audio.sox\_effects package
=======================================
.. automodule:: paddlespeech.audio.sox_effects
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.utils.download module
========================================
.. automodule:: paddlespeech.audio.utils.download
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.utils.error module
=====================================
.. automodule:: paddlespeech.audio.utils.error
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.utils.log module
===================================
.. automodule:: paddlespeech.audio.utils.log
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.utils.numeric module
=======================================
.. automodule:: paddlespeech.audio.utils.numeric
:members:
:undoc-members:
:show-inheritance:
paddlespeech.audio.utils package
================================
.. automodule:: paddlespeech.audio.utils
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.audio.utils.download
paddlespeech.audio.utils.error
paddlespeech.audio.utils.log
paddlespeech.audio.utils.numeric
paddlespeech.audio.utils.time
paddlespeech.audio.utils.time module
====================================
.. automodule:: paddlespeech.audio.utils.time
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.asr.infer module
=================================
.. automodule:: paddlespeech.cli.asr.infer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.asr package
============================
.. automodule:: paddlespeech.cli.asr
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.cli.asr.infer
paddlespeech.cli.base\_commands module
======================================
.. automodule:: paddlespeech.cli.base_commands
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.cls.infer module
=================================
.. automodule:: paddlespeech.cli.cls.infer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.cls package
============================
.. automodule:: paddlespeech.cli.cls
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.cli.cls.infer
paddlespeech.cli.download module
================================
.. automodule:: paddlespeech.cli.download
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.entry module
=============================
.. automodule:: paddlespeech.cli.entry
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.executor module
================================
.. automodule:: paddlespeech.cli.executor
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.kws.infer module
=================================
.. automodule:: paddlespeech.cli.kws.infer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.kws package
============================
.. automodule:: paddlespeech.cli.kws
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.cli.kws.infer
paddlespeech.cli.log module
===========================
.. automodule:: paddlespeech.cli.log
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli package
========================
.. automodule:: paddlespeech.cli
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.cli.asr
paddlespeech.cli.cls
paddlespeech.cli.kws
paddlespeech.cli.st
paddlespeech.cli.text
paddlespeech.cli.tts
paddlespeech.cli.vector
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.cli.base_commands
paddlespeech.cli.download
paddlespeech.cli.entry
paddlespeech.cli.executor
paddlespeech.cli.log
paddlespeech.cli.utils
paddlespeech.cli.st.infer module
================================
.. automodule:: paddlespeech.cli.st.infer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.st package
===========================
.. automodule:: paddlespeech.cli.st
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.cli.st.infer
paddlespeech.cli.text.infer module
==================================
.. automodule:: paddlespeech.cli.text.infer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.text package
=============================
.. automodule:: paddlespeech.cli.text
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.cli.text.infer
paddlespeech.cli.tts.infer module
=================================
.. automodule:: paddlespeech.cli.tts.infer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.tts package
============================
.. automodule:: paddlespeech.cli.tts
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.cli.tts.infer
paddlespeech.cli.utils module
=============================
.. automodule:: paddlespeech.cli.utils
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.vector.infer module
====================================
.. automodule:: paddlespeech.cli.vector.infer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cli.vector package
===============================
.. automodule:: paddlespeech.cli.vector
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.cli.vector.infer
paddlespeech.cls.exps.panns.deploy.predict module
=================================================
.. automodule:: paddlespeech.cls.exps.panns.deploy.predict
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cls.exps.panns.deploy package
==========================================
.. automodule:: paddlespeech.cls.exps.panns.deploy
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.cls.exps.panns.deploy.predict
paddlespeech.cls.exps.panns.export\_model module
================================================
.. automodule:: paddlespeech.cls.exps.panns.export_model
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cls.exps.panns.predict module
==========================================
.. automodule:: paddlespeech.cls.exps.panns.predict
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cls.exps.panns package
===================================
.. automodule:: paddlespeech.cls.exps.panns
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.cls.exps.panns.deploy
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.cls.exps.panns.export_model
paddlespeech.cls.exps.panns.predict
paddlespeech.cls.exps.panns.train
paddlespeech.cls.exps.panns.train module
========================================
.. automodule:: paddlespeech.cls.exps.panns.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cls.exps package
=============================
.. automodule:: paddlespeech.cls.exps
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.cls.exps.panns
paddlespeech.cls.models.panns.classifier module
===============================================
.. automodule:: paddlespeech.cls.models.panns.classifier
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cls.models.panns.panns module
==========================================
.. automodule:: paddlespeech.cls.models.panns.panns
:members:
:undoc-members:
:show-inheritance:
paddlespeech.cls.models.panns package
=====================================
.. automodule:: paddlespeech.cls.models.panns
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.cls.models.panns.classifier
paddlespeech.cls.models.panns.panns
paddlespeech.cls.models package
===============================
.. automodule:: paddlespeech.cls.models
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.cls.models.panns
paddlespeech.cls package
========================
.. automodule:: paddlespeech.cls
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.cls.exps
paddlespeech.cls.models
paddlespeech.kws.models.loss module
===================================
.. automodule:: paddlespeech.kws.models.loss
:members:
:undoc-members:
:show-inheritance:
paddlespeech.kws.models.mdtc module
===================================
.. automodule:: paddlespeech.kws.models.mdtc
:members:
:undoc-members:
:show-inheritance:
paddlespeech.kws.models package
===============================
.. automodule:: paddlespeech.kws.models
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.kws.models.loss
paddlespeech.kws.models.mdtc
paddlespeech.kws package
========================
.. automodule:: paddlespeech.kws
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.kws.models
paddlespeech package
====================
.. automodule:: paddlespeech
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.audio
paddlespeech.cli
paddlespeech.cls
paddlespeech.kws
paddlespeech.s2t
paddlespeech.server
paddlespeech.t2s
paddlespeech.text
paddlespeech.vector
paddlespeech.s2t.decoders.beam\_search.batch\_beam\_search module
=================================================================
.. automodule:: paddlespeech.s2t.decoders.beam_search.batch_beam_search
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders.beam\_search.beam\_search module
==========================================================
.. automodule:: paddlespeech.s2t.decoders.beam_search.beam_search
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders.beam\_search package
==============================================
.. automodule:: paddlespeech.s2t.decoders.beam_search
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.decoders.beam_search.batch_beam_search
paddlespeech.s2t.decoders.beam_search.beam_search
paddlespeech.s2t.decoders.ctcdecoder.decoders\_deprecated module
================================================================
.. automodule:: paddlespeech.s2t.decoders.ctcdecoder.decoders_deprecated
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders.ctcdecoder package
============================================
.. automodule:: paddlespeech.s2t.decoders.ctcdecoder
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.decoders.ctcdecoder.decoders_deprecated
paddlespeech.s2t.decoders.ctcdecoder.scorer_deprecated
paddlespeech.s2t.decoders.ctcdecoder.swig_wrapper
paddlespeech.s2t.decoders.ctcdecoder.scorer\_deprecated module
==============================================================
.. automodule:: paddlespeech.s2t.decoders.ctcdecoder.scorer_deprecated
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders.ctcdecoder.swig\_wrapper module
=========================================================
.. automodule:: paddlespeech.s2t.decoders.ctcdecoder.swig_wrapper
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders.recog module
======================================
.. automodule:: paddlespeech.s2t.decoders.recog
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders.recog\_bin module
===========================================
.. automodule:: paddlespeech.s2t.decoders.recog_bin
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders package
=================================
.. automodule:: paddlespeech.s2t.decoders
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.decoders.beam_search
paddlespeech.s2t.decoders.ctcdecoder
paddlespeech.s2t.decoders.scorers
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.decoders.recog
paddlespeech.s2t.decoders.recog_bin
paddlespeech.s2t.decoders.utils
paddlespeech.s2t.decoders.scorers.ctc module
============================================
.. automodule:: paddlespeech.s2t.decoders.scorers.ctc
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders.scorers.ctc\_prefix\_score module
===========================================================
.. automodule:: paddlespeech.s2t.decoders.scorers.ctc_prefix_score
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders.scorers.length\_bonus module
======================================================
.. automodule:: paddlespeech.s2t.decoders.scorers.length_bonus
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders.scorers.ngram module
==============================================
.. automodule:: paddlespeech.s2t.decoders.scorers.ngram
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders.scorers package
=========================================
.. automodule:: paddlespeech.s2t.decoders.scorers
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.decoders.scorers.ctc
paddlespeech.s2t.decoders.scorers.ctc_prefix_score
paddlespeech.s2t.decoders.scorers.length_bonus
paddlespeech.s2t.decoders.scorers.ngram
paddlespeech.s2t.decoders.scorers.scorer_interface
paddlespeech.s2t.decoders.scorers.scorer\_interface module
==========================================================
.. automodule:: paddlespeech.s2t.decoders.scorers.scorer_interface
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.decoders.utils module
======================================
.. automodule:: paddlespeech.s2t.decoders.utils
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2.bin.deploy.client module
==========================================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin.deploy.client
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2.bin.deploy.record module
==========================================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin.deploy.record
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2.bin.deploy package
====================================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin.deploy
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.deepspeech2.bin.deploy.client
paddlespeech.s2t.exps.deepspeech2.bin.deploy.record
paddlespeech.s2t.exps.deepspeech2.bin.deploy.runtime
paddlespeech.s2t.exps.deepspeech2.bin.deploy.send
paddlespeech.s2t.exps.deepspeech2.bin.deploy.server
paddlespeech.s2t.exps.deepspeech2.bin.deploy.runtime module
===========================================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin.deploy.runtime
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2.bin.deploy.send module
========================================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin.deploy.send
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2.bin.deploy.server module
==========================================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin.deploy.server
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2.bin.export module
===================================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin.export
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2.bin package
=============================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.deepspeech2.bin.deploy
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.deepspeech2.bin.export
paddlespeech.s2t.exps.deepspeech2.bin.test
paddlespeech.s2t.exps.deepspeech2.bin.test_export
paddlespeech.s2t.exps.deepspeech2.bin.test_wav
paddlespeech.s2t.exps.deepspeech2.bin.train
paddlespeech.s2t.exps.deepspeech2.bin.test module
=================================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin.test
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2.bin.test\_export module
=========================================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin.test_export
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2.bin.test\_wav module
======================================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin.test_wav
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2.bin.train module
==================================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.bin.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2.model module
==============================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2.model
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.deepspeech2 package
=========================================
.. automodule:: paddlespeech.s2t.exps.deepspeech2
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.deepspeech2.bin
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.deepspeech2.model
paddlespeech.s2t.exps package
=============================
.. automodule:: paddlespeech.s2t.exps
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.deepspeech2
paddlespeech.s2t.exps.u2
paddlespeech.s2t.exps.u2_kaldi
paddlespeech.s2t.exps.u2_st
paddlespeech.s2t.exps.u2.bin.alignment module
=============================================
.. automodule:: paddlespeech.s2t.exps.u2.bin.alignment
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2.bin.export module
==========================================
.. automodule:: paddlespeech.s2t.exps.u2.bin.export
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2.bin package
====================================
.. automodule:: paddlespeech.s2t.exps.u2.bin
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.u2.bin.alignment
paddlespeech.s2t.exps.u2.bin.export
paddlespeech.s2t.exps.u2.bin.test
paddlespeech.s2t.exps.u2.bin.test_wav
paddlespeech.s2t.exps.u2.bin.train
paddlespeech.s2t.exps.u2.bin.test module
========================================
.. automodule:: paddlespeech.s2t.exps.u2.bin.test
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2.bin.test\_wav module
=============================================
.. automodule:: paddlespeech.s2t.exps.u2.bin.test_wav
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2.bin.train module
=========================================
.. automodule:: paddlespeech.s2t.exps.u2.bin.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2.model module
=====================================
.. automodule:: paddlespeech.s2t.exps.u2.model
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2 package
================================
.. automodule:: paddlespeech.s2t.exps.u2
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.u2.bin
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.u2.model
paddlespeech.s2t.exps.u2.trainer
paddlespeech.s2t.exps.u2.trainer module
=======================================
.. automodule:: paddlespeech.s2t.exps.u2.trainer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2\_kaldi.bin.recog module
================================================
.. automodule:: paddlespeech.s2t.exps.u2_kaldi.bin.recog
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2\_kaldi.bin package
===========================================
.. automodule:: paddlespeech.s2t.exps.u2_kaldi.bin
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.u2_kaldi.bin.recog
paddlespeech.s2t.exps.u2_kaldi.bin.test
paddlespeech.s2t.exps.u2_kaldi.bin.train
paddlespeech.s2t.exps.u2\_kaldi.bin.test module
===============================================
.. automodule:: paddlespeech.s2t.exps.u2_kaldi.bin.test
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2\_kaldi.bin.train module
================================================
.. automodule:: paddlespeech.s2t.exps.u2_kaldi.bin.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2\_kaldi.model module
============================================
.. automodule:: paddlespeech.s2t.exps.u2_kaldi.model
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2\_kaldi package
=======================================
.. automodule:: paddlespeech.s2t.exps.u2_kaldi
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.u2_kaldi.bin
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.u2_kaldi.model
paddlespeech.s2t.exps.u2\_st.bin.export module
==============================================
.. automodule:: paddlespeech.s2t.exps.u2_st.bin.export
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2\_st.bin package
========================================
.. automodule:: paddlespeech.s2t.exps.u2_st.bin
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.u2_st.bin.export
paddlespeech.s2t.exps.u2_st.bin.test
paddlespeech.s2t.exps.u2_st.bin.train
paddlespeech.s2t.exps.u2\_st.bin.test module
============================================
.. automodule:: paddlespeech.s2t.exps.u2_st.bin.test
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2\_st.bin.train module
=============================================
.. automodule:: paddlespeech.s2t.exps.u2_st.bin.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2\_st.model module
=========================================
.. automodule:: paddlespeech.s2t.exps.u2_st.model
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.exps.u2\_st package
====================================
.. automodule:: paddlespeech.s2t.exps.u2_st
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.u2_st.bin
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.exps.u2_st.model
paddlespeech.s2t.frontend.audio module
======================================
.. automodule:: paddlespeech.s2t.frontend.audio
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.augmentor.augmentation module
=======================================================
.. automodule:: paddlespeech.s2t.frontend.augmentor.augmentation
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.augmentor.base module
===============================================
.. automodule:: paddlespeech.s2t.frontend.augmentor.base
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.augmentor.impulse\_response module
============================================================
.. automodule:: paddlespeech.s2t.frontend.augmentor.impulse_response
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.augmentor.noise\_perturb module
=========================================================
.. automodule:: paddlespeech.s2t.frontend.augmentor.noise_perturb
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.augmentor.online\_bayesian\_normalization module
==========================================================================
.. automodule:: paddlespeech.s2t.frontend.augmentor.online_bayesian_normalization
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.augmentor.resample module
===================================================
.. automodule:: paddlespeech.s2t.frontend.augmentor.resample
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.augmentor package
===========================================
.. automodule:: paddlespeech.s2t.frontend.augmentor
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.frontend.augmentor.augmentation
paddlespeech.s2t.frontend.augmentor.base
paddlespeech.s2t.frontend.augmentor.impulse_response
paddlespeech.s2t.frontend.augmentor.noise_perturb
paddlespeech.s2t.frontend.augmentor.online_bayesian_normalization
paddlespeech.s2t.frontend.augmentor.resample
paddlespeech.s2t.frontend.augmentor.shift_perturb
paddlespeech.s2t.frontend.augmentor.spec_augment
paddlespeech.s2t.frontend.augmentor.speed_perturb
paddlespeech.s2t.frontend.augmentor.volume_perturb
paddlespeech.s2t.frontend.augmentor.shift\_perturb module
=========================================================
.. automodule:: paddlespeech.s2t.frontend.augmentor.shift_perturb
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.augmentor.spec\_augment module
========================================================
.. automodule:: paddlespeech.s2t.frontend.augmentor.spec_augment
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.augmentor.speed\_perturb module
=========================================================
.. automodule:: paddlespeech.s2t.frontend.augmentor.speed_perturb
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.augmentor.volume\_perturb module
==========================================================
.. automodule:: paddlespeech.s2t.frontend.augmentor.volume_perturb
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.featurizer.audio\_featurizer module
=============================================================
.. automodule:: paddlespeech.s2t.frontend.featurizer.audio_featurizer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.featurizer package
============================================
.. automodule:: paddlespeech.s2t.frontend.featurizer
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.frontend.featurizer.audio_featurizer
paddlespeech.s2t.frontend.featurizer.speech_featurizer
paddlespeech.s2t.frontend.featurizer.text_featurizer
paddlespeech.s2t.frontend.featurizer.speech\_featurizer module
==============================================================
.. automodule:: paddlespeech.s2t.frontend.featurizer.speech_featurizer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.featurizer.text\_featurizer module
============================================================
.. automodule:: paddlespeech.s2t.frontend.featurizer.text_featurizer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.normalizer module
===========================================
.. automodule:: paddlespeech.s2t.frontend.normalizer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend package
=================================
.. automodule:: paddlespeech.s2t.frontend
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.frontend.augmentor
paddlespeech.s2t.frontend.featurizer
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.frontend.audio
paddlespeech.s2t.frontend.normalizer
paddlespeech.s2t.frontend.speech
paddlespeech.s2t.frontend.utility
paddlespeech.s2t.frontend.speech module
=======================================
.. automodule:: paddlespeech.s2t.frontend.speech
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.frontend.utility module
========================================
.. automodule:: paddlespeech.s2t.frontend.utility
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.io.batchfy module
==================================
.. automodule:: paddlespeech.s2t.io.batchfy
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.io.collator module
===================================
.. automodule:: paddlespeech.s2t.io.collator
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.io.converter module
====================================
.. automodule:: paddlespeech.s2t.io.converter
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.io.dataloader module
=====================================
.. automodule:: paddlespeech.s2t.io.dataloader
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.io.dataset module
==================================
.. automodule:: paddlespeech.s2t.io.dataset
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.io.reader module
=================================
.. automodule:: paddlespeech.s2t.io.reader
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.io package
===========================
.. automodule:: paddlespeech.s2t.io
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.io.batchfy
paddlespeech.s2t.io.collator
paddlespeech.s2t.io.converter
paddlespeech.s2t.io.dataloader
paddlespeech.s2t.io.dataset
paddlespeech.s2t.io.reader
paddlespeech.s2t.io.sampler
paddlespeech.s2t.io.utility
paddlespeech.s2t.io.sampler module
==================================
.. automodule:: paddlespeech.s2t.io.sampler
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.io.utility module
==================================
.. automodule:: paddlespeech.s2t.io.utility
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.models.asr\_interface module
=============================================
.. automodule:: paddlespeech.s2t.models.asr_interface
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.models.ds2.conv module
=======================================
.. automodule:: paddlespeech.s2t.models.ds2.conv
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.models.ds2.deepspeech2 module
==============================================
.. automodule:: paddlespeech.s2t.models.ds2.deepspeech2
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.models.ds2 package
===================================
.. automodule:: paddlespeech.s2t.models.ds2
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.models.ds2.conv
paddlespeech.s2t.models.ds2.deepspeech2
paddlespeech.s2t.models.lm.dataset module
=========================================
.. automodule:: paddlespeech.s2t.models.lm.dataset
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.models.lm package
==================================
.. automodule:: paddlespeech.s2t.models.lm
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.models.lm.dataset
paddlespeech.s2t.models.lm.transformer
paddlespeech.s2t.models.lm.transformer module
=============================================
.. automodule:: paddlespeech.s2t.models.lm.transformer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.models.lm\_interface module
============================================
.. automodule:: paddlespeech.s2t.models.lm_interface
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.models package
===============================
.. automodule:: paddlespeech.s2t.models
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.models.ds2
paddlespeech.s2t.models.lm
paddlespeech.s2t.models.u2
paddlespeech.s2t.models.u2_st
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.models.asr_interface
paddlespeech.s2t.models.lm_interface
paddlespeech.s2t.models.st_interface
paddlespeech.s2t.models.st\_interface module
============================================
.. automodule:: paddlespeech.s2t.models.st_interface
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.models.u2 package
==================================
.. automodule:: paddlespeech.s2t.models.u2
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.models.u2.u2
paddlespeech.s2t.models.u2.updater
paddlespeech.s2t.models.u2.u2 module
====================================
.. automodule:: paddlespeech.s2t.models.u2.u2
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.models.u2.updater module
=========================================
.. automodule:: paddlespeech.s2t.models.u2.updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.models.u2\_st package
======================================
.. automodule:: paddlespeech.s2t.models.u2_st
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.models.u2_st.u2_st
paddlespeech.s2t.models.u2\_st.u2\_st module
============================================
.. automodule:: paddlespeech.s2t.models.u2_st.u2_st
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.activation module
==========================================
.. automodule:: paddlespeech.s2t.modules.activation
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.align module
=====================================
.. automodule:: paddlespeech.s2t.modules.align
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.attention module
=========================================
.. automodule:: paddlespeech.s2t.modules.attention
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.cmvn module
====================================
.. automodule:: paddlespeech.s2t.modules.cmvn
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.conformer\_convolution module
======================================================
.. automodule:: paddlespeech.s2t.modules.conformer_convolution
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.crf module
===================================
.. automodule:: paddlespeech.s2t.modules.crf
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.ctc module
===================================
.. automodule:: paddlespeech.s2t.modules.ctc
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.decoder module
=======================================
.. automodule:: paddlespeech.s2t.modules.decoder
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.decoder\_layer module
==============================================
.. automodule:: paddlespeech.s2t.modules.decoder_layer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.embedding module
=========================================
.. automodule:: paddlespeech.s2t.modules.embedding
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.encoder module
=======================================
.. automodule:: paddlespeech.s2t.modules.encoder
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.encoder\_layer module
==============================================
.. automodule:: paddlespeech.s2t.modules.encoder_layer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.initializer module
===========================================
.. automodule:: paddlespeech.s2t.modules.initializer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.loss module
====================================
.. automodule:: paddlespeech.s2t.modules.loss
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.mask module
====================================
.. automodule:: paddlespeech.s2t.modules.mask
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules.positionwise\_feed\_forward module
===========================================================
.. automodule:: paddlespeech.s2t.modules.positionwise_feed_forward
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.modules package
================================
.. automodule:: paddlespeech.s2t.modules
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.modules.activation
paddlespeech.s2t.modules.align
paddlespeech.s2t.modules.attention
paddlespeech.s2t.modules.cmvn
paddlespeech.s2t.modules.conformer_convolution
paddlespeech.s2t.modules.crf
paddlespeech.s2t.modules.ctc
paddlespeech.s2t.modules.decoder
paddlespeech.s2t.modules.decoder_layer
paddlespeech.s2t.modules.embedding
paddlespeech.s2t.modules.encoder
paddlespeech.s2t.modules.encoder_layer
paddlespeech.s2t.modules.initializer
paddlespeech.s2t.modules.loss
paddlespeech.s2t.modules.mask
paddlespeech.s2t.modules.positionwise_feed_forward
paddlespeech.s2t.modules.subsampling
paddlespeech.s2t.modules.subsampling module
===========================================
.. automodule:: paddlespeech.s2t.modules.subsampling
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t package
========================
.. automodule:: paddlespeech.s2t
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.decoders
paddlespeech.s2t.exps
paddlespeech.s2t.frontend
paddlespeech.s2t.io
paddlespeech.s2t.models
paddlespeech.s2t.modules
paddlespeech.s2t.training
paddlespeech.s2t.transform
paddlespeech.s2t.utils
paddlespeech.s2t.training.cli module
====================================
.. automodule:: paddlespeech.s2t.training.cli
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.extensions.evaluator module
=====================================================
.. automodule:: paddlespeech.s2t.training.extensions.evaluator
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.extensions.extension module
=====================================================
.. automodule:: paddlespeech.s2t.training.extensions.extension
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.extensions.plot module
================================================
.. automodule:: paddlespeech.s2t.training.extensions.plot
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.extensions package
============================================
.. automodule:: paddlespeech.s2t.training.extensions
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.training.extensions.evaluator
paddlespeech.s2t.training.extensions.extension
paddlespeech.s2t.training.extensions.plot
paddlespeech.s2t.training.extensions.snapshot
paddlespeech.s2t.training.extensions.visualizer
paddlespeech.s2t.training.extensions.snapshot module
====================================================
.. automodule:: paddlespeech.s2t.training.extensions.snapshot
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.extensions.visualizer module
======================================================
.. automodule:: paddlespeech.s2t.training.extensions.visualizer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.gradclip module
=========================================
.. automodule:: paddlespeech.s2t.training.gradclip
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.optimizer module
==========================================
.. automodule:: paddlespeech.s2t.training.optimizer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.reporter module
=========================================
.. automodule:: paddlespeech.s2t.training.reporter
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training package
=================================
.. automodule:: paddlespeech.s2t.training
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.training.extensions
paddlespeech.s2t.training.triggers
paddlespeech.s2t.training.updaters
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.training.cli
paddlespeech.s2t.training.gradclip
paddlespeech.s2t.training.optimizer
paddlespeech.s2t.training.reporter
paddlespeech.s2t.training.scheduler
paddlespeech.s2t.training.timer
paddlespeech.s2t.training.trainer
paddlespeech.s2t.training.scheduler module
==========================================
.. automodule:: paddlespeech.s2t.training.scheduler
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.timer module
======================================
.. automodule:: paddlespeech.s2t.training.timer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.trainer module
========================================
.. automodule:: paddlespeech.s2t.training.trainer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.triggers.compare\_value\_trigger module
=================================================================
.. automodule:: paddlespeech.s2t.training.triggers.compare_value_trigger
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.triggers.interval\_trigger module
===========================================================
.. automodule:: paddlespeech.s2t.training.triggers.interval_trigger
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.triggers.limit\_trigger module
========================================================
.. automodule:: paddlespeech.s2t.training.triggers.limit_trigger
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.triggers package
==========================================
.. automodule:: paddlespeech.s2t.training.triggers
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.training.triggers.compare_value_trigger
paddlespeech.s2t.training.triggers.interval_trigger
paddlespeech.s2t.training.triggers.limit_trigger
paddlespeech.s2t.training.triggers.time_trigger
paddlespeech.s2t.training.triggers.utils
paddlespeech.s2t.training.triggers.time\_trigger module
=======================================================
.. automodule:: paddlespeech.s2t.training.triggers.time_trigger
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.triggers.utils module
===============================================
.. automodule:: paddlespeech.s2t.training.triggers.utils
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.updaters package
==========================================
.. automodule:: paddlespeech.s2t.training.updaters
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.training.updaters.standard_updater
paddlespeech.s2t.training.updaters.trainer
paddlespeech.s2t.training.updaters.updater
paddlespeech.s2t.training.updaters.standard\_updater module
===========================================================
.. automodule:: paddlespeech.s2t.training.updaters.standard_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.updaters.trainer module
=================================================
.. automodule:: paddlespeech.s2t.training.updaters.trainer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.training.updaters.updater module
=================================================
.. automodule:: paddlespeech.s2t.training.updaters.updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.transform.add\_deltas module
=============================================
.. automodule:: paddlespeech.s2t.transform.add_deltas
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.transform.channel\_selector module
===================================================
.. automodule:: paddlespeech.s2t.transform.channel_selector
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.transform.cmvn module
======================================
.. automodule:: paddlespeech.s2t.transform.cmvn
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.transform.functional module
============================================
.. automodule:: paddlespeech.s2t.transform.functional
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.transform.perturb module
=========================================
.. automodule:: paddlespeech.s2t.transform.perturb
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.transform package
==================================
.. automodule:: paddlespeech.s2t.transform
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.transform.add_deltas
paddlespeech.s2t.transform.channel_selector
paddlespeech.s2t.transform.cmvn
paddlespeech.s2t.transform.functional
paddlespeech.s2t.transform.perturb
paddlespeech.s2t.transform.spec_augment
paddlespeech.s2t.transform.spectrogram
paddlespeech.s2t.transform.transform_interface
paddlespeech.s2t.transform.transformation
paddlespeech.s2t.transform.wpe
paddlespeech.s2t.transform.spec\_augment module
===============================================
.. automodule:: paddlespeech.s2t.transform.spec_augment
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.transform.spectrogram module
=============================================
.. automodule:: paddlespeech.s2t.transform.spectrogram
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.transform.transform\_interface module
======================================================
.. automodule:: paddlespeech.s2t.transform.transform_interface
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.transform.transformation module
================================================
.. automodule:: paddlespeech.s2t.transform.transformation
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.transform.wpe module
=====================================
.. automodule:: paddlespeech.s2t.transform.wpe
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.asr\_utils module
========================================
.. automodule:: paddlespeech.s2t.utils.asr_utils
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.bleu\_score module
=========================================
.. automodule:: paddlespeech.s2t.utils.bleu_score
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.check\_kwargs module
===========================================
.. automodule:: paddlespeech.s2t.utils.check_kwargs
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.checkpoint module
========================================
.. automodule:: paddlespeech.s2t.utils.checkpoint
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.cli\_readers module
==========================================
.. automodule:: paddlespeech.s2t.utils.cli_readers
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.cli\_utils module
========================================
.. automodule:: paddlespeech.s2t.utils.cli_utils
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.cli\_writers module
==========================================
.. automodule:: paddlespeech.s2t.utils.cli_writers
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.ctc\_utils module
========================================
.. automodule:: paddlespeech.s2t.utils.ctc_utils
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.dynamic\_import module
=============================================
.. automodule:: paddlespeech.s2t.utils.dynamic_import
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.dynamic\_pip\_install module
===================================================
.. automodule:: paddlespeech.s2t.utils.dynamic_pip_install
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.error\_rate module
=========================================
.. automodule:: paddlespeech.s2t.utils.error_rate
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.layer\_tools module
==========================================
.. automodule:: paddlespeech.s2t.utils.layer_tools
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.log module
=================================
.. automodule:: paddlespeech.s2t.utils.log
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.mp\_tools module
=======================================
.. automodule:: paddlespeech.s2t.utils.mp_tools
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.profiler module
======================================
.. automodule:: paddlespeech.s2t.utils.profiler
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils package
==============================
.. automodule:: paddlespeech.s2t.utils
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.s2t.utils.asr_utils
paddlespeech.s2t.utils.bleu_score
paddlespeech.s2t.utils.check_kwargs
paddlespeech.s2t.utils.checkpoint
paddlespeech.s2t.utils.cli_readers
paddlespeech.s2t.utils.cli_utils
paddlespeech.s2t.utils.cli_writers
paddlespeech.s2t.utils.ctc_utils
paddlespeech.s2t.utils.dynamic_import
paddlespeech.s2t.utils.dynamic_pip_install
paddlespeech.s2t.utils.error_rate
paddlespeech.s2t.utils.layer_tools
paddlespeech.s2t.utils.log
paddlespeech.s2t.utils.mp_tools
paddlespeech.s2t.utils.profiler
paddlespeech.s2t.utils.socket_server
paddlespeech.s2t.utils.spec_augment
paddlespeech.s2t.utils.tensor_utils
paddlespeech.s2t.utils.text_grid
paddlespeech.s2t.utils.utility
paddlespeech.s2t.utils.socket\_server module
============================================
.. automodule:: paddlespeech.s2t.utils.socket_server
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.spec\_augment module
===========================================
.. automodule:: paddlespeech.s2t.utils.spec_augment
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.tensor\_utils module
===========================================
.. automodule:: paddlespeech.s2t.utils.tensor_utils
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.text\_grid module
========================================
.. automodule:: paddlespeech.s2t.utils.text_grid
:members:
:undoc-members:
:show-inheritance:
paddlespeech.s2t.utils.utility module
=====================================
.. automodule:: paddlespeech.s2t.utils.utility
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.base\_commands module
=========================================
.. automodule:: paddlespeech.server.base_commands
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.bin.paddlespeech\_client module
===================================================
.. automodule:: paddlespeech.server.bin.paddlespeech_client
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.bin.paddlespeech\_server module
===================================================
.. automodule:: paddlespeech.server.bin.paddlespeech_server
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.bin package
===============================
.. automodule:: paddlespeech.server.bin
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.bin.paddlespeech_client
paddlespeech.server.bin.paddlespeech_server
paddlespeech.server.engine.acs.python.acs\_engine module
========================================================
.. automodule:: paddlespeech.server.engine.acs.python.acs_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.acs.python package
=============================================
.. automodule:: paddlespeech.server.engine.acs.python
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.acs.python.acs_engine
paddlespeech.server.engine.acs package
======================================
.. automodule:: paddlespeech.server.engine.acs
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.acs.python
paddlespeech.server.engine.asr.online.ctc\_endpoint module
==========================================================
.. automodule:: paddlespeech.server.engine.asr.online.ctc_endpoint
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.asr.online.ctc\_search module
========================================================
.. automodule:: paddlespeech.server.engine.asr.online.ctc_search
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.asr.online.onnx.asr\_engine module
=============================================================
.. automodule:: paddlespeech.server.engine.asr.online.onnx.asr_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.asr.online.onnx package
==================================================
.. automodule:: paddlespeech.server.engine.asr.online.onnx
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.asr.online.onnx.asr_engine
paddlespeech.server.engine.asr.online.paddleinference.asr\_engine module
========================================================================
.. automodule:: paddlespeech.server.engine.asr.online.paddleinference.asr_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.asr.online.paddleinference package
=============================================================
.. automodule:: paddlespeech.server.engine.asr.online.paddleinference
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.asr.online.paddleinference.asr_engine
paddlespeech.server.engine.asr.online.python.asr\_engine module
===============================================================
.. automodule:: paddlespeech.server.engine.asr.online.python.asr_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.asr.online.python package
====================================================
.. automodule:: paddlespeech.server.engine.asr.online.python
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.asr.online.python.asr_engine
paddlespeech.server.engine.asr.online package
=============================================
.. automodule:: paddlespeech.server.engine.asr.online
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.asr.online.onnx
paddlespeech.server.engine.asr.online.paddleinference
paddlespeech.server.engine.asr.online.python
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.asr.online.ctc_endpoint
paddlespeech.server.engine.asr.online.ctc_search
paddlespeech.server.engine.asr.paddleinference.asr\_engine module
=================================================================
.. automodule:: paddlespeech.server.engine.asr.paddleinference.asr_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.asr.paddleinference package
======================================================
.. automodule:: paddlespeech.server.engine.asr.paddleinference
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.asr.paddleinference.asr_engine
paddlespeech.server.engine.asr.python.asr\_engine module
========================================================
.. automodule:: paddlespeech.server.engine.asr.python.asr_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.asr.python package
=============================================
.. automodule:: paddlespeech.server.engine.asr.python
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.asr.python.asr_engine
paddlespeech.server.engine.asr package
======================================
.. automodule:: paddlespeech.server.engine.asr
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.asr.online
paddlespeech.server.engine.asr.paddleinference
paddlespeech.server.engine.asr.python
paddlespeech.server.engine.base\_engine module
==============================================
.. automodule:: paddlespeech.server.engine.base_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.cls.paddleinference.cls\_engine module
=================================================================
.. automodule:: paddlespeech.server.engine.cls.paddleinference.cls_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.cls.paddleinference package
======================================================
.. automodule:: paddlespeech.server.engine.cls.paddleinference
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.cls.paddleinference.cls_engine
paddlespeech.server.engine.cls.python.cls\_engine module
========================================================
.. automodule:: paddlespeech.server.engine.cls.python.cls_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.cls.python package
=============================================
.. automodule:: paddlespeech.server.engine.cls.python
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.cls.python.cls_engine
paddlespeech.server.engine.cls package
======================================
.. automodule:: paddlespeech.server.engine.cls
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.cls.paddleinference
paddlespeech.server.engine.cls.python
paddlespeech.server.engine.engine\_factory module
=================================================
.. automodule:: paddlespeech.server.engine.engine_factory
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.engine\_pool module
==============================================
.. automodule:: paddlespeech.server.engine.engine_pool
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.engine\_warmup module
================================================
.. automodule:: paddlespeech.server.engine.engine_warmup
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine package
==================================
.. automodule:: paddlespeech.server.engine
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.acs
paddlespeech.server.engine.asr
paddlespeech.server.engine.cls
paddlespeech.server.engine.text
paddlespeech.server.engine.tts
paddlespeech.server.engine.vector
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.base_engine
paddlespeech.server.engine.engine_factory
paddlespeech.server.engine.engine_pool
paddlespeech.server.engine.engine_warmup
paddlespeech.server.engine.text.python package
==============================================
.. automodule:: paddlespeech.server.engine.text.python
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.text.python.text_engine
paddlespeech.server.engine.text.python.text\_engine module
==========================================================
.. automodule:: paddlespeech.server.engine.text.python.text_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.text package
=======================================
.. automodule:: paddlespeech.server.engine.text
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.text.python
paddlespeech.server.engine.tts.online.onnx package
==================================================
.. automodule:: paddlespeech.server.engine.tts.online.onnx
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.tts.online.onnx.tts_engine
paddlespeech.server.engine.tts.online.onnx.tts\_engine module
=============================================================
.. automodule:: paddlespeech.server.engine.tts.online.onnx.tts_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.tts.online.python package
====================================================
.. automodule:: paddlespeech.server.engine.tts.online.python
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.tts.online.python.tts_engine
paddlespeech.server.engine.tts.online.python.tts\_engine module
===============================================================
.. automodule:: paddlespeech.server.engine.tts.online.python.tts_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.tts.online package
=============================================
.. automodule:: paddlespeech.server.engine.tts.online
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.tts.online.onnx
paddlespeech.server.engine.tts.online.python
paddlespeech.server.engine.tts.paddleinference package
======================================================
.. automodule:: paddlespeech.server.engine.tts.paddleinference
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.tts.paddleinference.tts_engine
paddlespeech.server.engine.tts.paddleinference.tts\_engine module
=================================================================
.. automodule:: paddlespeech.server.engine.tts.paddleinference.tts_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.tts.python package
=============================================
.. automodule:: paddlespeech.server.engine.tts.python
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.tts.python.tts_engine
paddlespeech.server.engine.tts.python.tts\_engine module
========================================================
.. automodule:: paddlespeech.server.engine.tts.python.tts_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.tts package
======================================
.. automodule:: paddlespeech.server.engine.tts
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.tts.online
paddlespeech.server.engine.tts.paddleinference
paddlespeech.server.engine.tts.python
paddlespeech.server.engine.vector.python package
================================================
.. automodule:: paddlespeech.server.engine.vector.python
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.vector.python.vector_engine
paddlespeech.server.engine.vector.python.vector\_engine module
==============================================================
.. automodule:: paddlespeech.server.engine.vector.python.vector_engine
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.engine.vector package
=========================================
.. automodule:: paddlespeech.server.engine.vector
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.engine.vector.python
paddlespeech.server.entry module
================================
.. automodule:: paddlespeech.server.entry
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.executor module
===================================
.. automodule:: paddlespeech.server.executor
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.restful.acs\_api module
===========================================
.. automodule:: paddlespeech.server.restful.acs_api
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.restful.api module
======================================
.. automodule:: paddlespeech.server.restful.api
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.restful.asr\_api module
===========================================
.. automodule:: paddlespeech.server.restful.asr_api
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.restful.cls\_api module
===========================================
.. automodule:: paddlespeech.server.restful.cls_api
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.restful.request module
==========================================
.. automodule:: paddlespeech.server.restful.request
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.restful.response module
===========================================
.. automodule:: paddlespeech.server.restful.response
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.restful package
===================================
.. automodule:: paddlespeech.server.restful
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.restful.acs_api
paddlespeech.server.restful.api
paddlespeech.server.restful.asr_api
paddlespeech.server.restful.cls_api
paddlespeech.server.restful.request
paddlespeech.server.restful.response
paddlespeech.server.restful.text_api
paddlespeech.server.restful.tts_api
paddlespeech.server.restful.vector_api
paddlespeech.server.restful.text\_api module
============================================
.. automodule:: paddlespeech.server.restful.text_api
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.restful.tts\_api module
===========================================
.. automodule:: paddlespeech.server.restful.tts_api
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.restful.vector\_api module
==============================================
.. automodule:: paddlespeech.server.restful.vector_api
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server package
===========================
.. automodule:: paddlespeech.server
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.bin
paddlespeech.server.engine
paddlespeech.server.restful
paddlespeech.server.tests
paddlespeech.server.utils
paddlespeech.server.ws
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.base_commands
paddlespeech.server.entry
paddlespeech.server.executor
paddlespeech.server.util
paddlespeech.server.tests.asr.offline.http\_client module
=========================================================
.. automodule:: paddlespeech.server.tests.asr.offline.http_client
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.tests.asr.offline package
=============================================
.. automodule:: paddlespeech.server.tests.asr.offline
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.tests.asr.offline.http_client
paddlespeech.server.tests.asr package
=====================================
.. automodule:: paddlespeech.server.tests.asr
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.tests.asr.offline
paddlespeech.server.tests package
=================================
.. automodule:: paddlespeech.server.tests
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.server.tests.asr
paddlespeech.server.util module
===============================
.. automodule:: paddlespeech.server.util
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.utils.audio\_handler module
===============================================
.. automodule:: paddlespeech.server.utils.audio_handler
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.utils.audio\_process module
===============================================
.. automodule:: paddlespeech.server.utils.audio_process
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.utils.buffer module
=======================================
.. automodule:: paddlespeech.server.utils.buffer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.utils.config module
=======================================
.. automodule:: paddlespeech.server.utils.config
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.utils.errors module
=======================================
.. automodule:: paddlespeech.server.utils.errors
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.utils.exception module
==========================================
.. automodule:: paddlespeech.server.utils.exception
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.utils.log module
====================================
.. automodule:: paddlespeech.server.utils.log
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.utils.onnx\_infer module
============================================
.. automodule:: paddlespeech.server.utils.onnx_infer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.utils.paddle\_predictor module
==================================================
.. automodule:: paddlespeech.server.utils.paddle_predictor
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.utils package
=================================
.. automodule:: paddlespeech.server.utils
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.utils.audio_handler
paddlespeech.server.utils.audio_process
paddlespeech.server.utils.buffer
paddlespeech.server.utils.config
paddlespeech.server.utils.errors
paddlespeech.server.utils.exception
paddlespeech.server.utils.log
paddlespeech.server.utils.onnx_infer
paddlespeech.server.utils.paddle_predictor
paddlespeech.server.utils.util
paddlespeech.server.utils.vad
paddlespeech.server.utils.util module
=====================================
.. automodule:: paddlespeech.server.utils.util
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.utils.vad module
====================================
.. automodule:: paddlespeech.server.utils.vad
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.ws.api module
=================================
.. automodule:: paddlespeech.server.ws.api
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.ws.asr\_api module
======================================
.. automodule:: paddlespeech.server.ws.asr_api
:members:
:undoc-members:
:show-inheritance:
paddlespeech.server.ws package
==============================
.. automodule:: paddlespeech.server.ws
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.server.ws.api
paddlespeech.server.ws.asr_api
paddlespeech.server.ws.tts_api
paddlespeech.server.ws.tts\_api module
======================================
.. automodule:: paddlespeech.server.ws.tts_api
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.audio.audio module
===================================
.. automodule:: paddlespeech.t2s.audio.audio
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.audio.codec module
===================================
.. automodule:: paddlespeech.t2s.audio.codec
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.audio package
==============================
.. automodule:: paddlespeech.t2s.audio
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.audio.audio
paddlespeech.t2s.audio.codec
paddlespeech.t2s.audio.spec_normalizer
paddlespeech.t2s.audio.spec\_normalizer module
==============================================
.. automodule:: paddlespeech.t2s.audio.spec_normalizer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.datasets.am\_batch\_fn module
==============================================
.. automodule:: paddlespeech.t2s.datasets.am_batch_fn
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.datasets.batch module
======================================
.. automodule:: paddlespeech.t2s.datasets.batch
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.datasets.data\_table module
============================================
.. automodule:: paddlespeech.t2s.datasets.data_table
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.datasets.dataset module
========================================
.. automodule:: paddlespeech.t2s.datasets.dataset
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.datasets.get\_feats module
===========================================
.. automodule:: paddlespeech.t2s.datasets.get_feats
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.datasets.ljspeech module
=========================================
.. automodule:: paddlespeech.t2s.datasets.ljspeech
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.datasets.preprocess\_utils module
==================================================
.. automodule:: paddlespeech.t2s.datasets.preprocess_utils
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.datasets package
=================================
.. automodule:: paddlespeech.t2s.datasets
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.datasets.am_batch_fn
paddlespeech.t2s.datasets.batch
paddlespeech.t2s.datasets.data_table
paddlespeech.t2s.datasets.dataset
paddlespeech.t2s.datasets.get_feats
paddlespeech.t2s.datasets.ljspeech
paddlespeech.t2s.datasets.preprocess_utils
paddlespeech.t2s.datasets.vocoder_batch_fn
paddlespeech.t2s.datasets.vocoder\_batch\_fn module
===================================================
.. automodule:: paddlespeech.t2s.datasets.vocoder_batch_fn
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.fastspeech2.gen\_gta\_mel module
======================================================
.. automodule:: paddlespeech.t2s.exps.fastspeech2.gen_gta_mel
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.fastspeech2.normalize module
==================================================
.. automodule:: paddlespeech.t2s.exps.fastspeech2.normalize
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.fastspeech2.preprocess module
===================================================
.. automodule:: paddlespeech.t2s.exps.fastspeech2.preprocess
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.fastspeech2 package
=========================================
.. automodule:: paddlespeech.t2s.exps.fastspeech2
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.fastspeech2.gen_gta_mel
paddlespeech.t2s.exps.fastspeech2.normalize
paddlespeech.t2s.exps.fastspeech2.preprocess
paddlespeech.t2s.exps.fastspeech2.train
paddlespeech.t2s.exps.fastspeech2.train module
==============================================
.. automodule:: paddlespeech.t2s.exps.fastspeech2.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.gan\_vocoder.hifigan package
==================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.hifigan
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.gan_vocoder.hifigan.train
paddlespeech.t2s.exps.gan\_vocoder.hifigan.train module
=======================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.hifigan.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.gan\_vocoder.multi\_band\_melgan package
==============================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.multi_band_melgan
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.gan_vocoder.multi_band_melgan.train
paddlespeech.t2s.exps.gan\_vocoder.multi\_band\_melgan.train module
===================================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.multi_band_melgan.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.gan\_vocoder.normalize module
===================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.normalize
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.gan\_vocoder.parallelwave\_gan package
============================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.parallelwave_gan
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.gan_vocoder.parallelwave_gan.synthesize_from_wav
paddlespeech.t2s.exps.gan_vocoder.parallelwave_gan.train
paddlespeech.t2s.exps.gan\_vocoder.parallelwave\_gan.synthesize\_from\_wav module
=================================================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.parallelwave_gan.synthesize_from_wav
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.gan\_vocoder.parallelwave\_gan.train module
=================================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.parallelwave_gan.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.gan\_vocoder.preprocess module
====================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.preprocess
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.gan\_vocoder package
==========================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.gan_vocoder.hifigan
paddlespeech.t2s.exps.gan_vocoder.multi_band_melgan
paddlespeech.t2s.exps.gan_vocoder.parallelwave_gan
paddlespeech.t2s.exps.gan_vocoder.style_melgan
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.gan_vocoder.normalize
paddlespeech.t2s.exps.gan_vocoder.preprocess
paddlespeech.t2s.exps.gan_vocoder.synthesize
paddlespeech.t2s.exps.gan\_vocoder.style\_melgan package
========================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.style_melgan
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.gan_vocoder.style_melgan.train
paddlespeech.t2s.exps.gan\_vocoder.style\_melgan.train module
=============================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.style_melgan.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.gan\_vocoder.synthesize module
====================================================
.. automodule:: paddlespeech.t2s.exps.gan_vocoder.synthesize
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.inference module
======================================
.. automodule:: paddlespeech.t2s.exps.inference
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.inference\_streaming module
=================================================
.. automodule:: paddlespeech.t2s.exps.inference_streaming
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.ort\_predict module
=========================================
.. automodule:: paddlespeech.t2s.exps.ort_predict
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.ort\_predict\_e2e module
==============================================
.. automodule:: paddlespeech.t2s.exps.ort_predict_e2e
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.ort\_predict\_streaming module
====================================================
.. automodule:: paddlespeech.t2s.exps.ort_predict_streaming
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps package
=============================
.. automodule:: paddlespeech.t2s.exps
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.fastspeech2
paddlespeech.t2s.exps.gan_vocoder
paddlespeech.t2s.exps.speedyspeech
paddlespeech.t2s.exps.tacotron2
paddlespeech.t2s.exps.transformer_tts
paddlespeech.t2s.exps.waveflow
paddlespeech.t2s.exps.wavernn
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.inference
paddlespeech.t2s.exps.inference_streaming
paddlespeech.t2s.exps.ort_predict
paddlespeech.t2s.exps.ort_predict_e2e
paddlespeech.t2s.exps.ort_predict_streaming
paddlespeech.t2s.exps.syn_utils
paddlespeech.t2s.exps.synthesize
paddlespeech.t2s.exps.synthesize_e2e
paddlespeech.t2s.exps.synthesize_streaming
paddlespeech.t2s.exps.voice_cloning
paddlespeech.t2s.exps.speedyspeech.gen\_gta\_mel module
=======================================================
.. automodule:: paddlespeech.t2s.exps.speedyspeech.gen_gta_mel
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.speedyspeech.inference module
===================================================
.. automodule:: paddlespeech.t2s.exps.speedyspeech.inference
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.speedyspeech.normalize module
===================================================
.. automodule:: paddlespeech.t2s.exps.speedyspeech.normalize
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.speedyspeech.preprocess module
====================================================
.. automodule:: paddlespeech.t2s.exps.speedyspeech.preprocess
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.speedyspeech package
==========================================
.. automodule:: paddlespeech.t2s.exps.speedyspeech
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.speedyspeech.gen_gta_mel
paddlespeech.t2s.exps.speedyspeech.inference
paddlespeech.t2s.exps.speedyspeech.normalize
paddlespeech.t2s.exps.speedyspeech.preprocess
paddlespeech.t2s.exps.speedyspeech.synthesize_e2e
paddlespeech.t2s.exps.speedyspeech.train
paddlespeech.t2s.exps.speedyspeech.synthesize\_e2e module
=========================================================
.. automodule:: paddlespeech.t2s.exps.speedyspeech.synthesize_e2e
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.speedyspeech.train module
===============================================
.. automodule:: paddlespeech.t2s.exps.speedyspeech.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.syn\_utils module
=======================================
.. automodule:: paddlespeech.t2s.exps.syn_utils
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.synthesize module
=======================================
.. automodule:: paddlespeech.t2s.exps.synthesize
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.synthesize\_e2e module
============================================
.. automodule:: paddlespeech.t2s.exps.synthesize_e2e
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.synthesize\_streaming module
==================================================
.. automodule:: paddlespeech.t2s.exps.synthesize_streaming
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.tacotron2.normalize module
================================================
.. automodule:: paddlespeech.t2s.exps.tacotron2.normalize
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.tacotron2.preprocess module
=================================================
.. automodule:: paddlespeech.t2s.exps.tacotron2.preprocess
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.tacotron2 package
=======================================
.. automodule:: paddlespeech.t2s.exps.tacotron2
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.tacotron2.normalize
paddlespeech.t2s.exps.tacotron2.preprocess
paddlespeech.t2s.exps.tacotron2.train
paddlespeech.t2s.exps.tacotron2.train module
============================================
.. automodule:: paddlespeech.t2s.exps.tacotron2.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.transformer\_tts.normalize module
=======================================================
.. automodule:: paddlespeech.t2s.exps.transformer_tts.normalize
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.transformer\_tts.preprocess module
========================================================
.. automodule:: paddlespeech.t2s.exps.transformer_tts.preprocess
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.transformer\_tts package
==============================================
.. automodule:: paddlespeech.t2s.exps.transformer_tts
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.transformer_tts.normalize
paddlespeech.t2s.exps.transformer_tts.preprocess
paddlespeech.t2s.exps.transformer_tts.synthesize
paddlespeech.t2s.exps.transformer_tts.synthesize_e2e
paddlespeech.t2s.exps.transformer_tts.train
paddlespeech.t2s.exps.transformer\_tts.synthesize module
========================================================
.. automodule:: paddlespeech.t2s.exps.transformer_tts.synthesize
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.transformer\_tts.synthesize\_e2e module
=============================================================
.. automodule:: paddlespeech.t2s.exps.transformer_tts.synthesize_e2e
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.transformer\_tts.train module
===================================================
.. automodule:: paddlespeech.t2s.exps.transformer_tts.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.voice\_cloning module
===========================================
.. automodule:: paddlespeech.t2s.exps.voice_cloning
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.waveflow.config module
============================================
.. automodule:: paddlespeech.t2s.exps.waveflow.config
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.waveflow.ljspeech module
==============================================
.. automodule:: paddlespeech.t2s.exps.waveflow.ljspeech
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.waveflow.preprocess module
================================================
.. automodule:: paddlespeech.t2s.exps.waveflow.preprocess
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.waveflow package
======================================
.. automodule:: paddlespeech.t2s.exps.waveflow
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.waveflow.config
paddlespeech.t2s.exps.waveflow.ljspeech
paddlespeech.t2s.exps.waveflow.preprocess
paddlespeech.t2s.exps.waveflow.synthesize
paddlespeech.t2s.exps.waveflow.train
paddlespeech.t2s.exps.waveflow.synthesize module
================================================
.. automodule:: paddlespeech.t2s.exps.waveflow.synthesize
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.waveflow.train module
===========================================
.. automodule:: paddlespeech.t2s.exps.waveflow.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.wavernn package
=====================================
.. automodule:: paddlespeech.t2s.exps.wavernn
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.exps.wavernn.synthesize
paddlespeech.t2s.exps.wavernn.train
paddlespeech.t2s.exps.wavernn.synthesize module
===============================================
.. automodule:: paddlespeech.t2s.exps.wavernn.synthesize
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.exps.wavernn.train module
==========================================
.. automodule:: paddlespeech.t2s.exps.wavernn.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.arpabet module
========================================
.. automodule:: paddlespeech.t2s.frontend.arpabet
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.generate\_lexicon module
==================================================
.. automodule:: paddlespeech.t2s.frontend.generate_lexicon
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.normalizer.abbrrviation module
========================================================
.. automodule:: paddlespeech.t2s.frontend.normalizer.abbrrviation
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.normalizer.acronyms module
====================================================
.. automodule:: paddlespeech.t2s.frontend.normalizer.acronyms
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.normalizer.normalizer module
======================================================
.. automodule:: paddlespeech.t2s.frontend.normalizer.normalizer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.normalizer.numbers module
===================================================
.. automodule:: paddlespeech.t2s.frontend.normalizer.numbers
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.normalizer package
============================================
.. automodule:: paddlespeech.t2s.frontend.normalizer
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.frontend.normalizer.abbrrviation
paddlespeech.t2s.frontend.normalizer.acronyms
paddlespeech.t2s.frontend.normalizer.normalizer
paddlespeech.t2s.frontend.normalizer.numbers
paddlespeech.t2s.frontend.normalizer.width
paddlespeech.t2s.frontend.normalizer.width module
=================================================
.. automodule:: paddlespeech.t2s.frontend.normalizer.width
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.phonectic module
==========================================
.. automodule:: paddlespeech.t2s.frontend.phonectic
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.punctuation module
============================================
.. automodule:: paddlespeech.t2s.frontend.punctuation
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend package
=================================
.. automodule:: paddlespeech.t2s.frontend
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.frontend.normalizer
paddlespeech.t2s.frontend.zh_normalization
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.frontend.arpabet
paddlespeech.t2s.frontend.generate_lexicon
paddlespeech.t2s.frontend.phonectic
paddlespeech.t2s.frontend.punctuation
paddlespeech.t2s.frontend.tone_sandhi
paddlespeech.t2s.frontend.vocab
paddlespeech.t2s.frontend.zh_frontend
paddlespeech.t2s.frontend.tone\_sandhi module
=============================================
.. automodule:: paddlespeech.t2s.frontend.tone_sandhi
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.vocab module
======================================
.. automodule:: paddlespeech.t2s.frontend.vocab
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.zh\_frontend module
=============================================
.. automodule:: paddlespeech.t2s.frontend.zh_frontend
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.zh\_normalization.char\_convert module
================================================================
.. automodule:: paddlespeech.t2s.frontend.zh_normalization.char_convert
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.zh\_normalization.chronology module
=============================================================
.. automodule:: paddlespeech.t2s.frontend.zh_normalization.chronology
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.zh\_normalization.constants module
============================================================
.. automodule:: paddlespeech.t2s.frontend.zh_normalization.constants
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.zh\_normalization.num module
======================================================
.. automodule:: paddlespeech.t2s.frontend.zh_normalization.num
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.zh\_normalization.phonecode module
============================================================
.. automodule:: paddlespeech.t2s.frontend.zh_normalization.phonecode
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.zh\_normalization.quantifier module
=============================================================
.. automodule:: paddlespeech.t2s.frontend.zh_normalization.quantifier
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.frontend.zh\_normalization package
===================================================
.. automodule:: paddlespeech.t2s.frontend.zh_normalization
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.frontend.zh_normalization.char_convert
paddlespeech.t2s.frontend.zh_normalization.chronology
paddlespeech.t2s.frontend.zh_normalization.constants
paddlespeech.t2s.frontend.zh_normalization.num
paddlespeech.t2s.frontend.zh_normalization.phonecode
paddlespeech.t2s.frontend.zh_normalization.quantifier
paddlespeech.t2s.frontend.zh_normalization.text_normlization
paddlespeech.t2s.frontend.zh\_normalization.text\_normlization module
=====================================================================
.. automodule:: paddlespeech.t2s.frontend.zh_normalization.text_normlization
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.ernie\_sat.mlm module
=============================================
.. automodule:: paddlespeech.t2s.models.ernie_sat.mlm
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.ernie\_sat package
==========================================
.. automodule:: paddlespeech.t2s.models.ernie_sat
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.ernie_sat.mlm
paddlespeech.t2s.models.fastspeech2.fastspeech2 module
======================================================
.. automodule:: paddlespeech.t2s.models.fastspeech2.fastspeech2
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.fastspeech2.fastspeech2\_updater module
===============================================================
.. automodule:: paddlespeech.t2s.models.fastspeech2.fastspeech2_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.fastspeech2 package
===========================================
.. automodule:: paddlespeech.t2s.models.fastspeech2
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.fastspeech2.fastspeech2
paddlespeech.t2s.models.fastspeech2.fastspeech2_updater
paddlespeech.t2s.models.hifigan.hifigan module
==============================================
.. automodule:: paddlespeech.t2s.models.hifigan.hifigan
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.hifigan.hifigan\_updater module
=======================================================
.. automodule:: paddlespeech.t2s.models.hifigan.hifigan_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.hifigan package
=======================================
.. automodule:: paddlespeech.t2s.models.hifigan
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.hifigan.hifigan
paddlespeech.t2s.models.hifigan.hifigan_updater
paddlespeech.t2s.models.melgan.melgan module
============================================
.. automodule:: paddlespeech.t2s.models.melgan.melgan
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.melgan.multi\_band\_melgan\_updater module
==================================================================
.. automodule:: paddlespeech.t2s.models.melgan.multi_band_melgan_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.melgan package
======================================
.. automodule:: paddlespeech.t2s.models.melgan
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.melgan.melgan
paddlespeech.t2s.models.melgan.multi_band_melgan_updater
paddlespeech.t2s.models.melgan.style_melgan
paddlespeech.t2s.models.melgan.style_melgan_updater
paddlespeech.t2s.models.melgan.style\_melgan module
===================================================
.. automodule:: paddlespeech.t2s.models.melgan.style_melgan
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.melgan.style\_melgan\_updater module
============================================================
.. automodule:: paddlespeech.t2s.models.melgan.style_melgan_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.parallel\_wavegan.parallel\_wavegan module
==================================================================
.. automodule:: paddlespeech.t2s.models.parallel_wavegan.parallel_wavegan
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.parallel\_wavegan.parallel\_wavegan\_updater module
===========================================================================
.. automodule:: paddlespeech.t2s.models.parallel_wavegan.parallel_wavegan_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.parallel\_wavegan package
=================================================
.. automodule:: paddlespeech.t2s.models.parallel_wavegan
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.parallel_wavegan.parallel_wavegan
paddlespeech.t2s.models.parallel_wavegan.parallel_wavegan_updater
paddlespeech.t2s.models package
===============================
.. automodule:: paddlespeech.t2s.models
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.ernie_sat
paddlespeech.t2s.models.fastspeech2
paddlespeech.t2s.models.hifigan
paddlespeech.t2s.models.melgan
paddlespeech.t2s.models.parallel_wavegan
paddlespeech.t2s.models.speedyspeech
paddlespeech.t2s.models.tacotron2
paddlespeech.t2s.models.transformer_tts
paddlespeech.t2s.models.vits
paddlespeech.t2s.models.wavernn
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.waveflow
paddlespeech.t2s.models.speedyspeech package
============================================
.. automodule:: paddlespeech.t2s.models.speedyspeech
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.speedyspeech.speedyspeech
paddlespeech.t2s.models.speedyspeech.speedyspeech_updater
paddlespeech.t2s.models.speedyspeech.speedyspeech module
========================================================
.. automodule:: paddlespeech.t2s.models.speedyspeech.speedyspeech
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.speedyspeech.speedyspeech\_updater module
=================================================================
.. automodule:: paddlespeech.t2s.models.speedyspeech.speedyspeech_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.tacotron2 package
=========================================
.. automodule:: paddlespeech.t2s.models.tacotron2
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.tacotron2.tacotron2
paddlespeech.t2s.models.tacotron2.tacotron2_updater
paddlespeech.t2s.models.tacotron2.tacotron2 module
==================================================
.. automodule:: paddlespeech.t2s.models.tacotron2.tacotron2
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.tacotron2.tacotron2\_updater module
===========================================================
.. automodule:: paddlespeech.t2s.models.tacotron2.tacotron2_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.transformer\_tts package
================================================
.. automodule:: paddlespeech.t2s.models.transformer_tts
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.transformer_tts.transformer_tts
paddlespeech.t2s.models.transformer_tts.transformer_tts_updater
paddlespeech.t2s.models.transformer\_tts.transformer\_tts module
================================================================
.. automodule:: paddlespeech.t2s.models.transformer_tts.transformer_tts
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.transformer\_tts.transformer\_tts\_updater module
=========================================================================
.. automodule:: paddlespeech.t2s.models.transformer_tts.transformer_tts_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.vits.duration\_predictor module
=======================================================
.. automodule:: paddlespeech.t2s.models.vits.duration_predictor
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.vits.flow module
========================================
.. automodule:: paddlespeech.t2s.models.vits.flow
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.vits.generator module
=============================================
.. automodule:: paddlespeech.t2s.models.vits.generator
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.vits.posterior\_encoder module
======================================================
.. automodule:: paddlespeech.t2s.models.vits.posterior_encoder
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.vits.residual\_coupling module
======================================================
.. automodule:: paddlespeech.t2s.models.vits.residual_coupling
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.vits package
====================================
.. automodule:: paddlespeech.t2s.models.vits
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.vits.monotonic_align
paddlespeech.t2s.models.vits.wavenet
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.vits.duration_predictor
paddlespeech.t2s.models.vits.flow
paddlespeech.t2s.models.vits.generator
paddlespeech.t2s.models.vits.posterior_encoder
paddlespeech.t2s.models.vits.residual_coupling
paddlespeech.t2s.models.vits.text_encoder
paddlespeech.t2s.models.vits.transform
paddlespeech.t2s.models.vits.vits
paddlespeech.t2s.models.vits.vits_updater
paddlespeech.t2s.models.vits.text\_encoder module
=================================================
.. automodule:: paddlespeech.t2s.models.vits.text_encoder
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.vits.transform module
=============================================
.. automodule:: paddlespeech.t2s.models.vits.transform
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.vits.vits module
========================================
.. automodule:: paddlespeech.t2s.models.vits.vits
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.vits.vits\_updater module
=================================================
.. automodule:: paddlespeech.t2s.models.vits.vits_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.vits.wavenet.residual\_block module
===========================================================
.. automodule:: paddlespeech.t2s.models.vits.wavenet.residual_block
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.vits.wavenet package
============================================
.. automodule:: paddlespeech.t2s.models.vits.wavenet
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.vits.wavenet.residual_block
paddlespeech.t2s.models.vits.wavenet.wavenet
paddlespeech.t2s.models.vits.wavenet.wavenet module
===================================================
.. automodule:: paddlespeech.t2s.models.vits.wavenet.wavenet
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.waveflow module
=======================================
.. automodule:: paddlespeech.t2s.models.waveflow
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.wavernn package
=======================================
.. automodule:: paddlespeech.t2s.models.wavernn
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.models.wavernn.wavernn
paddlespeech.t2s.models.wavernn.wavernn_updater
paddlespeech.t2s.models.wavernn.wavernn module
==============================================
.. automodule:: paddlespeech.t2s.models.wavernn.wavernn
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.models.wavernn.wavernn\_updater module
=======================================================
.. automodule:: paddlespeech.t2s.models.wavernn.wavernn_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.activation module
==========================================
.. automodule:: paddlespeech.t2s.modules.activation
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.causal\_conv module
============================================
.. automodule:: paddlespeech.t2s.modules.causal_conv
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.conformer.convolution module
=====================================================
.. automodule:: paddlespeech.t2s.modules.conformer.convolution
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.conformer.encoder\_layer module
========================================================
.. automodule:: paddlespeech.t2s.modules.conformer.encoder_layer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.conformer package
==========================================
.. automodule:: paddlespeech.t2s.modules.conformer
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.modules.conformer.convolution
paddlespeech.t2s.modules.conformer.encoder_layer
paddlespeech.t2s.modules.conv module
====================================
.. automodule:: paddlespeech.t2s.modules.conv
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.geometry module
========================================
.. automodule:: paddlespeech.t2s.modules.geometry
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.layer\_norm module
===========================================
.. automodule:: paddlespeech.t2s.modules.layer_norm
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.losses module
======================================
.. automodule:: paddlespeech.t2s.modules.losses
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.masked\_fill module
============================================
.. automodule:: paddlespeech.t2s.modules.masked_fill
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.nets\_utils module
===========================================
.. automodule:: paddlespeech.t2s.modules.nets_utils
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.normalizer module
==========================================
.. automodule:: paddlespeech.t2s.modules.normalizer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.positional\_encoding module
====================================================
.. automodule:: paddlespeech.t2s.modules.positional_encoding
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.pqmf module
====================================
.. automodule:: paddlespeech.t2s.modules.pqmf
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.predictor.duration\_predictor module
=============================================================
.. automodule:: paddlespeech.t2s.modules.predictor.duration_predictor
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.predictor.length\_regulator module
===========================================================
.. automodule:: paddlespeech.t2s.modules.predictor.length_regulator
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.predictor package
==========================================
.. automodule:: paddlespeech.t2s.modules.predictor
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.modules.predictor.duration_predictor
paddlespeech.t2s.modules.predictor.length_regulator
paddlespeech.t2s.modules.predictor.variance_predictor
paddlespeech.t2s.modules.predictor.variance\_predictor module
=============================================================
.. automodule:: paddlespeech.t2s.modules.predictor.variance_predictor
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.residual\_block module
===============================================
.. automodule:: paddlespeech.t2s.modules.residual_block
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.residual\_stack module
===============================================
.. automodule:: paddlespeech.t2s.modules.residual_stack
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules package
================================
.. automodule:: paddlespeech.t2s.modules
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.modules.conformer
paddlespeech.t2s.modules.predictor
paddlespeech.t2s.modules.tacotron2
paddlespeech.t2s.modules.transformer
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.modules.activation
paddlespeech.t2s.modules.causal_conv
paddlespeech.t2s.modules.conv
paddlespeech.t2s.modules.geometry
paddlespeech.t2s.modules.layer_norm
paddlespeech.t2s.modules.losses
paddlespeech.t2s.modules.masked_fill
paddlespeech.t2s.modules.nets_utils
paddlespeech.t2s.modules.normalizer
paddlespeech.t2s.modules.positional_encoding
paddlespeech.t2s.modules.pqmf
paddlespeech.t2s.modules.residual_block
paddlespeech.t2s.modules.residual_stack
paddlespeech.t2s.modules.style_encoder
paddlespeech.t2s.modules.tade_res_block
paddlespeech.t2s.modules.upsample
paddlespeech.t2s.modules.style\_encoder module
==============================================
.. automodule:: paddlespeech.t2s.modules.style_encoder
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.tacotron2.attentions module
====================================================
.. automodule:: paddlespeech.t2s.modules.tacotron2.attentions
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.tacotron2.decoder module
=================================================
.. automodule:: paddlespeech.t2s.modules.tacotron2.decoder
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.tacotron2.encoder module
=================================================
.. automodule:: paddlespeech.t2s.modules.tacotron2.encoder
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.tacotron2 package
==========================================
.. automodule:: paddlespeech.t2s.modules.tacotron2
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.modules.tacotron2.attentions
paddlespeech.t2s.modules.tacotron2.decoder
paddlespeech.t2s.modules.tacotron2.encoder
paddlespeech.t2s.modules.tade\_res\_block module
================================================
.. automodule:: paddlespeech.t2s.modules.tade_res_block
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer.attention module
=====================================================
.. automodule:: paddlespeech.t2s.modules.transformer.attention
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer.decoder module
===================================================
.. automodule:: paddlespeech.t2s.modules.transformer.decoder
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer.decoder\_layer module
==========================================================
.. automodule:: paddlespeech.t2s.modules.transformer.decoder_layer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer.embedding module
=====================================================
.. automodule:: paddlespeech.t2s.modules.transformer.embedding
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer.encoder module
===================================================
.. automodule:: paddlespeech.t2s.modules.transformer.encoder
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer.encoder\_layer module
==========================================================
.. automodule:: paddlespeech.t2s.modules.transformer.encoder_layer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer.lightconv module
=====================================================
.. automodule:: paddlespeech.t2s.modules.transformer.lightconv
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer.mask module
================================================
.. automodule:: paddlespeech.t2s.modules.transformer.mask
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer.multi\_layer\_conv module
==============================================================
.. automodule:: paddlespeech.t2s.modules.transformer.multi_layer_conv
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer.positionwise\_feed\_forward module
=======================================================================
.. automodule:: paddlespeech.t2s.modules.transformer.positionwise_feed_forward
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer.repeat module
==================================================
.. automodule:: paddlespeech.t2s.modules.transformer.repeat
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.transformer package
============================================
.. automodule:: paddlespeech.t2s.modules.transformer
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.modules.transformer.attention
paddlespeech.t2s.modules.transformer.decoder
paddlespeech.t2s.modules.transformer.decoder_layer
paddlespeech.t2s.modules.transformer.embedding
paddlespeech.t2s.modules.transformer.encoder
paddlespeech.t2s.modules.transformer.encoder_layer
paddlespeech.t2s.modules.transformer.lightconv
paddlespeech.t2s.modules.transformer.mask
paddlespeech.t2s.modules.transformer.multi_layer_conv
paddlespeech.t2s.modules.transformer.positionwise_feed_forward
paddlespeech.t2s.modules.transformer.repeat
paddlespeech.t2s.modules.transformer.subsampling
paddlespeech.t2s.modules.transformer.subsampling module
=======================================================
.. automodule:: paddlespeech.t2s.modules.transformer.subsampling
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.modules.upsample module
========================================
.. automodule:: paddlespeech.t2s.modules.upsample
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s package
========================
.. automodule:: paddlespeech.t2s
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.audio
paddlespeech.t2s.datasets
paddlespeech.t2s.exps
paddlespeech.t2s.frontend
paddlespeech.t2s.models
paddlespeech.t2s.modules
paddlespeech.t2s.training
paddlespeech.t2s.utils
paddlespeech.t2s.training.cli module
====================================
.. automodule:: paddlespeech.t2s.training.cli
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.default\_config module
================================================
.. automodule:: paddlespeech.t2s.training.default_config
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.experiment module
===========================================
.. automodule:: paddlespeech.t2s.training.experiment
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.extension module
==========================================
.. automodule:: paddlespeech.t2s.training.extension
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.extensions.evaluator module
=====================================================
.. automodule:: paddlespeech.t2s.training.extensions.evaluator
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.extensions package
============================================
.. automodule:: paddlespeech.t2s.training.extensions
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.training.extensions.evaluator
paddlespeech.t2s.training.extensions.snapshot
paddlespeech.t2s.training.extensions.visualizer
paddlespeech.t2s.training.extensions.snapshot module
====================================================
.. automodule:: paddlespeech.t2s.training.extensions.snapshot
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.extensions.visualizer module
======================================================
.. automodule:: paddlespeech.t2s.training.extensions.visualizer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.optimizer module
==========================================
.. automodule:: paddlespeech.t2s.training.optimizer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.reporter module
=========================================
.. automodule:: paddlespeech.t2s.training.reporter
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training package
=================================
.. automodule:: paddlespeech.t2s.training
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.training.extensions
paddlespeech.t2s.training.triggers
paddlespeech.t2s.training.updaters
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.training.cli
paddlespeech.t2s.training.default_config
paddlespeech.t2s.training.experiment
paddlespeech.t2s.training.extension
paddlespeech.t2s.training.optimizer
paddlespeech.t2s.training.reporter
paddlespeech.t2s.training.seeding
paddlespeech.t2s.training.trainer
paddlespeech.t2s.training.trigger
paddlespeech.t2s.training.updater
paddlespeech.t2s.training.seeding module
========================================
.. automodule:: paddlespeech.t2s.training.seeding
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.trainer module
========================================
.. automodule:: paddlespeech.t2s.training.trainer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.trigger module
========================================
.. automodule:: paddlespeech.t2s.training.trigger
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.triggers.interval\_trigger module
===========================================================
.. automodule:: paddlespeech.t2s.training.triggers.interval_trigger
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.triggers.limit\_trigger module
========================================================
.. automodule:: paddlespeech.t2s.training.triggers.limit_trigger
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.triggers package
==========================================
.. automodule:: paddlespeech.t2s.training.triggers
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.training.triggers.interval_trigger
paddlespeech.t2s.training.triggers.limit_trigger
paddlespeech.t2s.training.triggers.time_trigger
paddlespeech.t2s.training.triggers.time\_trigger module
=======================================================
.. automodule:: paddlespeech.t2s.training.triggers.time_trigger
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.updater module
========================================
.. automodule:: paddlespeech.t2s.training.updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.training.updaters package
==========================================
.. automodule:: paddlespeech.t2s.training.updaters
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.training.updaters.standard_updater
paddlespeech.t2s.training.updaters.standard\_updater module
===========================================================
.. automodule:: paddlespeech.t2s.training.updaters.standard_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.utils.checkpoint module
========================================
.. automodule:: paddlespeech.t2s.utils.checkpoint
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.utils.display module
=====================================
.. automodule:: paddlespeech.t2s.utils.display
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.utils.error\_rate module
=========================================
.. automodule:: paddlespeech.t2s.utils.error_rate
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.utils.h5\_utils module
=======================================
.. automodule:: paddlespeech.t2s.utils.h5_utils
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.utils.internals module
=======================================
.. automodule:: paddlespeech.t2s.utils.internals
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.utils.layer\_tools module
==========================================
.. automodule:: paddlespeech.t2s.utils.layer_tools
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.utils.mp\_tools module
=======================================
.. automodule:: paddlespeech.t2s.utils.mp_tools
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.utils.profiler module
======================================
.. automodule:: paddlespeech.t2s.utils.profiler
:members:
:undoc-members:
:show-inheritance:
paddlespeech.t2s.utils package
==============================
.. automodule:: paddlespeech.t2s.utils
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.t2s.utils.checkpoint
paddlespeech.t2s.utils.display
paddlespeech.t2s.utils.error_rate
paddlespeech.t2s.utils.h5_utils
paddlespeech.t2s.utils.internals
paddlespeech.t2s.utils.layer_tools
paddlespeech.t2s.utils.mp_tools
paddlespeech.t2s.utils.profiler
paddlespeech.t2s.utils.scheduler
paddlespeech.t2s.utils.scheduler module
=======================================
.. automodule:: paddlespeech.t2s.utils.scheduler
:members:
:undoc-members:
:show-inheritance:
paddlespeech.text.exps.ernie\_linear.avg\_model module
======================================================
.. automodule:: paddlespeech.text.exps.ernie_linear.avg_model
:members:
:undoc-members:
:show-inheritance:
paddlespeech.text.exps.ernie\_linear.punc\_restore module
=========================================================
.. automodule:: paddlespeech.text.exps.ernie_linear.punc_restore
:members:
:undoc-members:
:show-inheritance:
paddlespeech.text.exps.ernie\_linear package
============================================
.. automodule:: paddlespeech.text.exps.ernie_linear
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.text.exps.ernie_linear.avg_model
paddlespeech.text.exps.ernie_linear.punc_restore
paddlespeech.text.exps.ernie_linear.test
paddlespeech.text.exps.ernie_linear.train
paddlespeech.text.exps.ernie\_linear.test module
================================================
.. automodule:: paddlespeech.text.exps.ernie_linear.test
:members:
:undoc-members:
:show-inheritance:
paddlespeech.text.exps.ernie\_linear.train module
=================================================
.. automodule:: paddlespeech.text.exps.ernie_linear.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.text.exps package
==============================
.. automodule:: paddlespeech.text.exps
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.text.exps.ernie_linear
paddlespeech.text.models.ernie\_crf.model module
================================================
.. automodule:: paddlespeech.text.models.ernie_crf.model
:members:
:undoc-members:
:show-inheritance:
paddlespeech.text.models.ernie\_crf package
===========================================
.. automodule:: paddlespeech.text.models.ernie_crf
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.text.models.ernie_crf.model
paddlespeech.text.models.ernie\_linear.dataset module
=====================================================
.. automodule:: paddlespeech.text.models.ernie_linear.dataset
:members:
:undoc-members:
:show-inheritance:
paddlespeech.text.models.ernie\_linear.ernie\_linear module
===========================================================
.. automodule:: paddlespeech.text.models.ernie_linear.ernie_linear
:members:
:undoc-members:
:show-inheritance:
paddlespeech.text.models.ernie\_linear.ernie\_linear\_updater module
====================================================================
.. automodule:: paddlespeech.text.models.ernie_linear.ernie_linear_updater
:members:
:undoc-members:
:show-inheritance:
paddlespeech.text.models.ernie\_linear package
==============================================
.. automodule:: paddlespeech.text.models.ernie_linear
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.text.models.ernie_linear.dataset
paddlespeech.text.models.ernie_linear.ernie_linear
paddlespeech.text.models.ernie_linear.ernie_linear_updater
paddlespeech.text.models package
================================
.. automodule:: paddlespeech.text.models
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.text.models.ernie_crf
paddlespeech.text.models.ernie_linear
paddlespeech.text package
=========================
.. automodule:: paddlespeech.text
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.text.exps
paddlespeech.text.models
paddlespeech.vector.cluster.diarization module
==============================================
.. automodule:: paddlespeech.vector.cluster.diarization
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.cluster.plda module
=======================================
.. automodule:: paddlespeech.vector.cluster.plda
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.cluster package
===================================
.. automodule:: paddlespeech.vector.cluster
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.vector.cluster.diarization
paddlespeech.vector.cluster.plda
paddlespeech.vector.exps.ge2e.audio\_processor module
=====================================================
.. automodule:: paddlespeech.vector.exps.ge2e.audio_processor
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.exps.ge2e.config module
===========================================
.. automodule:: paddlespeech.vector.exps.ge2e.config
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.exps.ge2e.dataset\_processors module
========================================================
.. automodule:: paddlespeech.vector.exps.ge2e.dataset_processors
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.exps.ge2e.inference module
==============================================
.. automodule:: paddlespeech.vector.exps.ge2e.inference
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.exps.ge2e.preprocess module
===============================================
.. automodule:: paddlespeech.vector.exps.ge2e.preprocess
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.exps.ge2e.random\_cycle module
==================================================
.. automodule:: paddlespeech.vector.exps.ge2e.random_cycle
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.exps.ge2e package
=====================================
.. automodule:: paddlespeech.vector.exps.ge2e
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.vector.exps.ge2e.audio_processor
paddlespeech.vector.exps.ge2e.config
paddlespeech.vector.exps.ge2e.dataset_processors
paddlespeech.vector.exps.ge2e.inference
paddlespeech.vector.exps.ge2e.preprocess
paddlespeech.vector.exps.ge2e.random_cycle
paddlespeech.vector.exps.ge2e.speaker_verification_dataset
paddlespeech.vector.exps.ge2e.train
paddlespeech.vector.exps.ge2e.speaker\_verification\_dataset module
===================================================================
.. automodule:: paddlespeech.vector.exps.ge2e.speaker_verification_dataset
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.exps.ge2e.train module
==========================================
.. automodule:: paddlespeech.vector.exps.ge2e.train
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.exps package
================================
.. automodule:: paddlespeech.vector.exps
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.vector.exps.ge2e
paddlespeech.vector.io.augment module
=====================================
.. automodule:: paddlespeech.vector.io.augment
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.io.batch module
===================================
.. automodule:: paddlespeech.vector.io.batch
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.io.dataset module
=====================================
.. automodule:: paddlespeech.vector.io.dataset
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.io.dataset\_from\_json module
=================================================
.. automodule:: paddlespeech.vector.io.dataset_from_json
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.io.embedding\_norm module
=============================================
.. automodule:: paddlespeech.vector.io.embedding_norm
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.io package
==============================
.. automodule:: paddlespeech.vector.io
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.vector.io.augment
paddlespeech.vector.io.batch
paddlespeech.vector.io.dataset
paddlespeech.vector.io.dataset_from_json
paddlespeech.vector.io.embedding_norm
paddlespeech.vector.io.signal_processing
paddlespeech.vector.io.signal\_processing module
================================================
.. automodule:: paddlespeech.vector.io.signal_processing
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.models.ecapa\_tdnn module
=============================================
.. automodule:: paddlespeech.vector.models.ecapa_tdnn
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.models.lstm\_speaker\_encoder module
========================================================
.. automodule:: paddlespeech.vector.models.lstm_speaker_encoder
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.models package
==================================
.. automodule:: paddlespeech.vector.models
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.vector.models.ecapa_tdnn
paddlespeech.vector.models.lstm_speaker_encoder
paddlespeech.vector.modules.layer module
========================================
.. automodule:: paddlespeech.vector.modules.layer
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.modules.loss module
=======================================
.. automodule:: paddlespeech.vector.modules.loss
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.modules package
===================================
.. automodule:: paddlespeech.vector.modules
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.vector.modules.layer
paddlespeech.vector.modules.loss
paddlespeech.vector.modules.sid_model
paddlespeech.vector.modules.sid\_model module
=============================================
.. automodule:: paddlespeech.vector.modules.sid_model
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector package
===========================
.. automodule:: paddlespeech.vector
:members:
:undoc-members:
:show-inheritance:
Subpackages
-----------
.. toctree::
:maxdepth: 4
paddlespeech.vector.cluster
paddlespeech.vector.exps
paddlespeech.vector.io
paddlespeech.vector.models
paddlespeech.vector.modules
paddlespeech.vector.training
paddlespeech.vector.utils
paddlespeech.vector.training package
====================================
.. automodule:: paddlespeech.vector.training
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.vector.training.scheduler
paddlespeech.vector.training.seeding
paddlespeech.vector.training.scheduler module
=============================================
.. automodule:: paddlespeech.vector.training.scheduler
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.training.seeding module
===========================================
.. automodule:: paddlespeech.vector.training.seeding
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.utils package
=================================
.. automodule:: paddlespeech.vector.utils
:members:
:undoc-members:
:show-inheritance:
Submodules
----------
.. toctree::
:maxdepth: 4
paddlespeech.vector.utils.time
paddlespeech.vector.utils.vector_utils
paddlespeech.vector.utils.time module
=====================================
.. automodule:: paddlespeech.vector.utils.time
:members:
:undoc-members:
:show-inheritance:
paddlespeech.vector.utils.vector\_utils module
==============================================
.. automodule:: paddlespeech.vector.utils.vector_utils
:members:
:undoc-members:
:show-inheritance:
......@@ -22,6 +22,9 @@
# documentation root, use os.path.abspath to make it absolute, like shown here.
import recommonmark.parser
import sphinx_rtd_theme
import sys
import os
sys.path.insert(0, os.path.abspath('../..'))
autodoc_mock_imports = ["soundfile", "librosa"]
......
......@@ -64,3 +64,18 @@ Contents
:caption: Acknowledgement
asr/reference
.. toctree::
:maxdepth: 2
:caption: API Reference
paddlespeech.audio <api/paddlespeech.audio>
paddlespeech.cli <api/paddlespeech.cli>
paddlespeech.cls <api/paddlespeech.cls>
paddlespeech.kws <api/paddlespeech.kws>
paddlespeech.s2t <api/paddlespeech.s2t>
paddlespeech.server <api/paddlespeech.server>
paddlespeech.t2s <api/paddlespeech.t2s>
paddlespeech.text <api/paddlespeech.text>
paddlespeech.vector <api/paddlespeech.vector>
......@@ -5,8 +5,8 @@ There are 3 ways to use `PaddleSpeech`. According to the degree of difficulty, t
| Way | Function | Support|
|:---- |:----------------------------------------------------------- |:----|
| Easy | (1) Use command-line functions of PaddleSpeech. <br> (2) Experience PaddleSpeech on Ai Studio. | Linux, Mac(not support M1 chip),Windows ( For more information about installation, see [#1195](https://github.com/PaddlePaddle/PaddleSpeech/discussions/1195)) |
| Medium | Support major functions ,such as using the` ready-made `examples and using PaddleSpeech to train your model. | Linux |
| Hard | Support full function of Paddlespeech, including using join ctc decoder with kaldi, training n-gram language model, Montreal-Forced-Aligner, and so on. And you are more able to be a developer! | Ubuntu |
| Medium | Support major functions ,such as using the` ready-made `examples and using PaddleSpeech to train your model. | Linux, Mac(not support M1 chip, not support training models),Windows (not support training models) |
| Hard | Support full function of Paddlespeech, including using join ctc decoder with kaldi([asr2](../../examples/librispeech/asr2 )), training n-gram language model, Montreal-Forced-Aligner, and so on. And you are more able to be a developer! | Ubuntu |
## Prerequisites
- Python >= 3.7
......@@ -65,7 +65,7 @@ pip install paddlespeech -i https://pypi.tuna.tsinghua.edu.cn/simple
> If you fail to install paddlespeech-ctcdecoders, it doesn't matter.
## Medium: Get the Major Functions (Support Linux)
## Medium: Get the Major Functions (Support Linux, mac and windows not support training)
If you want to get the major function of `paddlespeech`, you need to do following steps:
### Git clone PaddleSpeech
You need to `git clone` this repository at first.
......@@ -75,7 +75,7 @@ cd PaddleSpeech
```
### Install Conda
Conda is a management system of the environment. You can go to [minicoda](https://docs.conda.io/en/latest/miniconda.html) to select a version (py>=3.7) and install it by yourself or you can use the following command:
Conda is a management system of the environment. You can go to [minicoda](https://docs.conda.io/en/latest/miniconda.html) to select a version (py>=3.7). For windows, you can follow the installing guide step by step and for linux and mac, you can use the following commands:
```bash
# download the miniconda
wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh -P tools/
......
......@@ -4,8 +4,8 @@
| 方式 | 功能 | 支持系统 |
| :--- | :----------------------------------------------------------- | :------------------ |
| 简单 | (1) 使用 PaddleSpeech 的命令行功能. <br> (2) 在 Aistudio上体验 PaddleSpeech. | Linux, Mac(不支持M1芯片),Windows (安装详情查看[#1195](https://github.com/PaddlePaddle/PaddleSpeech/discussions/1195)) |
| 中等 | 支持 PaddleSpeech 主要功能,比如使用已有 examples 中的模型和使用 PaddleSpeech 来训练自己的模型. | Linux |
| 困难 | 支持 PaddleSpeech 的各项功能,包含结合kaldi使用 join ctc decoder 方式解码,训练语言模型,使用强制对齐等。并且你更能成为一名开发者! | Ubuntu |
| 中等 | 支持 PaddleSpeech 主要功能,比如使用已有 examples 中的模型和使用 PaddleSpeech 来训练自己的模型. | Linux, Mac(不支持M1芯片,不支持训练), Windows(不支持训练) |
| 困难 | 支持 PaddleSpeech 的各项功能,包含结合 kaldi 使用 join ctc decoder 方式解码 ([asr2](../../examples/librispeech/asr2 )),训练语言模型,使用强制对齐等。并且你更能成为一名开发者! | Ubuntu |
## 先决条件
- Python >= 3.7
- 最新版本的 PaddlePaddle (请看 [安装向导](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/index_en.html))
......@@ -62,7 +62,7 @@ pip install paddlespeech -i https://pypi.tuna.tsinghua.edu.cn/simple
> 如果出现 paddlespeech-ctcdecoders 无法安装的问题,无须担心,这不影响使用。
## 中等: 获取主要功能(支持 Linux)
## 中等: 获取主要功能(支持 Linux, Mac 和 Windows 不支持训练
如果你想要使用 `paddlespeech` 的主要功能。你需要完成以下几个步骤
### Git clone PaddleSpeech
你需要先 git clone 本仓库
......@@ -71,7 +71,7 @@ git clone https://github.com/PaddlePaddle/PaddleSpeech.git
cd PaddleSpeech
```
### 安装 Conda
Conda 是一个包管理的环境。你可以前往 [minicoda](https://docs.conda.io/en/latest/miniconda.html) 去下载并安装 conda(请下载 py>=3.7 的版本)。你可以尝试自己安装,或者使用以下的命令:
Conda 是一个包管理的环境。你可以前往 [minicoda](https://docs.conda.io/en/latest/miniconda.html) 去下载并安装 conda(请下载 py>=3.7 的版本)。windows 系统可以使用 conda 的向导安装,linux 和 mac 可以使用以下的命令:
```bash
# 下载 miniconda
wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh -P tools/
......@@ -116,9 +116,9 @@ conda install -y -c gcc_linux-64=8.4.0 gxx_linux-64=8.4.0
python3 -m pip install paddlepaddle-gpu==2.2.0 -i https://mirror.baidu.com/pypi/simple
```
### 安装 PaddleSpeech
最后安装 `paddlespeech`,这样你就可以使用 `paddlespeech`中已有的 examples:
最后安装 `paddlespeech`,这样你就可以使用 `paddlespeech` 中已有的 examples:
```bash
# 部分用户系统由于默认源的问题,安装中会出现kaldiio安转出错的问题,建议首先安装pytest-runner:
# 部分用户系统由于默认源的问题,安装中会出现 kaldiio 安转出错的问题,建议首先安装pytest-runner:
pip install pytest-runner -i https://pypi.tuna.tsinghua.edu.cn/simple
# 请确保目前处于PaddleSpeech项目的根目录
pip install . -i https://pypi.tuna.tsinghua.edu.cn/simple
......@@ -137,7 +137,7 @@ Docker 是一种开源工具,用于在和系统本身环境相隔离的环境
[Docker Hub](https://hub.docker.com/repository/docker/paddlecloud/paddlespeech) 中获取这些镜像及相应的使用指南,包括 CPU、GPU、ROCm 版本。
如果您对自动化制作docker镜像感兴趣,或有自定义需求,请访问 [PaddlePaddle/PaddleCloud](https://github.com/PaddlePaddle/PaddleCloud/tree/main/tekton) 做进一步了解。
如果您对自动化制作 docker 镜像感兴趣,或有自定义需求,请访问 [PaddlePaddle/PaddleCloud](https://github.com/PaddlePaddle/PaddleCloud/tree/main/tekton) 做进一步了解。
完成这些以后,你就可以在 docker 容器中执行训练、推理和超参 fine-tune。
### 选择2: 使用有 root 权限的 Ubuntu
- 使用apt安装 `build-essential`
......@@ -173,7 +173,7 @@ conda install -y -c conda-forge sox libsndfile swig bzip2 libflac bc
python3 -m pip install paddlepaddle-gpu==2.2.0 -i https://mirror.baidu.com/pypi/simple
```
### 用开发者模式安装 PaddleSpeech
部分用户系统由于默认源的问题,安装中会出现kaldiio安转出错的问题,建议首先安装pytest-runner:
部分用户系统由于默认源的问题,安装中会出现 kaldiio 安转出错的问题,建议首先安装 pytest-runner:
```bash
pip install pytest-runner -i https://pypi.tuna.tsinghua.edu.cn/simple
```
......
# Released Models
## Speech-to-Text Models
......@@ -34,32 +33,33 @@ Language Model | Training Data | Token-based | Size | Descriptions
## Text-to-Speech Models
### Acoustic Models
Model Type | Dataset| Example Link | Pretrained Models|Static Models|Size (static)
Model Type | Dataset| Example Link | Pretrained Models|Static/ONNX Models|Size (static)
:-------------:| :------------:| :-----: | :-----:| :-----:| :-----:
Tacotron2|LJSpeech|[tacotron2-ljspeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/ljspeech/tts0)|[tacotron2_ljspeech_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/tacotron2/tacotron2_ljspeech_ckpt_0.2.0.zip)|||
Tacotron2|CSMSC|[tacotron2-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/tts0)|[tacotron2_csmsc_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/tacotron2/tacotron2_csmsc_ckpt_0.2.0.zip)|[tacotron2_csmsc_static_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/tacotron2/tacotron2_csmsc_static_0.2.0.zip)|103MB|
TransformerTTS| LJSpeech| [transformer-ljspeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/ljspeech/tts1)|[transformer_tts_ljspeech_ckpt_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/transformer_tts/transformer_tts_ljspeech_ckpt_0.4.zip)|||
SpeedySpeech| CSMSC | [speedyspeech-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/tts2)|[speedyspeech_csmsc_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/speedyspeech/speedyspeech_csmsc_ckpt_0.2.0.zip)|[speedyspeech_csmsc_static_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/speedyspeech/speedyspeech_csmsc_static_0.2.0.zip)|12MB|
FastSpeech2| CSMSC |[fastspeech2-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/tts3)|[fastspeech2_nosil_baker_ckpt_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_baker_ckpt_0.4.zip)|[fastspeech2_csmsc_static_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_csmsc_static_0.2.0.zip)|157MB|
SpeedySpeech| CSMSC | [speedyspeech-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/tts2)|[speedyspeech_csmsc_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/speedyspeech/speedyspeech_csmsc_ckpt_0.2.0.zip)|[speedyspeech_csmsc_static_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/speedyspeech/speedyspeech_csmsc_static_0.2.0.zip) </br> [speedyspeech_csmsc_onnx_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/speedyspeech/speedyspeech_csmsc_onnx_0.2.0.zip)|13MB|
FastSpeech2| CSMSC |[fastspeech2-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/tts3)|[fastspeech2_nosil_baker_ckpt_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_baker_ckpt_0.4.zip)|[fastspeech2_csmsc_static_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_csmsc_static_0.2.0.zip) </br> [fastspeech2_csmsc_onnx_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_csmsc_onnx_0.2.0.zip)|157MB|
FastSpeech2-Conformer| CSMSC |[fastspeech2-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/tts3)|[fastspeech2_conformer_baker_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_conformer_baker_ckpt_0.5.zip)|||
FastSpeech2| AISHELL-3 |[fastspeech2-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/tts3)|[fastspeech2_nosil_aishell3_ckpt_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_aishell3_ckpt_0.4.zip)|||
FastSpeech2| LJSpeech |[fastspeech2-ljspeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/ljspeech/tts3)|[fastspeech2_nosil_ljspeech_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_ljspeech_ckpt_0.5.zip)|||
FastSpeech2| VCTK |[fastspeech2-vctk](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/vctk/tts3)|[fastspeech2_nosil_vctk_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_vctk_ckpt_0.5.zip)|||
FastSpeech2-CNNDecoder| CSMSC| [fastspeech2-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/tts3)| [fastspeech2_cnndecoder_csmsc_ckpt_1.0.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_cnndecoder_csmsc_ckpt_1.0.0.zip) | [fastspeech2_cnndecoder_csmsc_static_1.0.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_cnndecoder_csmsc_static_1.0.0.zip) </br>[fastspeech2_cnndecoder_csmsc_streaming_static_1.0.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_cnndecoder_csmsc_streaming_static_1.0.0.zip) </br>[fastspeech2_cnndecoder_csmsc_onnx_1.0.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_cnndecoder_csmsc_onnx_1.0.0.zip) </br>[fastspeech2_cnndecoder_csmsc_streaming_onnx_1.0.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_cnndecoder_csmsc_streaming_onnx_1.0.0.zip) | 84MB|
FastSpeech2| AISHELL-3 |[fastspeech2-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/tts3)|[fastspeech2_nosil_aishell3_ckpt_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_aishell3_ckpt_0.4.zip)|[fastspeech2_aishell3_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_aishell3_static_1.1.0.zip) </br> [fastspeech2_aishell3_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_aishell3_onnx_1.1.0.zip)|147MB|
FastSpeech2| LJSpeech |[fastspeech2-ljspeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/ljspeech/tts3)|[fastspeech2_nosil_ljspeech_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_ljspeech_ckpt_0.5.zip)|[fastspeech2_ljspeech_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_ljspeech_static_1.1.0.zip) </br> [fastspeech2_ljspeech_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_ljspeech_onnx_1.1.0.zip)|145MB|
FastSpeech2| VCTK |[fastspeech2-vctk](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/vctk/tts3)|[fastspeech2_nosil_vctk_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_vctk_ckpt_0.5.zip)|[fastspeech2_vctk_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_vctk_static_1.1.0.zip) </br> [fastspeech2_vctk_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_vctk_onnx_1.1.0.zip) | 145MB|
### Vocoders
Model Type | Dataset| Example Link | Pretrained Models| Static Models|Size (static)
Model Type | Dataset| Example Link | Pretrained Models| Static/ONNX Models|Size (static)
:-----:| :-----:| :-----: | :-----:| :-----:| :-----:
WaveFlow| LJSpeech |[waveflow-ljspeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/ljspeech/voc0)|[waveflow_ljspeech_ckpt_0.3.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/waveflow/waveflow_ljspeech_ckpt_0.3.zip)|||
Parallel WaveGAN| CSMSC |[PWGAN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc1)|[pwg_baker_ckpt_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_baker_ckpt_0.4.zip)|[pwg_baker_static_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_baker_static_0.4.zip)|5.1MB|
Parallel WaveGAN| LJSpeech |[PWGAN-ljspeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/ljspeech/voc1)|[pwg_ljspeech_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_ljspeech_ckpt_0.5.zip)|||
Parallel WaveGAN| AISHELL-3 |[PWGAN-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/voc1)|[pwg_aishell3_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_aishell3_ckpt_0.5.zip)|||
Parallel WaveGAN| VCTK |[PWGAN-vctk](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/vctk/voc1)|[pwg_vctk_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_vctk_ckpt_0.5.zip)|||
|Multi Band MelGAN | CSMSC |[MB MelGAN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc3) | [mb_melgan_csmsc_ckpt_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/mb_melgan/mb_melgan_csmsc_ckpt_0.1.1.zip) <br>[mb_melgan_baker_finetune_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/mb_melgan/mb_melgan_baker_finetune_ckpt_0.5.zip)|[mb_melgan_csmsc_static_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/mb_melgan/mb_melgan_csmsc_static_0.1.1.zip) |8.2MB|
Parallel WaveGAN| CSMSC |[PWGAN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc1)|[pwg_baker_ckpt_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_baker_ckpt_0.4.zip)|[pwg_baker_static_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_baker_static_0.4.zip) </br> [pwgan_csmsc_onnx_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_csmsc_onnx_0.2.0.zip)|4.8MB|
Parallel WaveGAN| LJSpeech |[PWGAN-ljspeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/ljspeech/voc1)|[pwg_ljspeech_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_ljspeech_ckpt_0.5.zip)|[pwgan_ljspeech_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_ljspeech_static_1.1.0.zip) </br> [pwgan_ljspeech_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_ljspeech_onnx_1.1.0.zip)|4.8MB|
Parallel WaveGAN| AISHELL-3 |[PWGAN-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/voc1)|[pwg_aishell3_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_aishell3_ckpt_0.5.zip)| [pwgan_aishell3_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_aishell3_static_1.1.0.zip) </br> [pwgan_aishell3_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_aishell3_onnx_1.1.0.zip)|4.8MB|
Parallel WaveGAN| VCTK |[PWGAN-vctk](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/vctk/voc1)|[pwg_vctk_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_vctk_ckpt_0.5.zip)|[pwgan_vctk_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_vctk_static_1.1.0.zip) </br> [pwgan_vctk_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_vctk_onnx_1.1.0.zip)|4.8MB|
|Multi Band MelGAN | CSMSC |[MB MelGAN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc3) | [mb_melgan_csmsc_ckpt_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/mb_melgan/mb_melgan_csmsc_ckpt_0.1.1.zip) <br>[mb_melgan_baker_finetune_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/mb_melgan/mb_melgan_baker_finetune_ckpt_0.5.zip)|[mb_melgan_csmsc_static_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/mb_melgan/mb_melgan_csmsc_static_0.1.1.zip) </br> [mb_melgan_csmsc_onnx_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/mb_melgan/mb_melgan_csmsc_onnx_0.2.0.zip)|7.6MB|
Style MelGAN | CSMSC |[Style MelGAN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc4)|[style_melgan_csmsc_ckpt_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/style_melgan/style_melgan_csmsc_ckpt_0.1.1.zip)| | |
HiFiGAN | CSMSC |[HiFiGAN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc5)|[hifigan_csmsc_ckpt_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_csmsc_ckpt_0.1.1.zip)|[hifigan_csmsc_static_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_csmsc_static_0.1.1.zip)|50MB|
HiFiGAN | LJSpeech |[HiFiGAN-ljspeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/ljspeech/voc5)|[hifigan_ljspeech_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_ljspeech_ckpt_0.2.0.zip)|||
HiFiGAN | AISHELL-3 |[HiFiGAN-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/voc5)|[hifigan_aishell3_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_aishell3_ckpt_0.2.0.zip)|||
HiFiGAN | VCTK |[HiFiGAN-vctk](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/vctk/voc5)|[hifigan_vctk_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_vctk_ckpt_0.2.0.zip)|||
HiFiGAN | CSMSC |[HiFiGAN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc5)|[hifigan_csmsc_ckpt_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_csmsc_ckpt_0.1.1.zip)|[hifigan_csmsc_static_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_csmsc_static_0.1.1.zip) </br> [hifigan_csmsc_onnx_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_csmsc_onnx_0.2.0.zip)|46MB|
HiFiGAN | LJSpeech |[HiFiGAN-ljspeech](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/ljspeech/voc5)|[hifigan_ljspeech_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_ljspeech_ckpt_0.2.0.zip)|[hifigan_ljspeech_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_ljspeech_static_1.1.0.zip) </br> [hifigan_ljspeech_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_ljspeech_onnx_1.1.0.zip) |49MB|
HiFiGAN | AISHELL-3 |[HiFiGAN-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/voc5)|[hifigan_aishell3_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_aishell3_ckpt_0.2.0.zip)|[hifigan_aishell3_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_aishell3_static_1.1.0.zip) </br> [hifigan_aishell3_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_aishell3_onnx_1.1.0.zip)|46MB|
HiFiGAN | VCTK |[HiFiGAN-vctk](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/vctk/voc5)|[hifigan_vctk_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_vctk_ckpt_0.2.0.zip)|[hifigan_vctk_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_vctk_static_1.1.0.zip) </br> [hifigan_vctk_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_vctk_onnx_1.1.0.zip)|46MB|
WaveRNN | CSMSC |[WaveRNN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/csmsc/voc6)|[wavernn_csmsc_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/wavernn/wavernn_csmsc_ckpt_0.2.0.zip)|[wavernn_csmsc_static_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/wavernn/wavernn_csmsc_static_0.2.0.zip)|18MB|
......
......@@ -220,6 +220,12 @@ Pretrained FastSpeech2 model with no silence in the edge of audios:
- [fastspeech2_nosil_aishell3_ckpt_0.4.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_aishell3_ckpt_0.4.zip)
- [fastspeech2_conformer_aishell3_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_conformer_aishell3_ckpt_0.2.0.zip) (Thanks for [@awmmmm](https://github.com/awmmmm)'s contribution)
The static model can be downloaded here:
- [fastspeech2_aishell3_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_aishell3_static_1.1.0.zip)
The ONNX model can be downloaded here:
- [fastspeech2_aishell3_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_aishell3_onnx_1.1.0.zip)
FastSpeech2 checkpoint contains files listed below.
```text
......
......@@ -17,3 +17,14 @@ if [ ${stage} -le 0 ] && [ ${stop_stage} -ge 0 ]; then
--spk_id=0
fi
if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
python3 ${BIN_DIR}/../inference.py \
--inference_dir=${train_output_path}/inference \
--am=fastspeech2_aishell3 \
--voc=hifigan_aishell3 \
--text=${BIN_DIR}/../sentences.txt \
--output_dir=${train_output_path}/pd_infer_out \
--phones_dict=dump/phone_id_map.txt \
--speaker_dict=dump/speaker_id_map.txt \
--spk_id=0
fi
train_output_path=$1
stage=0
stop_stage=0
# e2e, synthesize from text
if [ ${stage} -le 0 ] && [ ${stop_stage} -ge 0 ]; then
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=fastspeech2_aishell3 \
--voc=pwgan_aishell3 \
--output_dir=${train_output_path}/onnx_infer_out_e2e \
--text=${BIN_DIR}/../csmsc_test.txt \
--phones_dict=dump/phone_id_map.txt \
--device=cpu \
--cpu_threads=2 \
--spk_id=0
fi
if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=fastspeech2_aishell3 \
--voc=hifigan_aishell3 \
--output_dir=${train_output_path}/onnx_infer_out_e2e \
--text=${BIN_DIR}/../csmsc_test.txt \
--phones_dict=dump/phone_id_map.txt \
--device=cpu \
--cpu_threads=2 \
--spk_id=0
fi
../../../csmsc/tts3/local/paddle2onnx.sh
\ No newline at end of file
......@@ -27,11 +27,34 @@ if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
fi
if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then
# synthesize, vocoder is pwgan
# synthesize, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
if [ ${stage} -le 3 ] && [ ${stop_stage} -ge 3 ]; then
# synthesize_e2e, vocoder is pwgan
# synthesize_e2e, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize_e2e.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
if [ ${stage} -le 4 ] && [ ${stop_stage} -ge 4 ]; then
# inference with static model, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/inference.sh ${train_output_path} || exit -1
fi
if [ ${stage} -le 5 ] && [ ${stop_stage} -ge 5 ]; then
# install paddle2onnx
version=$(echo `pip list |grep "paddle2onnx"` |awk -F" " '{print $2}')
if [[ -z "$version" || ${version} != '0.9.8' ]]; then
pip install paddle2onnx==0.9.8
fi
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx fastspeech2_aishell3
# considering the balance between speed and quality, we recommend that you use hifigan as vocoder
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx pwgan_aishell3
# ./local/paddle2onnx.sh ${train_output_path} inference inference_onnx hifigan_aishell3
fi
# inference with onnxruntime, use fastspeech2 + hifigan by default
if [ ${stage} -le 6 ] && [ ${stop_stage} -ge 6 ]; then
./local/ort_predict.sh ${train_output_path}
fi
......@@ -133,6 +133,12 @@ optional arguments:
Pretrained models can be downloaded here:
- [pwg_aishell3_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_aishell3_ckpt_0.5.zip)
The static model can be downloaded here:
- [pwgan_aishell3_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_aishell3_static_1.1.0.zip)
The ONNX model can be downloaded here:
- [pwgan_aishell3_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_aishell3_onnx_1.1.0.zip)
Model | Step | eval/generator_loss | eval/log_stft_magnitude_loss:| eval/spectral_convergence_loss
:-------------:| :------------:| :-----: | :-----: | :--------:
default| 1(gpu) x 400000|1.968762|0.759008|0.218524
......
......@@ -116,6 +116,11 @@ optional arguments:
The pretrained model can be downloaded here:
- [hifigan_aishell3_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_aishell3_ckpt_0.2.0.zip)
The static model can be downloaded here:
- [hifigan_aishell3_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_aishell3_static_1.1.0.zip)
The ONNX model can be downloaded here:
- [hifigan_aishell3_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_aishell3_onnx_1.1.0.zip)
Model | Step | eval/generator_loss | eval/mel_loss| eval/feature_matching_loss
:-------------:| :------------:| :-----: | :-----: | :--------:
......
......@@ -3,22 +3,34 @@ train_output_path=$1
stage=0
stop_stage=0
# only support default_fastspeech2/speedyspeech + hifigan/mb_melgan now!
# synthesize from metadata
# e2e, synthesize from text
if [ ${stage} -le 0 ] && [ ${stop_stage} -ge 0 ]; then
python3 ${BIN_DIR}/../ort_predict.py \
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=speedyspeech_csmsc \
--voc=hifigan_csmsc \
--test_metadata=dump/test/norm/metadata.jsonl \
--output_dir=${train_output_path}/onnx_infer_out \
--voc=pwgan_csmsc \
--output_dir=${train_output_path}/onnx_infer_out_e2e \
--text=${BIN_DIR}/../csmsc_test.txt \
--phones_dict=dump/phone_id_map.txt \
--tones_dict=dump/tone_id_map.txt \
--device=cpu \
--cpu_threads=2
fi
# e2e, synthesize from text
if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=speedyspeech_csmsc \
--voc=mb_melgan_csmsc \
--output_dir=${train_output_path}/onnx_infer_out_e2e \
--text=${BIN_DIR}/../csmsc_test.txt \
--phones_dict=dump/phone_id_map.txt \
--tones_dict=dump/tone_id_map.txt \
--device=cpu \
--cpu_threads=2
fi
if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=speedyspeech_csmsc \
......@@ -30,3 +42,15 @@ if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
--device=cpu \
--cpu_threads=2
fi
# synthesize from metadata
if [ ${stage} -le 3 ] && [ ${stop_stage} -ge 3 ]; then
python3 ${BIN_DIR}/../ort_predict.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=speedyspeech_csmsc \
--voc=hifigan_csmsc \
--test_metadata=dump/test/norm/metadata.jsonl \
--output_dir=${train_output_path}/onnx_infer_out \
--device=cpu \
--cpu_threads=2
fi
......@@ -27,12 +27,12 @@ if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
fi
if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then
# synthesize, vocoder is pwgan
# synthesize, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
if [ ${stage} -le 3 ] && [ ${stop_stage} -ge 3 ]; then
# synthesize_e2e, vocoder is pwgan
# synthesize_e2e, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize_e2e.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
......@@ -46,19 +46,17 @@ fi
if [ ${stage} -le 5 ] && [ ${stop_stage} -ge 5 ]; then
# install paddle2onnx
version=$(echo `pip list |grep "paddle2onnx"` |awk -F" " '{print $2}')
if [[ -z "$version" || ${version} != '0.9.5' ]]; then
pip install paddle2onnx==0.9.5
if [[ -z "$version" || ${version} != '0.9.8' ]]; then
pip install paddle2onnx==0.9.8
fi
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx speedyspeech_csmsc
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx hifigan_csmsc
# considering the balance between speed and quality, we recommend that you use hifigan as vocoder
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx pwgan_csmsc
# ./local/paddle2onnx.sh ${train_output_path} inference inference_onnx mb_melgan_csmsc
# ./local/paddle2onnx.sh ${train_output_path} inference inference_onnx hifigan_csmsc
fi
# inference with onnxruntime, use fastspeech2 + hifigan by default
# inference with onnxruntime
if [ ${stage} -le 6 ] && [ ${stop_stage} -ge 6 ]; then
# install onnxruntime
version=$(echo `pip list |grep "onnxruntime"` |awk -F" " '{print $2}')
if [[ -z "$version" || ${version} != '1.10.0' ]]; then
pip install onnxruntime==1.10.0
fi
./local/ort_predict.sh ${train_output_path}
fi
......@@ -3,22 +3,32 @@ train_output_path=$1
stage=0
stop_stage=0
# only support default_fastspeech2/speedyspeech + hifigan/mb_melgan now!
# synthesize from metadata
# e2e, synthesize from text
if [ ${stage} -le 0 ] && [ ${stop_stage} -ge 0 ]; then
python3 ${BIN_DIR}/../ort_predict.py \
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=fastspeech2_csmsc \
--voc=hifigan_csmsc \
--test_metadata=dump/test/norm/metadata.jsonl \
--output_dir=${train_output_path}/onnx_infer_out \
--voc=pwgan_csmsc \
--output_dir=${train_output_path}/onnx_infer_out_e2e \
--text=${BIN_DIR}/../csmsc_test.txt \
--phones_dict=dump/phone_id_map.txt \
--device=cpu \
--cpu_threads=2
fi
# e2e, synthesize from text
if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=fastspeech2_csmsc \
--voc=mb_melgan_csmsc \
--output_dir=${train_output_path}/onnx_infer_out_e2e \
--text=${BIN_DIR}/../csmsc_test.txt \
--phones_dict=dump/phone_id_map.txt \
--device=cpu \
--cpu_threads=2
fi
if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=fastspeech2_csmsc \
......@@ -29,3 +39,15 @@ if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
--device=cpu \
--cpu_threads=2
fi
# synthesize from metadata, take hifigan as an example
if [ ${stage} -le 3 ] && [ ${stop_stage} -ge 3 ]; then
python3 ${BIN_DIR}/../ort_predict.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=fastspeech2_csmsc \
--voc=hifigan_csmsc \
--test_metadata=dump/test/norm/metadata.jsonl \
--output_dir=${train_output_path}/onnx_infer_out \
--device=cpu \
--cpu_threads=2
fi
\ No newline at end of file
......@@ -5,6 +5,34 @@ stop_stage=0
# e2e, synthesize from text
if [ ${stage} -le 0 ] && [ ${stop_stage} -ge 0 ]; then
python3 ${BIN_DIR}/../ort_predict_streaming.py \
--inference_dir=${train_output_path}/inference_onnx_streaming \
--am=fastspeech2_csmsc \
--am_stat=dump/train/speech_stats.npy \
--voc=pwgan_csmsc \
--output_dir=${train_output_path}/onnx_infer_out_streaming \
--text=${BIN_DIR}/../csmsc_test.txt \
--phones_dict=dump/phone_id_map.txt \
--device=cpu \
--cpu_threads=2 \
--am_streaming=True
fi
if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
python3 ${BIN_DIR}/../ort_predict_streaming.py \
--inference_dir=${train_output_path}/inference_onnx_streaming \
--am=fastspeech2_csmsc \
--am_stat=dump/train/speech_stats.npy \
--voc=mb_melgan_csmsc \
--output_dir=${train_output_path}/onnx_infer_out_streaming \
--text=${BIN_DIR}/../csmsc_test.txt \
--phones_dict=dump/phone_id_map.txt \
--device=cpu \
--cpu_threads=2 \
--am_streaming=True
fi
if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then
python3 ${BIN_DIR}/../ort_predict_streaming.py \
--inference_dir=${train_output_path}/inference_onnx_streaming \
--am=fastspeech2_csmsc \
......
......@@ -24,7 +24,8 @@ if [ ${stage} -le 0 ] && [ ${stop_stage} -ge 0 ]; then
--text=${BIN_DIR}/../sentences.txt \
--output_dir=${train_output_path}/test_e2e_streaming \
--phones_dict=dump/phone_id_map.txt \
--am_streaming=True
--am_streaming=True \
--inference_dir=${train_output_path}/inference_streaming
fi
# for more GAN Vocoders
......@@ -45,7 +46,8 @@ if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
--text=${BIN_DIR}/../sentences.txt \
--output_dir=${train_output_path}/test_e2e_streaming \
--phones_dict=dump/phone_id_map.txt \
--am_streaming=True
--am_streaming=True \
--inference_dir=${train_output_path}/inference_streaming
fi
# the pretrained models haven't release now
......
......@@ -27,17 +27,17 @@ if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
fi
if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then
# synthesize, vocoder is pwgan
# synthesize, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
if [ ${stage} -le 3 ] && [ ${stop_stage} -ge 3 ]; then
# synthesize_e2e, vocoder is pwgan
# synthesize_e2e, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize_e2e.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
if [ ${stage} -le 4 ] && [ ${stop_stage} -ge 4 ]; then
# inference with static model
# inference with static model, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/inference.sh ${train_output_path} || exit -1
fi
......@@ -46,15 +46,18 @@ fi
if [ ${stage} -le 5 ] && [ ${stop_stage} -ge 5 ]; then
# install paddle2onnx
version=$(echo `pip list |grep "paddle2onnx"` |awk -F" " '{print $2}')
if [[ -z "$version" || ${version} != '0.9.5' ]]; then
pip install paddle2onnx==0.9.5
if [[ -z "$version" || ${version} != '0.9.8' ]]; then
pip install paddle2onnx==0.9.8
fi
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx fastspeech2_csmsc
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx hifigan_csmsc
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx mb_melgan_csmsc
# considering the balance between speed and quality, we recommend that you use hifigan as vocoder
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx pwgan_csmsc
# ./local/paddle2onnx.sh ${train_output_path} inference inference_onnx mb_melgan_csmsc
# ./local/paddle2onnx.sh ${train_output_path} inference inference_onnx hifigan_csmsc
fi
# inference with onnxruntime, use fastspeech2 + hifigan by default
# inference with onnxruntime
if [ ${stage} -le 6 ] && [ ${stop_stage} -ge 6 ]; then
./local/ort_predict.sh ${train_output_path}
fi
......@@ -33,25 +33,25 @@ fi
# synthesize_e2e non-streaming
if [ ${stage} -le 3 ] && [ ${stop_stage} -ge 3 ]; then
# synthesize_e2e, vocoder is pwgan
# synthesize_e2e, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize_e2e.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
# inference non-streaming
if [ ${stage} -le 4 ] && [ ${stop_stage} -ge 4 ]; then
# inference with static model
# inference with static model, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/inference.sh ${train_output_path} || exit -1
fi
# synthesize_e2e streaming
if [ ${stage} -le 5 ] && [ ${stop_stage} -ge 5 ]; then
# synthesize_e2e, vocoder is pwgan
# synthesize_e2e, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize_streaming.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
# inference streaming
if [ ${stage} -le 6 ] && [ ${stop_stage} -ge 6 ]; then
# inference with static model
# inference with static model, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/inference_streaming.sh ${train_output_path} || exit -1
fi
......@@ -59,32 +59,37 @@ fi
if [ ${stage} -le 7 ] && [ ${stop_stage} -ge 7 ]; then
# install paddle2onnx
version=$(echo `pip list |grep "paddle2onnx"` |awk -F" " '{print $2}')
if [[ -z "$version" || ${version} != '0.9.5' ]]; then
pip install paddle2onnx==0.9.5
if [[ -z "$version" || ${version} != '0.9.8' ]]; then
pip install paddle2onnx==0.9.8
fi
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx fastspeech2_csmsc
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx hifigan_csmsc
# considering the balance between speed and quality, we recommend that you use hifigan as vocoder
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx pwgan_csmsc
# ./local/paddle2onnx.sh ${train_output_path} inference inference_onnx mb_melgan_csmsc
# ./local/paddle2onnx.sh ${train_output_path} inference inference_onnx hifigan_csmsc
fi
# onnxruntime non streaming
# inference with onnxruntime, use fastspeech2 + hifigan by default
if [ ${stage} -le 8 ] && [ ${stop_stage} -ge 8 ]; then
./local/ort_predict.sh ${train_output_path}
fi
# paddle2onnx streaming
if [ ${stage} -le 9 ] && [ ${stop_stage} -ge 9 ]; then
# install paddle2onnx
version=$(echo `pip list |grep "paddle2onnx"` |awk -F" " '{print $2}')
if [[ -z "$version" || ${version} != '0.9.5' ]]; then
pip install paddle2onnx==0.9.5
if [[ -z "$version" || ${version} != '0.9.8' ]]; then
pip install paddle2onnx==0.9.8
fi
# streaming acoustic model
./local/paddle2onnx.sh ${train_output_path} inference_streaming inference_onnx_streaming fastspeech2_csmsc_am_encoder_infer
./local/paddle2onnx.sh ${train_output_path} inference_streaming inference_onnx_streaming fastspeech2_csmsc_am_decoder
./local/paddle2onnx.sh ${train_output_path} inference_streaming inference_onnx_streaming fastspeech2_csmsc_am_postnet
# vocoder
./local/paddle2onnx.sh ${train_output_path} inference_streaming inference_onnx_streaming hifigan_csmsc
# considering the balance between speed and quality, we recommend that you use hifigan as vocoder
./local/paddle2onnx.sh ${train_output_path} inference_streaming inference_onnx_streaming pwgan_csmsc
# ./local/paddle2onnx.sh ${train_output_path} inference_streaming inference_onnx_streaming mb_melgan_csmsc
# ./local/paddle2onnx.sh ${train_output_path} inference_streaming inference_onnx_streaming hifigan_csmsc
fi
# onnxruntime streaming
......
......@@ -215,6 +215,13 @@ optional arguments:
Pretrained FastSpeech2 model with no silence in the edge of audios:
- [fastspeech2_nosil_ljspeech_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_ljspeech_ckpt_0.5.zip)
The static model can be downloaded here:
- [fastspeech2_ljspeech_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_ljspeech_static_1.1.0.zip)
The ONNX model can be downloaded here:
- [fastspeech2_ljspeech_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_ljspeech_onnx_1.1.0.zip)
Model | Step | eval/loss | eval/l1_loss | eval/duration_loss | eval/pitch_loss| eval/energy_loss
:-------------:| :------------:| :-----: | :-----: | :--------: |:--------:|:---------:
default| 2(gpu) x 100000| 1.505682|0.612104| 0.045505| 0.62792| 0.220147
......
#!/bin/bash
train_output_path=$1
stage=0
stop_stage=0
# pwgan
if [ ${stage} -le 0 ] && [ ${stop_stage} -ge 0 ]; then
python3 ${BIN_DIR}/../inference.py \
--inference_dir=${train_output_path}/inference \
--am=fastspeech2_ljspeech \
--voc=pwgan_ljspeech \
--text=${BIN_DIR}/../sentences_en.txt \
--output_dir=${train_output_path}/pd_infer_out \
--phones_dict=dump/phone_id_map.txt \
--lang=en
fi
# hifigan
if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
python3 ${BIN_DIR}/../inference.py \
--inference_dir=${train_output_path}/inference \
--am=fastspeech2_ljspeech \
--voc=hifigan_ljspeech \
--text=${BIN_DIR}/../sentences_en.txt \
--output_dir=${train_output_path}/pd_infer_out \
--phones_dict=dump/phone_id_map.txt \
--lang=en
fi
train_output_path=$1
stage=0
stop_stage=0
# e2e, synthesize from text
if [ ${stage} -le 0 ] && [ ${stop_stage} -ge 0 ]; then
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=fastspeech2_ljspeech \
--voc=pwgan_ljspeech\
--output_dir=${train_output_path}/onnx_infer_out_e2e \
--text=${BIN_DIR}/../sentences_en.txt \
--phones_dict=dump/phone_id_map.txt \
--device=cpu \
--cpu_threads=2 \
--lang=en
fi
if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=fastspeech2_ljspeech \
--voc=hifigan_ljspeech \
--output_dir=${train_output_path}/onnx_infer_out_e2e \
--text=${BIN_DIR}/../sentences_en.txt \
--phones_dict=dump/phone_id_map.txt \
--device=cpu \
--cpu_threads=2 \
--lang=en
fi
../../../csmsc/tts3/local/paddle2onnx.sh
\ No newline at end of file
......@@ -27,11 +27,35 @@ if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
fi
if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then
# synthesize, vocoder is pwgan
# synthesize, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
if [ ${stage} -le 3 ] && [ ${stop_stage} -ge 3 ]; then
# synthesize_e2e, vocoder is pwgan
# synthesize_e2e, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize_e2e.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
if [ ${stage} -le 4 ] && [ ${stop_stage} -ge 4 ]; then
# inference with static model, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/inference.sh ${train_output_path} || exit -1
fi
# paddle2onnx, please make sure the static models are in ${train_output_path}/inference first
# we have only tested the following models so far
if [ ${stage} -le 5 ] && [ ${stop_stage} -ge 5 ]; then
# install paddle2onnx
version=$(echo `pip list |grep "paddle2onnx"` |awk -F" " '{print $2}')
if [[ -z "$version" || ${version} != '0.9.8' ]]; then
pip install paddle2onnx==0.9.8
fi
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx fastspeech2_ljspeech
# considering the balance between speed and quality, we recommend that you use hifigan as vocoder
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx pwgan_ljspeech
# ./local/paddle2onnx.sh ${train_output_path} inference inference_onnx hifigan_ljspeech
fi
# inference with onnxruntime, use fastspeech2 + hifigan by default
if [ ${stage} -le 6 ] && [ ${stop_stage} -ge 6 ]; then
./local/ort_predict.sh ${train_output_path}
fi
......@@ -130,6 +130,13 @@ optional arguments:
Pretrained models can be downloaded here:
- [pwg_ljspeech_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_ljspeech_ckpt_0.5.zip)
The static model can be downloaded here:
- [pwgan_ljspeech_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_ljspeech_static_1.1.0.zip)
The ONNX model can be downloaded here:
- [pwgan_ljspeech_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_ljspeech_onnx_1.1.0.zip)
Parallel WaveGAN checkpoint contains files listed below.
```text
......
......@@ -115,6 +115,12 @@ optional arguments:
The pretrained model can be downloaded here:
- [hifigan_ljspeech_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_ljspeech_ckpt_0.2.0.zip)
The static model can be downloaded here:
- [hifigan_ljspeech_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_ljspeech_static_1.1.0.zip)
The ONNX model can be downloaded here:
- [hifigan_ljspeech_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_ljspeech_onnx_1.1.0.zip)
Model | Step | eval/generator_loss | eval/mel_loss| eval/feature_matching_loss
:-------------:| :------------:| :-----: | :-----: | :--------:
......
......@@ -218,6 +218,12 @@ optional arguments:
Pretrained FastSpeech2 model with no silence in the edge of audios:
- [fastspeech2_nosil_vctk_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_vctk_ckpt_0.5.zip)
The static model can be downloaded here:
- [fastspeech2_vctk_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_vctk_static_1.1.0.zip)
The ONNX model can be downloaded here:
- [fastspeech2_vctk_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_vctk_onnx_1.1.0.zip)
FastSpeech2 checkpoint contains files listed below.
```text
fastspeech2_nosil_vctk_ckpt_0.5
......
......@@ -18,3 +18,15 @@ if [ ${stage} -le 0 ] && [ ${stop_stage} -ge 0 ]; then
--lang=en
fi
if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
python3 ${BIN_DIR}/../inference.py \
--inference_dir=${train_output_path}/inference \
--am=fastspeech2_vctk \
--voc=hifigan_vctk \
--text=${BIN_DIR}/../sentences_en.txt \
--output_dir=${train_output_path}/pd_infer_out \
--phones_dict=dump/phone_id_map.txt \
--speaker_dict=dump/speaker_id_map.txt \
--spk_id=0 \
--lang=en
fi
train_output_path=$1
stage=0
stop_stage=0
# e2e, synthesize from text
if [ ${stage} -le 0 ] && [ ${stop_stage} -ge 0 ]; then
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=fastspeech2_vctk \
--voc=pwgan_vctk \
--output_dir=${train_output_path}/onnx_infer_out_e2e \
--text=${BIN_DIR}/../sentences_en.txt \
--phones_dict=dump/phone_id_map.txt \
--device=cpu \
--cpu_threads=2 \
--spk_id=0 \
--lang=en
fi
if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
python3 ${BIN_DIR}/../ort_predict_e2e.py \
--inference_dir=${train_output_path}/inference_onnx \
--am=fastspeech2_vctk \
--voc=hifigan_vctk \
--output_dir=${train_output_path}/onnx_infer_out_e2e \
--text=${BIN_DIR}/../sentences_en.txt \
--phones_dict=dump/phone_id_map.txt \
--device=cpu \
--cpu_threads=2 \
--spk_id=0 \
--lang=en
fi
../../../csmsc/tts3/local/paddle2onnx.sh
\ No newline at end of file
......@@ -27,11 +27,34 @@ if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
fi
if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then
# synthesize, vocoder is pwgan
# synthesize, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
if [ ${stage} -le 3 ] && [ ${stop_stage} -ge 3 ]; then
# synthesize_e2e, vocoder is pwgan
# synthesize_e2e, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/synthesize_e2e.sh ${conf_path} ${train_output_path} ${ckpt_name} || exit -1
fi
if [ ${stage} -le 4 ] && [ ${stop_stage} -ge 4 ]; then
# inference with static model, vocoder is pwgan by default
CUDA_VISIBLE_DEVICES=${gpus} ./local/inference.sh ${train_output_path} || exit -1
fi
if [ ${stage} -le 5 ] && [ ${stop_stage} -ge 5 ]; then
# install paddle2onnx
version=$(echo `pip list |grep "paddle2onnx"` |awk -F" " '{print $2}')
if [[ -z "$version" || ${version} != '0.9.8' ]]; then
pip install paddle2onnx==0.9.8
fi
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx fastspeech2_vctk
# considering the balance between speed and quality, we recommend that you use hifigan as vocoder
./local/paddle2onnx.sh ${train_output_path} inference inference_onnx pwgan_vctk
# ./local/paddle2onnx.sh ${train_output_path} inference inference_onnx hifigan_vctk
fi
# inference with onnxruntime, use fastspeech2 + hifigan by default
if [ ${stage} -le 6 ] && [ ${stop_stage} -ge 6 ]; then
./local/ort_predict.sh ${train_output_path}
fi
......@@ -135,6 +135,13 @@ optional arguments:
Pretrained models can be downloaded here:
- [pwg_vctk_ckpt_0.1.1.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwg_vctk_ckpt_0.1.1.zip)
The static model can be downloaded here:
- [pwgan_vctk_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_vctk_static_1.1.0.zip)
The ONNX model can be downloaded here:
- [pwgan_vctk_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_vctk_onnx_1.1.0.zip)
Parallel WaveGAN checkpoint contains files listed below.
```text
......
......@@ -121,6 +121,12 @@ optional arguments:
The pretrained model can be downloaded here:
- [hifigan_vctk_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_vctk_ckpt_0.2.0.zip)
The static model can be downloaded here:
- [hifigan_vctk_static_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_vctk_static_1.1.0.zip)
The ONNX model can be downloaded here:
- [hifigan_vctk_onnx_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_vctk_onnx_1.1.0.zip)
Model | Step | eval/generator_loss | eval/mel_loss| eval/feature_matching_loss
:-------------:| :------------:| :-----: | :-----: | :--------:
......
# copy this to root directory of data and
# chmod a+x convert.sh
# ./convert.sh
# https://unix.stackexchange.com/questions/103920/parallelize-a-bash-for-loop
dir=$1
open_sem(){
mkfifo pipe-$$
exec 3<>pipe-$$
rm pipe-$$
local i=$1
for((;i>0;i--)); do
printf %s 000 >&3
done
}
run_with_lock(){
local x
read -u 3 -n 3 x && ((0==x)) || exit $x
(
( "$@"; )
printf '%.3d' $? >&3
)&
}
N=32 # number of vCPU
open_sem $N
for f in $(find ${dir} -name "*.m4a"); do
run_with_lock ffmpeg -loglevel panic -i "$f" -ar 16000 "${f%.*}.wav"
done
......@@ -7,7 +7,7 @@ gpus=0,1,2,3,4,5,6,7
stage=0
stop_stage=100
conf_path=conf/conformer.yaml
ips= #xxx.xxx.xxx, xxx.xxx.xxx.xxx
ips= #xxx.xxx.xxx.xxx,xxx.xxx.xxx.xxx
decode_conf_path=conf/tuning/decode.yaml
average_checkpoint=true
avg_num=10
......
......@@ -44,7 +44,7 @@ from .filters import (
cmvn,
placeholder,
)
from webdataset.handlers import (
from .handlers import (
ignore_and_continue,
ignore_and_stop,
reraise_exception,
......
......@@ -94,7 +94,7 @@ class StatsCommand:
def __init__(self):
self.parser = argparse.ArgumentParser(
prog='paddlespeech.stats', add_help=True)
self.task_choices = ['asr', 'cls', 'st', 'text', 'tts', 'vector']
self.task_choices = ['asr', 'cls', 'st', 'text', 'tts', 'vector', 'kws']
self.parser.add_argument(
'--task',
type=str,
......@@ -138,6 +138,7 @@ _commands = {
'text': ['Text command.', 'TextExecutor'],
'tts': ['Text to Speech infer command.', 'TTSExecutor'],
'vector': ['Speech to vector embedding infer command.', 'VectorExecutor'],
'kws': ['Keyword Spotting infer command.', 'KWSExecutor'],
}
for com, info in _commands.items():
......
# Copyright (c) 2022 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
from .infer import KWSExecutor
# Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import argparse
import os
from collections import OrderedDict
from typing import List
from typing import Optional
from typing import Union
import paddle
import yaml
from ..executor import BaseExecutor
from ..log import logger
from ..utils import stats_wrapper
from paddlespeech.audio import load
from paddlespeech.audio.compliance.kaldi import fbank as kaldi_fbank
__all__ = ['KWSExecutor']
class KWSExecutor(BaseExecutor):
def __init__(self):
super().__init__(task='kws')
self.parser = argparse.ArgumentParser(
prog='paddlespeech.kws', add_help=True)
self.parser.add_argument(
'--input',
type=str,
default=None,
help='Audio file to keyword spotting.')
self.parser.add_argument(
'--threshold',
type=float,
default=0.8,
help='Score threshold for keyword spotting.')
self.parser.add_argument(
'--model',
type=str,
default='mdtc_heysnips',
choices=[
tag[:tag.index('-')]
for tag in self.task_resource.pretrained_models.keys()
],
help='Choose model type of kws task.')
self.parser.add_argument(
'--config',
type=str,
default=None,
help='Config of kws task. Use deault config when it is None.')
self.parser.add_argument(
'--ckpt_path',
type=str,
default=None,
help='Checkpoint file of model.')
self.parser.add_argument(
'--device',
type=str,
default=paddle.get_device(),
help='Choose device to execute model inference.')
self.parser.add_argument(
'-d',
'--job_dump_result',
action='store_true',
help='Save job result into file.')
self.parser.add_argument(
'-v',
'--verbose',
action='store_true',
help='Increase logger verbosity of current task.')
def _init_from_path(self,
model_type: str='mdtc_heysnips',
cfg_path: Optional[os.PathLike]=None,
ckpt_path: Optional[os.PathLike]=None):
"""
Init model and other resources from a specific path.
"""
if hasattr(self, 'model'):
logger.info('Model had been initialized.')
return
if ckpt_path is None:
tag = model_type + '-' + '16k'
self.task_resource.set_task_model(tag)
self.cfg_path = os.path.join(
self.task_resource.res_dir,
self.task_resource.res_dict['cfg_path'])
self.ckpt_path = os.path.join(
self.task_resource.res_dir,
self.task_resource.res_dict['ckpt_path'] + '.pdparams')
else:
self.cfg_path = os.path.abspath(cfg_path)
self.ckpt_path = os.path.abspath(ckpt_path)
# config
with open(self.cfg_path, 'r') as f:
config = yaml.safe_load(f)
# model
backbone_class = self.task_resource.get_model_class(
model_type.split('_')[0])
model_class = self.task_resource.get_model_class(
model_type.split('_')[0] + '_for_kws')
backbone = backbone_class(
stack_num=config['stack_num'],
stack_size=config['stack_size'],
in_channels=config['in_channels'],
res_channels=config['res_channels'],
kernel_size=config['kernel_size'],
causal=True, )
self.model = model_class(
backbone=backbone, num_keywords=config['num_keywords'])
model_dict = paddle.load(self.ckpt_path)
self.model.set_state_dict(model_dict)
self.model.eval()
self.feature_extractor = lambda x: kaldi_fbank(
x, sr=config['sample_rate'],
frame_shift=config['frame_shift'],
frame_length=config['frame_length'],
n_mels=config['n_mels']
)
def preprocess(self, audio_file: Union[str, os.PathLike]):
"""
Input preprocess and return paddle.Tensor stored in self.input.
Input content can be a text(tts), a file(asr, cls) or a streaming(not supported yet).
"""
assert os.path.isfile(audio_file)
waveform, _ = load(audio_file)
if isinstance(audio_file, (str, os.PathLike)):
logger.info("Preprocessing audio_file:" + audio_file)
# Feature extraction
waveform = paddle.to_tensor(waveform).unsqueeze(0)
self._inputs['feats'] = self.feature_extractor(waveform).unsqueeze(0)
@paddle.no_grad()
def infer(self):
"""
Model inference and result stored in self.output.
"""
self._outputs['logits'] = self.model(self._inputs['feats'])
def postprocess(self, threshold: float) -> Union[str, os.PathLike]:
"""
Output postprocess and return human-readable results such as texts and audio files.
"""
kws_score = max(self._outputs['logits'][0, :, 0]).item()
return 'Score: {:.3f}, Threshold: {}, Is keyword: {}'.format(
kws_score, threshold, kws_score > threshold)
def execute(self, argv: List[str]) -> bool:
"""
Command line entry.
"""
parser_args = self.parser.parse_args(argv)
model_type = parser_args.model
cfg_path = parser_args.config
ckpt_path = parser_args.ckpt_path
device = parser_args.device
threshold = parser_args.threshold
if not parser_args.verbose:
self.disable_task_loggers()
task_source = self.get_input_source(parser_args.input)
task_results = OrderedDict()
has_exceptions = False
for id_, input_ in task_source.items():
try:
res = self(input_, threshold, model_type, cfg_path, ckpt_path,
device)
task_results[id_] = res
except Exception as e:
has_exceptions = True
task_results[id_] = f'{e.__class__.__name__}: {e}'
self.process_task_results(parser_args.input, task_results,
parser_args.job_dump_result)
if has_exceptions:
return False
else:
return True
@stats_wrapper
def __call__(self,
audio_file: os.PathLike,
threshold: float=0.8,
model: str='mdtc_heysnips',
config: Optional[os.PathLike]=None,
ckpt_path: Optional[os.PathLike]=None,
device: str=paddle.get_device()):
"""
Python API to call an executor.
"""
audio_file = os.path.abspath(os.path.expanduser(audio_file))
paddle.set_device(device)
self._init_from_path(model, config, ckpt_path)
self.preprocess(audio_file)
self.infer()
res = self.postprocess(threshold)
return res
......@@ -83,4 +83,10 @@ model_alias = {
# ------------ Vector -------------
# ---------------------------------
"ecapatdnn": ["paddlespeech.vector.models.ecapa_tdnn:EcapaTdnn"],
# ---------------------------------
# -------------- kws --------------
# ---------------------------------
"mdtc": ["paddlespeech.kws.models.mdtc:MDTC"],
"mdtc_for_kws": ["paddlespeech.kws.models.mdtc:KWSModel"],
}
......@@ -639,6 +639,22 @@ tts_dynamic_pretrained_models = {
'speaker_id_map.txt',
},
},
"fastspeech2_cnndecoder_csmsc-zh": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_cnndecoder_csmsc_ckpt_1.0.0.zip',
'md5':
'6eb28e22ace73e0ebe7845f86478f89f',
'config':
'cnndecoder.yaml',
'ckpt':
'snapshot_iter_153000.pdz',
'speech_stats':
'speech_stats.npy',
'phones_dict':
'phone_id_map.txt',
},
},
# tacotron2
"tacotron2_csmsc-zh": {
'1.0': {
......@@ -831,22 +847,6 @@ tts_dynamic_pretrained_models = {
'feats_stats.npy',
},
},
"fastspeech2_cnndecoder_csmsc-zh": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_cnndecoder_csmsc_ckpt_1.0.0.zip',
'md5':
'6eb28e22ace73e0ebe7845f86478f89f',
'config':
'cnndecoder.yaml',
'ckpt':
'snapshot_iter_153000.pdz',
'speech_stats':
'speech_stats.npy',
'phones_dict':
'phone_id_map.txt',
},
},
}
tts_static_pretrained_models = {
......@@ -886,6 +886,58 @@ tts_static_pretrained_models = {
24000,
},
},
"fastspeech2_ljspeech-en": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_ljspeech_static_1.1.0.zip',
'md5':
'c49f70b52973423ec45aaa6184fb5bc6',
'model':
'fastspeech2_ljspeech.pdmodel',
'params':
'fastspeech2_ljspeech.pdiparams',
'phones_dict':
'phone_id_map.txt',
'sample_rate':
22050,
},
},
"fastspeech2_aishell3-zh": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_aishell3_static_1.1.0.zip',
'md5':
'695af44679f48eb4abc159977ddaee16',
'model':
'fastspeech2_aishell3.pdmodel',
'params':
'fastspeech2_aishell3.pdiparams',
'phones_dict':
'phone_id_map.txt',
'speaker_dict':
'speaker_id_map.txt',
'sample_rate':
24000,
},
},
"fastspeech2_vctk-en": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_vctk_static_1.1.0.zip',
'md5':
'92d8c082f180bda2fd05a534fb4a1b62',
'model':
'fastspeech2_vctk.pdmodel',
'params':
'fastspeech2_vctk.pdiparams',
'phones_dict':
'phone_id_map.txt',
'speaker_dict':
'speaker_id_map.txt',
'sample_rate':
24000,
},
},
# pwgan
"pwgan_csmsc-zh": {
'1.0': {
......@@ -901,6 +953,48 @@ tts_static_pretrained_models = {
24000,
},
},
"pwgan_ljspeech-en": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_ljspeech_static_1.1.0.zip',
'md5':
'6f457a069da99c6814ac1fb4677281e4',
'model':
'pwgan_ljspeech.pdmodel',
'params':
'pwgan_ljspeech.pdiparams',
'sample_rate':
22050,
},
},
"pwgan_aishell3-zh": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_aishell3_static_1.1.0.zip',
'md5':
'199f64010238275fbdacb326a5cf82d1',
'model':
'pwgan_aishell3.pdmodel',
'params':
'pwgan_aishell3.pdiparams',
'sample_rate':
24000,
},
},
"pwgan_vctk-en": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_vctk_static_1.1.0.zip',
'md5':
'ee0fc571ad5a7fbe4ca20e49df22b819',
'model':
'pwgan_vctk.pdmodel',
'params':
'pwgan_vctk.pdiparams',
'sample_rate':
24000,
},
},
# mb_melgan
"mb_melgan_csmsc-zh": {
'1.0': {
......@@ -931,9 +1025,67 @@ tts_static_pretrained_models = {
24000,
},
},
"hifigan_ljspeech-en": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_ljspeech_static_1.1.0.zip',
'md5':
'8c674e79be7c45f6eda74825316438a0',
'model':
'hifigan_ljspeech.pdmodel',
'params':
'hifigan_ljspeech.pdiparams',
'sample_rate':
22050,
},
},
"hifigan_aishell3-zh": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_aishell3_static_1.1.0.zip',
'md5':
'7a10ec5d8d851e2000128f040d30cc01',
'model':
'hifigan_aishell3.pdmodel',
'params':
'hifigan_aishell3.pdiparams',
'sample_rate':
24000,
},
},
"hifigan_vctk-en": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_vctk_static_1.1.0.zip',
'md5':
'130f791dfac84ccdd44ccbdfb67bf08e',
'model':
'hifigan_vctk.pdmodel',
'params':
'hifigan_vctk.pdiparams',
'sample_rate':
24000,
},
},
}
tts_onnx_pretrained_models = {
# speedyspeech
"speedyspeech_csmsc_onnx-zh": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/speedyspeech/speedyspeech_csmsc_onnx_0.2.0.zip',
'md5':
'3e9c45af9ef70675fc1968ed5074fc88',
'ckpt': ['speedyspeech_csmsc.onnx'],
'phones_dict':
'phone_id_map.txt',
'tones_dict':
'tone_id_map.txt',
'sample_rate':
24000,
},
},
# fastspeech2
"fastspeech2_csmsc_onnx-zh": {
'1.0': {
......@@ -948,6 +1100,49 @@ tts_onnx_pretrained_models = {
24000,
},
},
"fastspeech2_ljspeech_onnx-en": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_ljspeech_onnx_1.1.0.zip',
'md5':
'00754307636a48c972a5f3e65cda3d18',
'ckpt': ['fastspeech2_ljspeech.onnx'],
'phones_dict':
'phone_id_map.txt',
'sample_rate':
22050,
},
},
"fastspeech2_aishell3_onnx-zh": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_aishell3_onnx_1.1.0.zip',
'md5':
'a1d6ee21de897ce394f5469e2bb4df0d',
'ckpt': ['fastspeech2_aishell3.onnx'],
'phones_dict':
'phone_id_map.txt',
'speaker_dict':
'speaker_id_map.txt',
'sample_rate':
24000,
},
},
"fastspeech2_vctk_onnx-en": {
'1.0': {
'url':
'hhttps://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_vctk_onnx_1.1.0.zip',
'md5':
'd9c3a9b02204a2070504dd99f5f959bf',
'ckpt': ['fastspeech2_vctk.onnx'],
'phones_dict':
'phone_id_map.txt',
'speaker_dict':
'speaker_id_map.txt',
'sample_rate':
24000,
},
},
"fastspeech2_cnndecoder_csmsc_onnx-zh": {
'1.0': {
'url':
......@@ -967,6 +1162,55 @@ tts_onnx_pretrained_models = {
24000,
},
},
# pwgan
"pwgan_csmsc_onnx-zh": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_csmsc_onnx_0.2.0.zip',
'md5':
'711d0ade33e73f3b721efc9f20669f9c',
'ckpt':
'pwgan_csmsc.onnx',
'sample_rate':
24000,
},
},
"pwgan_ljspeech_onnx-en": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_ljspeech_onnx_1.1.0.zip',
'md5':
'73cdeeccb77f2ea6ed4d07e71d8ac8b8',
'ckpt':
'pwgan_ljspeech.onnx',
'sample_rate':
22050,
},
},
"pwgan_aishell3_onnx-zh": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_aishell3_onnx_1.1.0.zip',
'md5':
'096ab64e152a4fa476aff79ebdadb01b',
'ckpt':
'pwgan_aishell3.onnx',
'sample_rate':
24000,
},
},
"pwgan_vctk_onnx-en": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/pwgan/pwgan_vctk_onnx_1.1.0.zip',
'md5':
'4e754d42cf85f6428f0af887c923d86c',
'ckpt':
'pwgan_vctk.onnx',
'sample_rate':
24000,
},
},
# mb_melgan
"mb_melgan_csmsc_onnx-zh": {
'1.0': {
......@@ -993,6 +1237,42 @@ tts_onnx_pretrained_models = {
24000,
},
},
"hifigan_ljspeech_onnx-en": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_ljspeech_onnx_1.1.0.zip',
'md5':
'062f54b79c1135a50adb5fc8406260b2',
'ckpt':
'hifigan_ljspeech.onnx',
'sample_rate':
22050,
},
},
"hifigan_aishell3_onnx-zh": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_aishell3_onnx_1.1.0.zip',
'md5':
'd6c0d684ad148583ca57837d5e870167',
'ckpt':
'hifigan_aishell3.onnx',
'sample_rate':
24000,
},
},
"hifigan_vctk_onnx-en": {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_vctk_onnx_1.1.0.zip',
'md5':
'fd714df3be283c0efbefc8510160ff6d',
'ckpt':
'hifigan_vctk.onnx',
'sample_rate':
24000,
},
},
}
# ---------------------------------
......@@ -1014,3 +1294,21 @@ vector_dynamic_pretrained_models = {
},
},
}
# ---------------------------------
# ------------- KWS ---------------
# ---------------------------------
kws_dynamic_pretrained_models = {
'mdtc_heysnips-16k': {
'1.0': {
'url':
'https://paddlespeech.bj.bcebos.com/kws/heysnips/kws0_mdtc_heysnips_ckpt.tar.gz',
'md5':
'c0de0a9520d66c3c8d6679460893578f',
'cfg_path':
'conf/mdtc.yaml',
'ckpt_path':
'ckpt/model',
},
},
}
......@@ -22,7 +22,7 @@ from ..utils.dynamic_import import dynamic_import
from ..utils.env import MODEL_HOME
from .model_alias import model_alias
task_supported = ['asr', 'cls', 'st', 'text', 'tts', 'vector']
task_supported = ['asr', 'cls', 'st', 'text', 'tts', 'vector', 'kws']
model_format_supported = ['dynamic', 'static', 'onnx']
inference_mode_supported = ['online', 'offline']
......@@ -164,7 +164,6 @@ class CommonTaskResource:
try:
import_models = '{}_{}_pretrained_models'.format(self.task,
self.model_format)
print(f"from .pretrained_models import {import_models}")
exec('from .pretrained_models import {}'.format(import_models))
models = OrderedDict(locals()[import_models])
except Exception as e:
......
......@@ -99,7 +99,7 @@ class Log():
_call_from_cli = False
_frame = inspect.currentframe()
while _frame:
if 'paddlespeech/cli/__init__.py' in _frame.f_code.co_filename or 'paddlespeech/t2s' in _frame.f_code.co_filename:
if 'paddlespeech/cli/entry.py' in _frame.f_code.co_filename or 'paddlespeech/t2s' in _frame.f_code.co_filename:
_call_from_cli = True
break
_frame = _frame.f_back
......
......@@ -66,15 +66,15 @@ class TTSServerExecutor(TTSExecutor):
return
# am
am_tag = am + '-' + lang
self.task_resource.set_task_model(
model_tag=am_tag,
model_type=0, # am
version=None, # default version
)
self.am_res_path = self.task_resource.res_dir
if am == "fastspeech2_csmsc_onnx":
# get model info
if am_ckpt is None or phones_dict is None:
self.task_resource.set_task_model(
model_tag=am_tag,
model_type=0, # am
version=None, # default version
)
self.am_res_path = self.task_resource.res_dir
self.am_ckpt = os.path.join(
self.am_res_path, self.task_resource.res_dict['ckpt'][0])
# must have phones_dict in acoustic
......@@ -86,13 +86,19 @@ class TTSServerExecutor(TTSExecutor):
self.am_ckpt = os.path.abspath(am_ckpt[0])
self.phones_dict = os.path.abspath(phones_dict)
self.am_res_path = os.path.dirname(
os.path.abspath(self.am_ckpt))
os.path.abspath(am_ckpt))
# create am sess
self.am_sess = get_sess(self.am_ckpt, am_sess_conf)
elif am == "fastspeech2_cnndecoder_csmsc_onnx":
if am_ckpt is None or am_stat is None or phones_dict is None:
self.task_resource.set_task_model(
model_tag=am_tag,
model_type=0, # am
version=None, # default version
)
self.am_res_path = self.task_resource.res_dir
self.am_encoder_infer = os.path.join(
self.am_res_path, self.task_resource.res_dict['ckpt'][0])
self.am_decoder = os.path.join(
......@@ -114,7 +120,7 @@ class TTSServerExecutor(TTSExecutor):
self.phones_dict = os.path.abspath(phones_dict)
self.am_stat = os.path.abspath(am_stat)
self.am_res_path = os.path.dirname(
os.path.abspath(self.am_ckpt))
os.path.abspath(am_ckpt[0]))
# create am sess
self.am_encoder_infer_sess = get_sess(self.am_encoder_infer,
......@@ -130,12 +136,13 @@ class TTSServerExecutor(TTSExecutor):
# voc model info
voc_tag = voc + '-' + lang
self.task_resource.set_task_model(
model_tag=voc_tag,
model_type=1, # vocoder
version=None, # default version
)
if voc_ckpt is None:
self.task_resource.set_task_model(
model_tag=voc_tag,
model_type=1, # vocoder
version=None, # default version
)
self.voc_res_path = self.task_resource.voc_res_dir
self.voc_ckpt = os.path.join(
self.voc_res_path, self.task_resource.voc_res_dict['ckpt'])
......
......@@ -35,8 +35,12 @@ def parse_args():
type=str,
default='fastspeech2_csmsc',
choices=[
'speedyspeech_csmsc', 'fastspeech2_csmsc', 'fastspeech2_aishell3',
'fastspeech2_vctk', 'tacotron2_csmsc'
'speedyspeech_csmsc',
'fastspeech2_csmsc',
'fastspeech2_aishell3',
'fastspeech2_ljspeech',
'fastspeech2_vctk',
'tacotron2_csmsc',
],
help='Choose acoustic model type of tts task.')
parser.add_argument(
......@@ -56,8 +60,16 @@ def parse_args():
type=str,
default='pwgan_csmsc',
choices=[
'pwgan_csmsc', 'mb_melgan_csmsc', 'hifigan_csmsc', 'pwgan_aishell3',
'pwgan_vctk', 'wavernn_csmsc'
'pwgan_csmsc',
'pwgan_aishell3',
'pwgan_ljspeech',
'pwgan_vctk',
'mb_melgan_csmsc',
'hifigan_csmsc',
'hifigan_aishell3',
'hifigan_ljspeech',
'hifigan_vctk',
'wavernn_csmsc',
],
help='Choose vocoder type of tts task.')
# other
......
......@@ -54,19 +54,31 @@ def ort_predict(args):
device=args.device,
cpu_threads=args.cpu_threads)
merge_sentences = True
# frontend warmup
# Loading model cost 0.5+ seconds
if args.lang == 'zh':
frontend.get_input_ids("你好,欢迎使用飞桨框架进行深度学习研究!", merge_sentences=True)
frontend.get_input_ids(
"你好,欢迎使用飞桨框架进行深度学习研究!", merge_sentences=merge_sentences)
else:
print("lang should in be 'zh' here!")
frontend.get_input_ids(
"hello, thank you, thank you very much",
merge_sentences=merge_sentences)
# am warmup
spk_id = [args.spk_id]
for T in [27, 38, 54]:
am_input_feed = {}
if am_name == 'fastspeech2':
phone_ids = np.random.randint(1, 266, size=(T, ))
if args.lang == 'en':
phone_ids = np.random.randint(1, 78, size=(T, ))
else:
phone_ids = np.random.randint(1, 266, size=(T, ))
am_input_feed.update({'text': phone_ids})
if am_dataset in {"aishell3", "vctk"}:
am_input_feed.update({'spk_id': spk_id})
elif am_name == 'speedyspeech':
phone_ids = np.random.randint(1, 92, size=(T, ))
tone_ids = np.random.randint(1, 5, size=(T, ))
......@@ -96,12 +108,18 @@ def ort_predict(args):
phone_ids = input_ids["phone_ids"]
if get_tone_ids:
tone_ids = input_ids["tone_ids"]
elif args.lang == 'en':
input_ids = frontend.get_input_ids(
sentence, merge_sentences=merge_sentences)
phone_ids = input_ids["phone_ids"]
else:
print("lang should in be 'zh' here!")
print("lang should in {'zh', 'en'}!")
# merge_sentences=True here, so we only use the first item of phone_ids
phone_ids = phone_ids[0].numpy()
if am_name == 'fastspeech2':
am_input_feed.update({'text': phone_ids})
if am_dataset in {"aishell3", "vctk"}:
am_input_feed.update({'spk_id': spk_id})
elif am_name == 'speedyspeech':
tone_ids = tone_ids[0].numpy()
am_input_feed.update({'phones': phone_ids, 'tones': tone_ids})
......@@ -130,19 +148,40 @@ def parse_args():
'--am',
type=str,
default='fastspeech2_csmsc',
choices=['fastspeech2_csmsc', 'speedyspeech_csmsc'],
choices=[
'fastspeech2_csmsc',
'fastspeech2_aishell3',
'fastspeech2_ljspeech',
'fastspeech2_vctk',
'speedyspeech_csmsc',
],
help='Choose acoustic model type of tts task.')
parser.add_argument(
"--phones_dict", type=str, default=None, help="phone vocabulary file.")
parser.add_argument(
"--tones_dict", type=str, default=None, help="tone vocabulary file.")
parser.add_argument(
'--spk_id',
type=int,
default=0,
help='spk id for multi speaker acoustic model')
# voc
parser.add_argument(
'--voc',
type=str,
default='hifigan_csmsc',
choices=['hifigan_csmsc', 'mb_melgan_csmsc', 'pwgan_csmsc'],
choices=[
'pwgan_csmsc',
'pwgan_aishell3',
'pwgan_ljspeech',
'pwgan_vctk',
'hifigan_csmsc',
'hifigan_aishell3',
'hifigan_ljspeech',
'hifigan_vctk',
'mb_melgan_csmsc',
],
help='Choose vocoder type of tts task.')
# other
parser.add_argument(
......
......@@ -32,14 +32,45 @@ VERSION = '0.0.0'
COMMITID = 'none'
base = [
"editdistance", "g2p_en", "g2pM", "h5py", "inflect", "jieba", "jsonlines",
"kaldiio", "librosa==0.8.1", "loguru", "matplotlib", "nara_wpe",
"onnxruntime", "pandas", "paddlenlp", "paddlespeech_feat", "praatio==5.0.0",
"pypinyin", "pypinyin-dict", "python-dateutil", "pyworld", "resampy==0.2.2",
"sacrebleu", "scipy", "sentencepiece~=0.1.96", "soundfile~=0.10",
"textgrid", "timer", "tqdm", "typeguard", "visualdl", "webrtcvad",
"yacs~=0.1.8", "prettytable", "zhon", "colorlog", "pathos == 0.2.8",
"braceexpand", "pyyaml"
"editdistance",
"g2p_en",
"g2pM",
"h5py",
"inflect",
"jieba",
"jsonlines",
"kaldiio",
"librosa==0.8.1",
"loguru",
"matplotlib",
"nara_wpe",
"onnxruntime",
"pandas",
"paddlenlp",
"paddlespeech_feat",
"praatio==5.0.0",
"pypinyin",
"pypinyin-dict",
"python-dateutil",
"pyworld==0.2.12",
"resampy==0.2.2",
"sacrebleu",
"scipy",
"sentencepiece~=0.1.96",
"soundfile~=0.10",
"textgrid",
"timer",
"tqdm",
"typeguard",
"visualdl",
"webrtcvad",
"yacs~=0.1.8",
"prettytable",
"zhon",
"colorlog",
"pathos == 0.2.8",
"braceexpand",
"pyyaml"
]
server = [
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册