Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
PaddlePaddle
DeepSpeech
提交
e1f86954
D
DeepSpeech
项目概览
PaddlePaddle
/
DeepSpeech
大约 2 年 前同步成功
通知
210
Star
8425
Fork
1598
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
245
列表
看板
标记
里程碑
合并请求
3
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
D
DeepSpeech
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
245
Issue
245
列表
看板
标记
里程碑
合并请求
3
合并请求
3
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
提交
e1f86954
编写于
7月 20, 2022
作者:
L
lym0302
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
add mix tts, test=tts
上级
21dc77f7
变更
5
隐藏空白更改
内联
并排
Showing
5 changed file
with
101 addition
and
0 deletion
+101
-0
examples/zh_en_tts/tts3/README.md
examples/zh_en_tts/tts3/README.md
+26
-0
examples/zh_en_tts/tts3/local/synthesize_e2e.sh
examples/zh_en_tts/tts3/local/synthesize_e2e.sh
+31
-0
examples/zh_en_tts/tts3/path.sh
examples/zh_en_tts/tts3/path.sh
+13
-0
examples/zh_en_tts/tts3/test.sh
examples/zh_en_tts/tts3/test.sh
+23
-0
paddlespeech/t2s/exps/sentences_mix.txt
paddlespeech/t2s/exps/sentences_mix.txt
+8
-0
未找到文件。
examples/zh_en_tts/tts3/README.md
0 → 100644
浏览文件 @
e1f86954
# Test
We train a Chinese-English mixed fastspeech2 model. The training code is still being sorted out, let's show how to use it first.
The sample rate of the synthesized audio is 22050 Hz.
## Download pretrained models
Put pretrained models in a directory named
`models`
.
-
[
fastspeech2_csmscljspeech_add-zhen.zip
](
https://paddlespeech.bj.bcebos.com/t2s/chinse_english_mixed/models/fastspeech2_csmscljspeech_add-zhen.zip
)
-
[
hifigan_ljspeech_ckpt_0.2.0.zip
](
https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_ljspeech_ckpt_0.2.0.zip
)
```
bash
mkdir
models
cd
models
wget https://paddlespeech.bj.bcebos.com/t2s/chinse_english_mixed/models/fastspeech2_csmscljspeech_add-zhen.zip
unzip fastspeech2_csmscljspeech_add-zhen.zip
wget https://paddlespeech.bj.bcebos.com/Parakeet/released_models/hifigan/hifigan_ljspeech_ckpt_0.2.0.zip
unzip hifigan_ljspeech_ckpt_0.2.0.zip
cd
../
```
## test
You can choose
`--spk_id`
{0, 1} in
`local/synthesize_e2e.sh`
.
```
bash
bash test.sh
```
examples/zh_en_tts/tts3/local/synthesize_e2e.sh
0 → 100755
浏览文件 @
e1f86954
#!/bin/bash
model_dir
=
$1
output
=
$2
am_name
=
fastspeech2_csmscljspeech_add-zhen
am_model_dir
=
${
model_dir
}
/
${
am_name
}
/
stage
=
1
stop_stage
=
1
# hifigan
if
[
${
stage
}
-le
1
]
&&
[
${
stop_stage
}
-ge
1
]
;
then
FLAGS_allocator_strategy
=
naive_best_fit
\
FLAGS_fraction_of_gpu_memory_to_use
=
0.01
\
python3
${
BIN_DIR
}
/../synthesize_e2e.py
\
--am
=
fastspeech2_mix
\
--am_config
=
${
am_model_dir
}
/default.yaml
\
--am_ckpt
=
${
am_model_dir
}
/snapshot_iter_94000.pdz
\
--am_stat
=
${
am_model_dir
}
/speech_stats.npy
\
--voc
=
hifigan_ljspeech
\
--voc_config
=
${
model_dir
}
/hifigan_ljspeech_ckpt_0.2.0/default.yaml
\
--voc_ckpt
=
${
model_dir
}
/hifigan_ljspeech_ckpt_0.2.0/snapshot_iter_2500000.pdz
\
--voc_stat
=
${
model_dir
}
/hifigan_ljspeech_ckpt_0.2.0/feats_stats.npy
\
--lang
=
mix
\
--text
=
${
BIN_DIR
}
/../sentences_mix.txt
\
--output_dir
=
${
output
}
/test_e2e
\
--phones_dict
=
${
am_model_dir
}
/phone_id_map.txt
\
--speaker_dict
=
${
am_model_dir
}
/speaker_id_map.txt
\
--spk_id
0
fi
examples/zh_en_tts/tts3/path.sh
0 → 100755
浏览文件 @
e1f86954
#!/bin/bash
export
MAIN_ROOT
=
`
realpath
${
PWD
}
/../../../
`
export
PATH
=
${
MAIN_ROOT
}
:
${
MAIN_ROOT
}
/utils:
${
PATH
}
export
LC_ALL
=
C
export
PYTHONDONTWRITEBYTECODE
=
1
# Use UTF-8 in Python to avoid UnicodeDecodeError when LC_ALL=C
export
PYTHONIOENCODING
=
UTF-8
export
PYTHONPATH
=
${
MAIN_ROOT
}
:
${
PYTHONPATH
}
MODEL
=
fastspeech2
export
BIN_DIR
=
${
MAIN_ROOT
}
/paddlespeech/t2s/exps/
${
MODEL
}
examples/zh_en_tts/tts3/test.sh
0 → 100755
浏览文件 @
e1f86954
#!/bin/bash
set
-e
source
path.sh
gpus
=
0,1
stage
=
3
stop_stage
=
100
model_dir
=
models
output_dir
=
output
# with the following command, you can choose the stage range you want to run
# such as `./run.sh --stage 0 --stop-stage 0`
# this can not be mixed use with `$1`, `$2` ...
source
${
MAIN_ROOT
}
/utils/parse_options.sh
||
exit
1
if
[
${
stage
}
-le
3
]
&&
[
${
stop_stage
}
-ge
3
]
;
then
# synthesize_e2e, vocoder is hifigan by default
CUDA_VISIBLE_DEVICES
=
${
gpus
}
./local/synthesize_e2e.sh
${
model_dir
}
${
output_dir
}
||
exit
-1
fi
paddlespeech/t2s/exps/sentences_mix.txt
0 → 100644
浏览文件 @
e1f86954
001 你好,欢迎使用 Paddle Speech 中英文混合 T T S 功能,开始你的合成之旅吧!
002 我们的声学模型使用了 Fast Speech Two, 声码器使用了 Parallel Wave GAN and Hifi GAN.
003 Paddle N L P 发布 ERNIE Tiny 全系列中文预训练小模型,快速提升预训练模型部署效率,通用信息抽取技术 U I E Tiny 系列模型全新升级,支持速度更快效果更好的 U I E 小模型。
004 Paddle Speech 发布 P P A S R 流式语音识别系统、P P T T S 流式语音合成系统、P P V P R 全链路声纹识别系统。
005 Paddle Bo Bo: 使用 Paddle Speech 的语音合成模块生成虚拟人的声音。
006 热烈欢迎您在 Discussions 中提交问题,并在 Issues 中指出发现的 bug。此外,我们非常希望您参与到 Paddle Speech 的开发中!
007 我喜欢 eat apple, 你喜欢 drink milk。
008 我们要去云南 team building, 非常非常 happy.
\ No newline at end of file
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录