Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
PaddlePaddle
DeepSpeech
提交
c36039ce
D
DeepSpeech
项目概览
PaddlePaddle
/
DeepSpeech
大约 1 年 前同步成功
通知
207
Star
8425
Fork
1598
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
245
列表
看板
标记
里程碑
合并请求
3
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
D
DeepSpeech
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
245
Issue
245
列表
看板
标记
里程碑
合并请求
3
合并请求
3
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
体验新版 GitCode,发现更多精彩内容 >>
提交
c36039ce
编写于
3月 22, 2022
作者:
小湉湉
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
update readme for ljspeech hifigan, test=tts
上级
a151935e
变更
2
隐藏空白更改
内联
并排
Showing
2 changed file
with
81 addition
and
30 deletion
+81
-30
examples/ljspeech/tts3/local/synthesize.sh
examples/ljspeech/tts3/local/synthesize.sh
+39
-14
examples/ljspeech/tts3/local/synthesize_e2e.sh
examples/ljspeech/tts3/local/synthesize_e2e.sh
+42
-16
未找到文件。
examples/ljspeech/tts3/local/synthesize.sh
浏览文件 @
c36039ce
...
...
@@ -4,17 +4,42 @@ config_path=$1
train_output_path
=
$2
ckpt_name
=
$3
FLAGS_allocator_strategy
=
naive_best_fit
\
FLAGS_fraction_of_gpu_memory_to_use
=
0.01
\
python3
${
BIN_DIR
}
/../synthesize.py
\
--am
=
fastspeech2_ljspeech
\
--am_config
=
${
config_path
}
\
--am_ckpt
=
${
train_output_path
}
/checkpoints/
${
ckpt_name
}
\
--am_stat
=
dump/train/speech_stats.npy
\
--voc
=
pwgan_ljspeech
\
--voc_config
=
pwg_ljspeech_ckpt_0.5/pwg_default.yaml
\
--voc_ckpt
=
pwg_ljspeech_ckpt_0.5/pwg_snapshot_iter_400000.pdz
\
--voc_stat
=
pwg_ljspeech_ckpt_0.5/pwg_stats.npy
\
--test_metadata
=
dump/test/norm/metadata.jsonl
\
--output_dir
=
${
train_output_path
}
/test
\
--phones_dict
=
dump/phone_id_map.txt
stage
=
0
stop_stage
=
0
# pwgan
if
[
${
stage
}
-le
0
]
&&
[
${
stop_stage
}
-ge
0
]
;
then
FLAGS_allocator_strategy
=
naive_best_fit
\
FLAGS_fraction_of_gpu_memory_to_use
=
0.01
\
python3
${
BIN_DIR
}
/../synthesize.py
\
--am
=
fastspeech2_ljspeech
\
--am_config
=
${
config_path
}
\
--am_ckpt
=
${
train_output_path
}
/checkpoints/
${
ckpt_name
}
\
--am_stat
=
dump/train/speech_stats.npy
\
--voc
=
pwgan_ljspeech
\
--voc_config
=
pwg_ljspeech_ckpt_0.5/pwg_default.yaml
\
--voc_ckpt
=
pwg_ljspeech_ckpt_0.5/pwg_snapshot_iter_400000.pdz
\
--voc_stat
=
pwg_ljspeech_ckpt_0.5/pwg_stats.npy
\
--test_metadata
=
dump/test/norm/metadata.jsonl
\
--output_dir
=
${
train_output_path
}
/test
\
--phones_dict
=
dump/phone_id_map.txt
fi
# hifigan
if
[
${
stage
}
-le
0
]
&&
[
${
stop_stage
}
-ge
0
]
;
then
FLAGS_allocator_strategy
=
naive_best_fit
\
FLAGS_fraction_of_gpu_memory_to_use
=
0.01
\
python3
${
BIN_DIR
}
/../synthesize.py
\
--am
=
fastspeech2_ljspeech
\
--am_config
=
${
config_path
}
\
--am_ckpt
=
${
train_output_path
}
/checkpoints/
${
ckpt_name
}
\
--am_stat
=
dump/train/speech_stats.npy
\
--voc
=
hifigan_ljspeech
\
--voc_config
=
hifigan_ljspeech_ckpt_0.2.0/default.yaml
\
--voc_ckpt
=
hifigan_ljspeech_ckpt_0.2.0/snapshot_iter_2500000.pdz
\
--voc_stat
=
hifigan_ljspeech_ckpt_0.2.0/feats_stats.npy
\
--test_metadata
=
dump/test/norm/metadata.jsonl
\
--output_dir
=
${
train_output_path
}
/test
\
--phones_dict
=
dump/phone_id_map.txt
fi
examples/ljspeech/tts3/local/synthesize_e2e.sh
浏览文件 @
c36039ce
...
...
@@ -4,19 +4,45 @@ config_path=$1
train_output_path
=
$2
ckpt_name
=
$3
FLAGS_allocator_strategy
=
naive_best_fit
\
FLAGS_fraction_of_gpu_memory_to_use
=
0.01
\
python3
${
BIN_DIR
}
/../synthesize_e2e.py
\
--am
=
fastspeech2_ljspeech
\
--am_config
=
${
config_path
}
\
--am_ckpt
=
${
train_output_path
}
/checkpoints/
${
ckpt_name
}
\
--am_stat
=
dump/train/speech_stats.npy
\
--voc
=
pwgan_ljspeech
\
--voc_config
=
pwg_ljspeech_ckpt_0.5/pwg_default.yaml
\
--voc_ckpt
=
pwg_ljspeech_ckpt_0.5/pwg_snapshot_iter_400000.pdz
\
--voc_stat
=
pwg_ljspeech_ckpt_0.5/pwg_stats.npy
\
--lang
=
en
\
--text
=
${
BIN_DIR
}
/../sentences_en.txt
\
--output_dir
=
${
train_output_path
}
/test_e2e
\
--inference_dir
=
${
train_output_path
}
/inference
\
--phones_dict
=
dump/phone_id_map.txt
\ No newline at end of file
stage
=
0
stop_stage
=
0
# pwgan
if
[
${
stage
}
-le
0
]
&&
[
${
stop_stage
}
-ge
0
]
;
then
FLAGS_allocator_strategy
=
naive_best_fit
\
FLAGS_fraction_of_gpu_memory_to_use
=
0.01
\
python3
${
BIN_DIR
}
/../synthesize_e2e.py
\
--am
=
fastspeech2_ljspeech
\
--am_config
=
${
config_path
}
\
--am_ckpt
=
${
train_output_path
}
/checkpoints/
${
ckpt_name
}
\
--am_stat
=
dump/train/speech_stats.npy
\
--voc
=
pwgan_ljspeech
\
--voc_config
=
pwg_ljspeech_ckpt_0.5/pwg_default.yaml
\
--voc_ckpt
=
pwg_ljspeech_ckpt_0.5/pwg_snapshot_iter_400000.pdz
\
--voc_stat
=
pwg_ljspeech_ckpt_0.5/pwg_stats.npy
\
--lang
=
en
\
--text
=
${
BIN_DIR
}
/../sentences_en.txt
\
--output_dir
=
${
train_output_path
}
/test_e2e
\
--inference_dir
=
${
train_output_path
}
/inference
\
--phones_dict
=
dump/phone_id_map.txt
fi
# hifigan
if
[
${
stage
}
-le
1
]
&&
[
${
stop_stage
}
-ge
1
]
;
then
FLAGS_allocator_strategy
=
naive_best_fit
\
FLAGS_fraction_of_gpu_memory_to_use
=
0.01
\
python3
${
BIN_DIR
}
/../synthesize_e2e.py
\
--am
=
fastspeech2_ljspeech
\
--am_config
=
${
config_path
}
\
--am_ckpt
=
${
train_output_path
}
/checkpoints/
${
ckpt_name
}
\
--am_stat
=
dump/train/speech_stats.npy
\
--voc
=
hifigan_ljspeech
\
--voc_config
=
hifigan_ljspeech_ckpt_0.2.0/default.yaml
\
--voc_ckpt
=
hifigan_ljspeech_ckpt_0.2.0/snapshot_iter_2500000.pdz
\
--voc_stat
=
hifigan_ljspeech_ckpt_0.2.0/feats_stats.npy
\
--lang
=
en
\
--text
=
${
BIN_DIR
}
/../sentences_en.txt
\
--output_dir
=
${
train_output_path
}
/test_e2e
\
--inference_dir
=
${
train_output_path
}
/inference
\
--phones_dict
=
dump/phone_id_map.txt
fi
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录