Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
PaddlePaddle
DeepSpeech
提交
3cbfd7bf
D
DeepSpeech
项目概览
PaddlePaddle
/
DeepSpeech
大约 2 年 前同步成功
通知
210
Star
8425
Fork
1598
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
245
列表
看板
标记
里程碑
合并请求
3
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
D
DeepSpeech
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
245
Issue
245
列表
看板
标记
里程碑
合并请求
3
合并请求
3
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
提交
3cbfd7bf
编写于
12月 22, 2021
作者:
J
Jerryuhoo
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
Add speaker embedding and speaker id for style fastspeech2 inference
上级
db121226
变更
1
隐藏空白更改
内联
并排
Showing
1 changed file
with
9 addition
and
3 deletion
+9
-3
paddlespeech/t2s/models/fastspeech2/fastspeech2.py
paddlespeech/t2s/models/fastspeech2/fastspeech2.py
+9
-3
未找到文件。
paddlespeech/t2s/models/fastspeech2/fastspeech2.py
浏览文件 @
3cbfd7bf
...
@@ -907,7 +907,9 @@ class StyleFastSpeech2Inference(FastSpeech2Inference):
...
@@ -907,7 +907,9 @@ class StyleFastSpeech2Inference(FastSpeech2Inference):
energy
:
Union
[
paddle
.
Tensor
,
np
.
ndarray
]
=
None
,
energy
:
Union
[
paddle
.
Tensor
,
np
.
ndarray
]
=
None
,
energy_scale
:
Union
[
int
,
float
]
=
None
,
energy_scale
:
Union
[
int
,
float
]
=
None
,
energy_bias
:
Union
[
int
,
float
]
=
None
,
energy_bias
:
Union
[
int
,
float
]
=
None
,
robot
:
bool
=
False
):
robot
:
bool
=
False
,
spk_emb
=
None
,
spk_id
=
None
):
"""
"""
Parameters
Parameters
----------
----------
...
@@ -938,8 +940,9 @@ class StyleFastSpeech2Inference(FastSpeech2Inference):
...
@@ -938,8 +940,9 @@ class StyleFastSpeech2Inference(FastSpeech2Inference):
Tensor
Tensor
Output sequence of features (L, odim).
Output sequence of features (L, odim).
"""
"""
spk_id
=
paddle
.
to_tensor
(
spk_id
)
normalized_mel
,
d_outs
,
p_outs
,
e_outs
=
self
.
acoustic_model
.
inference
(
normalized_mel
,
d_outs
,
p_outs
,
e_outs
=
self
.
acoustic_model
.
inference
(
text
,
durations
=
None
,
pitch
=
None
,
energy
=
None
)
text
,
durations
=
None
,
pitch
=
None
,
energy
=
None
,
spk_emb
=
spk_emb
,
spk_id
=
spk_id
)
# priority: groundtruth > scale/bias > previous output
# priority: groundtruth > scale/bias > previous output
# set durations
# set durations
if
isinstance
(
durations
,
np
.
ndarray
):
if
isinstance
(
durations
,
np
.
ndarray
):
...
@@ -991,7 +994,10 @@ class StyleFastSpeech2Inference(FastSpeech2Inference):
...
@@ -991,7 +994,10 @@ class StyleFastSpeech2Inference(FastSpeech2Inference):
durations
=
durations
,
durations
=
durations
,
pitch
=
pitch
,
pitch
=
pitch
,
energy
=
energy
,
energy
=
energy
,
use_teacher_forcing
=
True
)
use_teacher_forcing
=
True
,
spk_emb
=
spk_emb
,
spk_id
=
spk_id
)
logmel
=
self
.
normalizer
.
inverse
(
normalized_mel
)
logmel
=
self
.
normalizer
.
inverse
(
normalized_mel
)
return
logmel
return
logmel
...
...
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录