Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
PaddlePaddle
DeepSpeech
提交
b02e0dae
D
DeepSpeech
项目概览
PaddlePaddle
/
DeepSpeech
大约 2 年 前同步成功
通知
210
Star
8425
Fork
1598
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
245
列表
看板
标记
里程碑
合并请求
3
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
D
DeepSpeech
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
245
Issue
245
列表
看板
标记
里程碑
合并请求
3
合并请求
3
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
未验证
提交
b02e0dae
编写于
4月 07, 2022
作者:
H
Hui Zhang
提交者:
GitHub
4月 07, 2022
浏览文件
操作
浏览文件
下载
差异文件
Merge pull request #1659 from KPatr1ck/docs
[Audio] Remove mcd.
上级
cb613e33
c47c181e
变更
3
隐藏空白更改
内联
并排
Showing
3 changed file
with
3 addition
and
68 deletion
+3
-68
paddleaudio/paddleaudio/metric/__init__.py
paddleaudio/paddleaudio/metric/__init__.py
+0
-1
paddleaudio/paddleaudio/metric/mcd.py
paddleaudio/paddleaudio/metric/mcd.py
+0
-63
paddleaudio/setup.py
paddleaudio/setup.py
+3
-4
未找到文件。
paddleaudio/paddleaudio/metric/__init__.py
浏览文件 @
b02e0dae
...
...
@@ -14,4 +14,3 @@
from
.dtw
import
dtw_distance
from
.eer
import
compute_eer
from
.eer
import
compute_minDCF
from
.mcd
import
mcd_distance
paddleaudio/paddleaudio/metric/mcd.py
已删除
100644 → 0
浏览文件 @
cb613e33
# Copyright (c) 2022 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
from
typing
import
Callable
import
mcd.metrics
as
mt
import
numpy
as
np
from
mcd
import
dtw
__all__
=
[
'mcd_distance'
,
]
def
mcd_distance
(
xs
:
np
.
ndarray
,
ys
:
np
.
ndarray
,
cost_fn
:
Callable
=
mt
.
logSpecDbDist
)
->
float
:
"""Mel cepstral distortion (MCD), dtw distance.
Dynamic Time Warping.
Uses dynamic programming to compute:
Examples:
.. code-block:: python
wps[i, j] = cost_fn(xs[i], ys[j]) + min(
wps[i-1, j ], // vertical / insertion / expansion
wps[i , j-1], // horizontal / deletion / compression
wps[i-1, j-1]) // diagonal / match
dtw = sqrt(wps[-1, -1])
Cost Function:
Examples:
.. code-block:: python
logSpecDbConst = 10.0 / math.log(10.0) * math.sqrt(2.0)
def logSpecDbDist(x, y):
diff = x - y
return logSpecDbConst * math.sqrt(np.inner(diff, diff))
Args:
xs (np.ndarray): ref sequence, [T,D]
ys (np.ndarray): hyp sequence, [T,D]
cost_fn (Callable, optional): Cost function. Defaults to mt.logSpecDbDist.
Returns:
float: dtw distance
"""
min_cost
,
path
=
dtw
.
dtw
(
xs
,
ys
,
cost_fn
)
return
min_cost
paddleaudio/setup.py
浏览文件 @
b02e0dae
...
...
@@ -19,7 +19,7 @@ from setuptools.command.install import install
from
setuptools.command.test
import
test
# set the version here
VERSION
=
'0.2.
0
'
VERSION
=
'0.2.
1
'
# Inspired by the example at https://pytest.org/latest/goodpractises.html
...
...
@@ -83,9 +83,8 @@ setuptools.setup(
python_requires
=
'>=3.6'
,
install_requires
=
[
'numpy >= 1.15.0'
,
'scipy >= 1.0.0'
,
'resampy >= 0.2.2'
,
'soundfile >= 0.9.0'
,
'colorlog'
,
'dtaidistance == 2.3.1'
,
'mcd >= 0.4'
,
'pathos'
],
'soundfile >= 0.9.0'
,
'colorlog'
,
'dtaidistance == 2.3.1'
,
'pathos'
],
extras_require
=
{
'test'
:
[
'nose'
,
'librosa==0.8.1'
,
'soundfile==0.10.3.post1'
,
...
...
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录