Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
PaddlePaddle
DeepSpeech
提交
107f8b89
D
DeepSpeech
项目概览
PaddlePaddle
/
DeepSpeech
大约 2 年 前同步成功
通知
210
Star
8425
Fork
1598
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
245
列表
看板
标记
里程碑
合并请求
3
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
D
DeepSpeech
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
245
Issue
245
列表
看板
标记
里程碑
合并请求
3
合并请求
3
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
提交
107f8b89
编写于
6月 18, 2017
作者:
chrisxu2014
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
add audio augmentation
上级
b8341da6
变更
2
显示空白变更内容
内联
并排
Showing
2 changed file
with
7 addition
and
7 deletion
+7
-7
data_utils/audio.py
data_utils/audio.py
+3
-3
data_utils/speech.py
data_utils/speech.py
+4
-4
未找到文件。
data_utils/audio.py
浏览文件 @
107f8b89
...
@@ -88,7 +88,7 @@ class AudioSegment(object):
...
@@ -88,7 +88,7 @@ class AudioSegment(object):
:rtype: AudioSegment
:rtype: AudioSegment
:raises ValueError: If the number of segments is zero, or if the
:raises ValueError: If the number of segments is zero, or if the
sample_rate of any two segments does not match.
sample_rate of any two segments does not match.
:raises TypeError: If every item in segments is not Audio
s
egment
:raises TypeError: If every item in segments is not Audio
S
egment
instance.
instance.
"""
"""
# Perform basic sanity-checks.
# Perform basic sanity-checks.
...
@@ -296,7 +296,7 @@ class AudioSegment(object):
...
@@ -296,7 +296,7 @@ class AudioSegment(object):
:type prior_db: float
:type prior_db: float
:param prior_samples: Prior strength in number of samples.
:param prior_samples: Prior strength in number of samples.
:type prior_samples: float
:type prior_samples: float
:param startup_delay: Default 0.0
s. If provided, this function will
:param startup_delay: Default 0.0s. If provided, this function will
accrue statistics for the first startup_delay
accrue statistics for the first startup_delay
seconds before applying online normalization.
seconds before applying online normalization.
:type startup_delay: float
:type startup_delay: float
...
@@ -401,7 +401,7 @@ class AudioSegment(object):
...
@@ -401,7 +401,7 @@ class AudioSegment(object):
self
.
subsegment
(
start_time
,
start_time
+
subsegment_length
)
self
.
subsegment
(
start_time
,
start_time
+
subsegment_length
)
def
convolve
(
self
,
impulse_segment
,
allow_resample
=
False
):
def
convolve
(
self
,
impulse_segment
,
allow_resample
=
False
):
"""Convolve this audio segment with the given
filter
.
"""Convolve this audio segment with the given
impulse_segment
.
Note that this is an in-place transformation.
Note that this is an in-place transformation.
...
...
data_utils/speech.py
浏览文件 @
107f8b89
...
@@ -75,11 +75,11 @@ class SpeechSegment(AudioSegment):
...
@@ -75,11 +75,11 @@ class SpeechSegment(AudioSegment):
:rtype: SpeechSegment
:rtype: SpeechSegment
:raises ValueError: If the number of segments is zero, or if the
:raises ValueError: If the number of segments is zero, or if the
sample_rate of any two segments does not match.
sample_rate of any two segments does not match.
:raises TypeError: If every item in segments is not
Audios
egment
:raises TypeError: If every item in segments is not
SpeechS
egment
instance.
instance.
"""
"""
if
len
(
segments
)
==
0
:
if
len
(
segments
)
==
0
:
raise
ValueError
(
"No
audio
segments are given to concatenate."
)
raise
ValueError
(
"No
speech
segments are given to concatenate."
)
sample_rate
=
segments
[
0
].
_sample_rate
sample_rate
=
segments
[
0
].
_sample_rate
transcripts
=
""
transcripts
=
""
for
seg
in
segments
:
for
seg
in
segments
:
...
@@ -116,7 +116,7 @@ class SpeechSegment(AudioSegment):
...
@@ -116,7 +116,7 @@ class SpeechSegment(AudioSegment):
:rtype: SpeechSegment
:rtype: SpeechSegment
"""
"""
audio
=
Audiosegment
.
slice_from_file
(
filepath
,
start
,
end
)
audio
=
Audiosegment
.
slice_from_file
(
filepath
,
start
,
end
)
return
cls
(
audio
.
samples
,
audio
.
sample_rate
,
transcript
s
)
return
cls
(
audio
.
samples
,
audio
.
sample_rate
,
transcript
)
@
classmethod
@
classmethod
def
make_silence
(
cls
,
duration
,
sample_rate
):
def
make_silence
(
cls
,
duration
,
sample_rate
):
...
@@ -128,7 +128,7 @@ class SpeechSegment(AudioSegment):
...
@@ -128,7 +128,7 @@ class SpeechSegment(AudioSegment):
:param sample_rate: Sample rate.
:param sample_rate: Sample rate.
:type sample_rate: float
:type sample_rate: float
:return: Silence of the given duration.
:return: Silence of the given duration.
:rtype:
Audio
Segment
:rtype:
Speech
Segment
"""
"""
audio
=
AudioSegment
.
make_silence
(
duration
,
sample_rate
)
audio
=
AudioSegment
.
make_silence
(
duration
,
sample_rate
)
return
cls
(
audio
.
samples
,
audio
.
sample_rate
,
""
)
return
cls
(
audio
.
samples
,
audio
.
sample_rate
,
""
)
...
...
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录