Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
PaddlePaddle
DeepSpeech
提交
19824a8d
D
DeepSpeech
项目概览
PaddlePaddle
/
DeepSpeech
大约 2 年 前同步成功
通知
210
Star
8425
Fork
1598
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
245
列表
看板
标记
里程碑
合并请求
3
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
D
DeepSpeech
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
245
Issue
245
列表
看板
标记
里程碑
合并请求
3
合并请求
3
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
提交
19824a8d
编写于
8月 14, 2017
作者:
W
wanghaoshuang
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
Move local data from global into class DataGenerator.
上级
b4176812
变更
1
隐藏空白更改
内联
并排
Showing
1 changed file
with
13 addition
and
14 deletion
+13
-14
data_utils/data.py
data_utils/data.py
+13
-14
未找到文件。
data_utils/data.py
浏览文件 @
19824a8d
...
@@ -17,11 +17,6 @@ from data_utils.featurizer.speech_featurizer import SpeechFeaturizer
...
@@ -17,11 +17,6 @@ from data_utils.featurizer.speech_featurizer import SpeechFeaturizer
from
data_utils.speech
import
SpeechSegment
from
data_utils.speech
import
SpeechSegment
from
data_utils.normalizer
import
FeatureNormalizer
from
data_utils.normalizer
import
FeatureNormalizer
# for caching tar files info
local_data
=
local
()
local_data
.
tar2info
=
{}
local_data
.
tar2object
=
{}
class
DataGenerator
(
object
):
class
DataGenerator
(
object
):
"""
"""
...
@@ -89,6 +84,10 @@ class DataGenerator(object):
...
@@ -89,6 +84,10 @@ class DataGenerator(object):
self
.
_num_threads
=
num_threads
self
.
_num_threads
=
num_threads
self
.
_rng
=
random
.
Random
(
random_seed
)
self
.
_rng
=
random
.
Random
(
random_seed
)
self
.
_epoch
=
0
self
.
_epoch
=
0
# for caching tar files info
self
.
local_data
=
local
()
self
.
local_data
.
tar2info
=
{}
self
.
local_data
.
tar2object
=
{}
def
process_utterance
(
self
,
filename
,
transcript
):
def
process_utterance
(
self
,
filename
,
transcript
):
"""Load, augment, featurize and normalize for speech data.
"""Load, augment, featurize and normalize for speech data.
...
@@ -241,16 +240,16 @@ class DataGenerator(object):
...
@@ -241,16 +240,16 @@ class DataGenerator(object):
"""
"""
if
file
.
startswith
(
'tar:'
):
if
file
.
startswith
(
'tar:'
):
tarpath
,
filename
=
file
.
split
(
':'
,
1
)[
1
].
split
(
'#'
,
1
)
tarpath
,
filename
=
file
.
split
(
':'
,
1
)[
1
].
split
(
'#'
,
1
)
if
'tar2info'
not
in
local_data
.
__dict__
:
if
'tar2info'
not
in
self
.
local_data
.
__dict__
:
local_data
.
tar2info
=
{}
self
.
local_data
.
tar2info
=
{}
if
'tar2object'
not
in
local_data
.
__dict__
:
if
'tar2object'
not
in
self
.
local_data
.
__dict__
:
local_data
.
tar2object
=
{}
self
.
local_data
.
tar2object
=
{}
if
tarpath
not
in
local_data
.
tar2info
:
if
tarpath
not
in
self
.
local_data
.
tar2info
:
object
,
infoes
=
self
.
_parse_tar
(
tarpath
)
object
,
infoes
=
self
.
_parse_tar
(
tarpath
)
local_data
.
tar2info
[
tarpath
]
=
infoes
self
.
local_data
.
tar2info
[
tarpath
]
=
infoes
local_data
.
tar2object
[
tarpath
]
=
object
self
.
local_data
.
tar2object
[
tarpath
]
=
object
return
local_data
.
tar2object
[
tarpath
].
extractfile
(
return
self
.
local_data
.
tar2object
[
tarpath
].
extractfile
(
local_data
.
tar2info
[
tarpath
][
filename
])
self
.
local_data
.
tar2info
[
tarpath
][
filename
])
else
:
else
:
return
open
(
file
,
'r'
)
return
open
(
file
,
'r'
)
...
...
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录