Commit 40466ef6
Authored Aug 24, 2021 by huangyuxin

Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into ds2_online

Parents: b3d27e4b 7840806b

Showing 6 changed files with 25 additions and 14 deletions
deepspeech/exps/u2_kaldi/model.py +4 -4
deepspeech/frontend/augmentor/spec_augment.py +3 -0
deepspeech/frontend/utility.py +1 -1
examples/aishell/s0/README.md +9 -1
examples/aishell/s0/conf/augmentation.json +4 -4
examples/librispeech/s0/conf/augmentation.json +4 -4
deepspeech/exps/u2_kaldi/model.py
@@ -228,7 +228,7 @@ class U2Trainer(Trainer):
     maxlen_in=float('inf'),
     maxlen_out=float('inf'),
     minibatches=0,
-    mini_batch_size=1,
+    mini_batch_size=self.args.nprocs,
     batch_count='auto',
     batch_bins=0,
     batch_frames_in=0,
@@ -247,7 +247,7 @@ class U2Trainer(Trainer):
     maxlen_in=float('inf'),
     maxlen_out=float('inf'),
     minibatches=0,
-    mini_batch_size=1,
+    mini_batch_size=self.args.nprocs,
     batch_count='auto',
     batch_bins=0,
     batch_frames_in=0,
@@ -263,7 +263,7 @@ class U2Trainer(Trainer):
     json_file=config.data.test_manifest,
     train_mode=False,
     sortagrad=False,
-    batch_size=config.collator.batch_size,
+    batch_size=config.decoding.batch_size,
     maxlen_in=float('inf'),
     maxlen_out=float('inf'),
     minibatches=0,
@@ -282,7 +282,7 @@ class U2Trainer(Trainer):
     json_file=config.data.test_manifest,
     train_mode=False,
     sortagrad=False,
-    batch_size=config.collator.batch_size,
+    batch_size=config.decoding.batch_size,
     maxlen_in=float('inf'),
     maxlen_out=float('inf'),
     minibatches=0,
deepspeech/frontend/augmentor/spec_augment.py
@@ -151,6 +151,9 @@ class SpecAugmentor(AugmentorBase):
         np.ndarray: time warped spectrogram (time, freq)
     """
     window = max_time_warp = self.W
+    if window == 0:
+        return x
+
     if mode == "PIL":
         t = x.shape[0]
         if t - window <= window:
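The guard added in this hunk returns the input unchanged when `W` is 0, which matches the `"W": 0` setting in the augmentation configs changed by this commit. A minimal sketch of why such a guard may be needed, assuming (the hunk does not show this) that the warp position is drawn with `random.randrange` over a range of width `2 * window`. `pick_warp_point` is a hypothetical helper for illustration, not a function from the repository:

```python
import random


def pick_warp_point(t: int, window: int):
    """Hypothetical helper: pick a centre frame and its warped position.

    Returns None when no warping should happen.
    """
    # Guard mirrored from the hunk: a zero window means "no time warp".
    if window == 0:
        return None
    # Same condition as the last line of the hunk; treating it as
    # "sequence too short, skip warping" is an assumption of this sketch.
    if t - window <= window:
        return None
    center = random.randrange(window, t - window)
    # Assumed sampling step: with window == 0 this would be
    # random.randrange(center, center), an empty range, which raises ValueError.
    warped = random.randrange(center - window, center + window) + 1
    return center, warped


print(pick_warp_point(100, 0))  # no warp requested
print(pick_warp_point(100, 5))  # a (center, warped) pair
```

Under that assumption, the early return is what makes `"W": 0` a safe way to switch time warping off from the config.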
deepspeech/frontend/utility.py
@@ -46,7 +46,7 @@ def load_dict(dict_path: Optional[Text], maskctc=False) -> Optional[List[Text]]:
     with open(dict_path, "r") as f:
         dictionary = f.readlines()
-    char_list = [entry.split(" ")[0] for entry in dictionary]
+    char_list = [entry.strip().split(" ")[0] for entry in dictionary]
     if BLANK not in char_list:
         char_list.insert(0, BLANK)
     if EOS not in char_list:
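The effect of adding `strip()` can be seen on a small in-memory dictionary. This is a sketch only: `first_tokens` is a hypothetical helper, and the value `"<blank>"` for the blank symbol is an assumption, since the hunk does not show how `BLANK` is defined.

```python
def first_tokens(lines, strip):
    """Hypothetical helper: take the first space-separated field of each line."""
    return [(entry.strip() if strip else entry).split(" ")[0] for entry in lines]


BLANK = "<blank>"  # assumed value, not taken from the source

# Lines as f.readlines() would return them, trailing newline included.
lines = ["<blank>\n", "a 1\n", "b\n"]

old = first_tokens(lines, strip=False)  # behaviour before the change
new = first_tokens(lines, strip=True)   # behaviour after the change

# Entries without a second field keep their newline in the old version,
# so the membership test for BLANK fails even though the symbol is listed.
print(old, BLANK in old)
print(new, BLANK in new)
```

With the old expression, a dictionary whose lines hold only the token would have `BLANK` inserted a second time by the `if BLANK not in char_list` branch.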
examples/aishell/s0/README.md
 # Aishell-1
+
+## Data
+| Data Subset | Duration in Seconds |
+| data/manifest.train | 1.23 ~ 14.53125 |
+| data/manifest.dev | 1.645 ~ 12.533 |
+| data/manifest.test | 1.859125 ~ 14.6999375 |
+
+`jq '.feat_shape[0]' data/manifest.train | sort -un`
 
 ## Deepspeech2
 | Model | Params | Release | Config | Test set | Loss | CER |
 | --- | --- | --- | --- | --- | --- | --- |
-| DeepSpeech2 | 58.4M | 2.2.0 | conf/deepspeech2.yaml + spec aug + new datapipe | test | 6.396368026733398 | 0.068382,0.073507 |
+| DeepSpeech2 | 58.4M | 2.2.0 | conf/deepspeech2.yaml + spec aug + new datapipe | test | 6.396368026733398 | 0.068382 |
 | DeepSpeech2 | 58.4M | 2.1.0 | conf/deepspeech2.yaml + spec aug | test | 7.483316898345947 | 0.077860 |
 | DeepSpeech2 | 58.4M | 2.1.0 | conf/deepspeech2.yaml | test | 7.299022197723389 | 0.078671 |
 | DeepSpeech2 | 58.4M | 2.0.0 | conf/deepspeech2.yaml | test | - | 0.078977 |
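The `jq` one-liner in the README prints the sorted, de-duplicated first element of `feat_shape` for every manifest entry, from which the minimum and maximum in the Data table can be read. A rough Python equivalent, assuming the manifest is a stream of JSON objects, one per line, each with a `feat_shape` list; the sample records below are made up for illustration:

```python
import json


def unique_first_dims(manifest_lines):
    """Sorted unique values of feat_shape[0], like `jq ... | sort -un`."""
    values = {
        json.loads(line)["feat_shape"][0]
        for line in manifest_lines
        if line.strip()  # skip blank lines
    }
    return sorted(values)


# Made-up manifest entries; real manifests carry more fields.
sample = [
    '{"utt": "u1", "feat_shape": [3.5, 161]}',
    '{"utt": "u2", "feat_shape": [1.25, 161]}',
    '{"utt": "u3", "feat_shape": [3.5, 161]}',
]

dims = unique_first_dims(sample)
print(dims[0], "~", dims[-1])  # shortest and longest entry
```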
examples/aishell/s0/conf/augmentation.json
@@ -19,17 +19,17 @@
 {
   "type": "specaug",
   "params": {
-    "W": 5,
+    "W": 0,
     "warp_mode": "PIL",
-    "F": 30,
+    "F": 10,
     "n_freq_masks": 2,
-    "T": 40,
+    "T": 50,
     "n_time_masks": 2,
     "p": 1.0,
     "adaptive_number_ratio": 0,
     "adaptive_size_ratio": 0,
     "max_n_time_masks": 20,
-    "replace_with_zero": false
+    "replace_with_zero": true
   },
   "prob": 1.0
 }
examples/librispeech/s0/conf/augmentation.json
@@ -19,17 +19,17 @@
 {
   "type": "specaug",
   "params": {
+    "W": 0,
+    "warp_mode": "PIL",
     "F": 10,
-    "T": 50,
     "n_freq_masks": 2,
+    "T": 50,
     "n_time_masks": 2,
     "p": 1.0,
-    "W": 80,
     "adaptive_number_ratio": 0,
     "adaptive_size_ratio": 0,
     "max_n_time_masks": 20,
-    "replace_with_zero": true,
-    "warp_mode": "PIL"
+    "replace_with_zero": true
   },
   "prob": 1.0
 }