PaddlePaddle / DeepSpeech
Commit ccdfd5b3
Authored Aug 04, 2021 by Hui Zhang

format

Parent: 571b13c5

Showing 5 changed files with 25 additions and 16 deletions (+25 -16)
deepspeech/decoders/swig/setup.py +2 -3
deepspeech/models/ds2/__init__.py +14 -4
deepspeech/models/ds2/deepspeech2.py +5 -5
examples/callcenter/s1/README.md +1 -1
examples/librispeech/s1/README.md +3 -3

deepspeech/decoders/swig/setup.py @ ccdfd5b3

@@ -84,9 +84,8 @@ FILES = glob.glob('kenlm/util/*.cc') \
 FILES += glob.glob('openfst-1.6.3/src/lib/*.cc')
 FILES = [
-    fn for fn in FILES if not (fn.endswith('main.cc') or fn.endswith('test.cc') or fn.endswith('unittest.cc'))
+    fn for fn in FILES if not (fn.endswith('main.cc') or fn.endswith('test.cc') or fn.endswith('unittest.cc'))
 ]
 LIBS = ['stdc++']
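
The comprehension in this hunk drops source files that are tests or that define their own `main`, so they are not compiled into the decoder extension. As a minimal sketch of the same filter (not the repository's code): `str.endswith` accepts a tuple of suffixes, so the three calls can be collapsed into one. The helper name `drop_entry_points` and the sample paths are hypothetical and serve only as illustration.

```python
# Hypothetical helper illustrating the suffix filter used in setup.py.
EXCLUDED_SUFFIXES = ('main.cc', 'test.cc', 'unittest.cc')


def drop_entry_points(files):
    """Return the paths that do not end with an excluded suffix."""
    # str.endswith accepts a tuple and is True if any suffix matches.
    return [fn for fn in files if not fn.endswith(EXCLUDED_SUFFIXES)]


# Made-up sample paths, for illustration only.
sample = [
    'kenlm/util/file.cc',
    'kenlm/util/file_test.cc',
    'kenlm/lm/query_main.cc',
    'kenlm/util/bit_packing.cc',
]
kept = drop_entry_points(sample)
print(kept)
```

Any path ending in `unittest.cc` also ends in `test.cc`, so the third suffix does not change the result; it is kept here only to mirror the original condition.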

deepspeech/models/ds2/__init__.py @ ccdfd5b3

-from .deepspeech2 import DeepSpeech2Model
+# Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
 from .deepspeech2 import DeepSpeech2InferModel
+from .deepspeech2 import DeepSpeech2Model
 __all__ = ['DeepSpeech2Model', 'DeepSpeech2InferModel']

deepspeech/models/ds2/deepspeech2.py @ ccdfd5b3

@@ -19,15 +19,15 @@ from paddle import nn
 from yacs.config import CfgNode
 from deepspeech.models.ds2.conv import ConvStack
-from deepspeech.modules.ctc import CTCDecoder
 from deepspeech.models.ds2.rnn import RNNStack
+from deepspeech.modules.ctc import CTCDecoder
 from deepspeech.utils import layer_tools
 from deepspeech.utils.checkpoint import Checkpoint
 from deepspeech.utils.log import Log
 logger = Log(__name__).getlog()
-__all__ = ['DeepSpeech2Model', 'DeepSpeech2InferMode']
+__all__ = ['DeepSpeech2Model', 'DeepSpeech2InferModel']
 class CRNNEncoder(nn.Layer):

@@ -117,7 +117,7 @@ class DeepSpeech2Model(nn.Layer):
         :type share_weights: bool
         :return: A tuple of an output unnormalized log probability layer (
                  before softmax) and a ctc cost layer.
-        :rtype: tuple of LayerOutput
+        :rtype: tuple of LayerOutput
         """
     @classmethod

@@ -206,10 +206,10 @@ class DeepSpeech2Model(nn.Layer):
         config: yacs.config.CfgNode
             model configs
         checkpoint_path: Path or str
             the path of pretrained model checkpoint, without extension name
         Returns
         -------
         DeepSpeech2Model
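
The `__all__` correction in the first hunk is more than cosmetic: `from module import *` raises `AttributeError` when `__all__` lists a name the module does not define. The following is a minimal sketch of a check for that situation. It assumes nothing about the repository beyond the names visible in the diff, and it uses a throwaway module built with `types.ModuleType` rather than the real package. The helper `missing_exports` and the stand-in module `demo_ds2` are hypothetical.

```python
import types


def missing_exports(module):
    """Return names listed in module.__all__ that the module does not define."""
    exported = getattr(module, '__all__', [])
    return [name for name in exported if not hasattr(module, name)]


# Stand-in module; the real classes are replaced by empty placeholders.
demo = types.ModuleType('demo_ds2')
demo.DeepSpeech2Model = type('DeepSpeech2Model', (), {})
demo.DeepSpeech2InferModel = type('DeepSpeech2InferModel', (), {})

# Misspelled entry, as on the removed line of the diff.
demo.__all__ = ['DeepSpeech2Model', 'DeepSpeech2InferMode']
print(missing_exports(demo))

# Corrected entry, as on the added line of the diff.
demo.__all__ = ['DeepSpeech2Model', 'DeepSpeech2InferModel']
print(missing_exports(demo))
```

A check of this kind could run in a unit test over each package's `__init__.py`, so that a misspelled export is caught before a star import fails at runtime.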

examples/callcenter/s1/README.md @ ccdfd5b3

@@ -17,4 +17,4 @@
 | conformer | 45.73 M | conf/chunk_conformer.yaml | spec_aug + shift | test | attention | 16, -1 | 2.23287845 | 0.087982 |
 | conformer | 45.73 M | conf/chunk_conformer.yaml | spec_aug + shift | test | ctc_greedy_search | 16, -1 | 2.23287845 | 0.086962 |
 | conformer | 45.73 M | conf/chunk_conformer.yaml | spec_aug + shift | test | ctc_prefix_beam_search | 16, -1 | 2.23287845 | 0.086741 |
-| conformer | 45.73 M | conf/chunk_conformer.yaml | spec_aug + shift | test | attention_rescoring | 16, -1 | 2.23287845 | 0.083495 |
+| conformer | 45.73 M | conf/chunk_conformer.yaml | spec_aug + shift | test | attention_rescoring | 16, -1 | 2.23287845 | 0.083495 |

examples/librispeech/s1/README.md @ ccdfd5b3

@@ -2,10 +2,10 @@
 ## Data
-| Data Subset | Duration in Seconds |
-| data/manifest.train | 0.83s ~ 29.735s |
+| Data Subset | Duration in Seconds |
+| data/manifest.train | 0.83s ~ 29.735s |
 | data/manifest.dev | 1.065 ~ 35.155s |
-| data/manifest.test-clean | 1.285s ~ 34.955s |
+| data/manifest.test-clean | 1.285s ~ 34.955s |
 ## Conformer