PaddlePaddle
DeepSpeech
Commit 1cc7905d
Authored Jan 25, 2022 by 小湉湉
rm csmsc.py, test=tts
Parent: 4c3e57a2
Showing 2 changed files with 0 additions and 57 deletions (+0 -57)
paddlespeech/t2s/datasets/__init__.py  +0 -1
paddlespeech/t2s/datasets/csmsc.py  +0 -56
paddlespeech/t2s/datasets/__init__.py
@@ -12,5 +12,4 @@
 # See the License for the specific language governing permissions and
 # limitations under the License.
 from .common import *
-from .csmsc import *
 from .ljspeech import *
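Because `csmsc.py` declared `__all__ = ["CSMSCMetaData"]`, the removed wildcard import re-exported only that one name from `paddlespeech.t2s.datasets`. The following is a minimal sketch of that mechanism using a stand-in module; `fake_csmsc` and `_Helper` are hypothetical names used for illustration and do not exist in the repository.

```python
import sys
import types

# Build a stand-in for the deleted submodule (hypothetical module name).
fake = types.ModuleType("fake_csmsc")
exec(
    '__all__ = ["CSMSCMetaData"]\n'
    "class CSMSCMetaData: pass\n"
    "class _Helper: pass\n"
    "index = 1\n",
    fake.__dict__,
)
sys.modules["fake_csmsc"] = fake

# Emulate what a package __init__.py does with `from .csmsc import *`.
namespace = {}
exec("from fake_csmsc import *", namespace)

# Only names listed in __all__ are re-exported; `index` and `_Helper` are not.
exported = sorted(k for k in namespace if not k.startswith("__"))
print(exported)
```

Under this reading, any caller that relied on `from paddlespeech.t2s.datasets import CSMSCMetaData` would presumably need updating after this commit; the diff itself does not show such callers.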
paddlespeech/t2s/datasets/csmsc.py (deleted, mode 100644 → 0)
# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import os
from pathlib import Path

from paddle.io import Dataset

__all__ = ["CSMSCMetaData"]


class CSMSCMetaData(Dataset):
    def __init__(self, root):
        """
        :param root: the path of baker dataset
        """
        self.root = os.path.abspath(root)
        records = []
        index = 1
        self.meta_info = ["file_path", "text", "pinyin"]

        metadata_path = os.path.join(root, "ProsodyLabeling/000001-010000.txt")
        wav_dirs = os.path.join(self.root, "Wave")
        with open(metadata_path, 'r', encoding='utf-8') as f:
            while True:
                line1 = f.readline().strip()
                if not line1:
                    break
                line2 = f.readline().strip()
                strs = line1.split()
                wav_fname = line1.split()[0].strip() + '.wav'
                wav_filepath = os.path.join(wav_dirs, wav_fname)
                text = strs[1].strip()
                pinyin = line2
                records.append([wav_filepath, text, pinyin])

        self.records = records

    def __getitem__(self, i):
        return self.records[i]

    def __len__(self):
        return len(self.records)

    def get_meta_info(self):
        return self.meta_info
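The deleted loader read the metadata file as pairs of lines: the first line of each pair carries the utterance ID and text, the second carries the pinyin. A minimal sketch of that pairing logic without the `paddle.io.Dataset` dependency might look like the following; `read_csmsc_pairs`, `make_demo_root`, and the two sample entries are hypothetical and only imitate the layout the deleted code expects.

```python
import os
import tempfile


def read_csmsc_pairs(root):
    """Return [wav_path, text, pinyin] records, mirroring the deleted loader."""
    root = os.path.abspath(root)
    metadata_path = os.path.join(root, "ProsodyLabeling", "000001-010000.txt")
    wav_dir = os.path.join(root, "Wave")
    records = []
    with open(metadata_path, "r", encoding="utf-8") as f:
        while True:
            line1 = f.readline().strip()
            if not line1:  # stop at EOF or at the first blank line
                break
            line2 = f.readline().strip()
            fields = line1.split()
            wav_path = os.path.join(wav_dir, fields[0] + ".wav")
            records.append([wav_path, fields[1], line2])
    return records


def make_demo_root():
    """Create a throwaway directory with two made-up entries (not real data)."""
    root = tempfile.mkdtemp()
    os.makedirs(os.path.join(root, "ProsodyLabeling"))
    sample = (
        "000001\t你好#1世界\n"
        "\tni3 hao3 shi4 jie4\n"
        "000002\t再见\n"
        "\tzai4 jian4\n"
    )
    path = os.path.join(root, "ProsodyLabeling", "000001-010000.txt")
    with open(path, "w", encoding="utf-8") as f:
        f.write(sample)
    return root


demo_records = read_csmsc_pairs(make_demo_root())
for wav_path, text, pinyin in demo_records:
    print(os.path.basename(wav_path), text, pinyin)
```

Like the original, this sketch stops at the first empty line and assumes every ID line is followed by a pinyin line; a file with a trailing odd line would yield a record with an empty pinyin field.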