Commit 9c0b6c5b · Author: Hui Zhang

fix audio shape bug for audio len

Parent 718bd307
@@ -154,8 +154,8 @@ class SpeechCollator():
random_seed (int, optional): for random generator. Defaults to 0.
keep_transcription_text (bool, optional): True, when not in training mode, will not do tokenizer; Defaults to False.
if ``keep_transcription_text`` is False, text is token ids else is raw string.
- Do augmentations
+ Do augmentations
Padding audio features with zeros to make them have the same shape (or
a user-defined shape) within one batch.
"""
@@ -271,7 +271,7 @@ class SpeechCollator():
utts.append(utt)
# audio
audios.append(audio) # [T, D]
- audio_lens.append(audio.shape[1])
+ audio_lens.append(audio.shape[0])
# text
# for training, text is token ids
# else text is string, convert to unicode ord
......
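
The change in the second hunk follows from the `# [T, D]` comment: if each feature matrix has T frames along axis 0 and D feature dimensions along axis 1, the per-utterance length is `shape[0]`, while `shape[1]` returns the same feature dimension for every utterance. The sketch below illustrates this with plain Python lists instead of real arrays. The helpers `make_feature`, `shape`, and `pad_batch` are hypothetical and are not part of `SpeechCollator`; the zero padding is modelled on the docstring's description, not on the actual implementation.

```python
# Hypothetical stand-ins for [T, D] feature matrices:
# T frames (time steps), each holding D feature values.
def make_feature(num_frames, feat_dim):
    return [[0.0] * feat_dim for _ in range(num_frames)]


def shape(feature):
    """Return (T, D) for a non-empty list-of-lists feature matrix."""
    return (len(feature), len(feature[0]))


def pad_batch(features, pad_value=0.0):
    """Pad every [T, D] feature along the time axis to the longest T in the batch."""
    lens = [shape(f)[0] for f in features]  # axis 0 is time, as in the fixed line
    max_len = max(lens)
    feat_dim = shape(features[0])[1]
    padded = [
        f + [[pad_value] * feat_dim for _ in range(max_len - len(f))]
        for f in features
    ]
    return padded, lens


# Two utterances with 3 and 5 frames, 80 feature dimensions each (assumed values).
batch = [make_feature(3, 80), make_feature(5, 80)]
padded, audio_lens = pad_batch(batch)

# Behaviour before the fix: reading axis 1 yields D for every utterance.
wrong_lens = [shape(f)[1] for f in batch]
```

Here `audio_lens` holds the true frame counts, which downstream code would need for masking the padded frames, whereas `wrong_lens` is constant across the batch and carries no length information.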