Unverified commit cfed8d01, authored by Jackwaterveg, committed by GitHub

Merge pull request #1061 from LittleChenCc/develop

[Bug Fix] fix bugs in the data reader
@@ -42,7 +42,7 @@ if [ ${stage} -le -1 ] && [ ${stop_stage} -ge -1 ]; then
     # generate manifests
     python3 ${TARGET_DIR}/ted_en_zh/ted_en_zh.py \
     --manifest_prefix="data/manifest" \
-    --src_dir="${data_dir}"
+    --src-dir="${data_dir}"
     echo "Complete raw data pre-process."
 fi
......
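The flag rename in the hunk above matters because command-line parsers match option strings exactly. The sketch below assumes that `ted_en_zh.py` uses `argparse` and declares the option as `--src-dir`; the diff does not show this, so the parser here is hypothetical and is not the project's code.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical stand-in for the script's argument parser (an assumption).
    parser = argparse.ArgumentParser(description="manifest generator (sketch)")
    parser.add_argument("--manifest_prefix", default="data/manifest")
    # Declared with a hyphen; argparse exposes the value as `args.src_dir`.
    parser.add_argument("--src-dir", default=None)
    return parser


parser = build_parser()

# Hyphenated spelling: recognized, nothing left over.
ok, unknown_ok = parser.parse_known_args(["--src-dir=/data/ted"])

# Underscore spelling: not recognized, so the value never reaches `src_dir`.
bad, unknown_bad = parser.parse_known_args(["--src_dir=/data/ted"])

print(ok.src_dir, unknown_ok)    # /data/ted []
print(bad.src_dir, unknown_bad)  # None ['--src_dir=/data/ted']
```

With the stricter `parse_args`, the underscore spelling would terminate the script with an "unrecognized arguments" error instead of being returned in the leftover list.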
@@ -102,9 +102,9 @@ def read_manifest(
     with jsonlines.open(manifest_path, 'r') as reader:
         for json_data in reader:
             feat_len = json_data["input"][0]["shape"][
-                0] if 'shape' in json_data["input"][0] else 1.0
+                0] if "input" in json_data and "shape" in json_data["input"][0] else 1.0
             token_len = json_data["output"][0]["shape"][
-                0] if 'shape' in json_data["output"][0] else 1.0
+                0] if "output" in json_data and "shape" in json_data["output"][0] else 1.0
             conditions = [
                 feat_len >= min_input_len,
                 feat_len <= max_input_len,
......
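The guard added in `read_manifest` can be read as a small lookup with a default. The following is a minimal sketch of that logic, not the project's code: the helper name `first_dim` and the sample records are hypothetical. It shows that a manifest entry with no `"input"` or `"output"` key now falls back to `1.0` instead of raising `KeyError`.

```python
def first_dim(record: dict, key: str, default: float = 1.0) -> float:
    """Return record[key][0]["shape"][0], or `default` if `key` or "shape" is missing."""
    # Check that the top-level key exists before indexing into it,
    # mirroring the `"input" in json_data and "shape" in ...` condition.
    if key in record and "shape" in record[key][0]:
        return record[key][0]["shape"][0]
    return default


# Hypothetical manifest records for illustration.
full = {"input": [{"shape": [120, 80]}], "output": [{"shape": [15, 5000]}]}
no_output = {"input": [{"shape": [120, 80]}]}
no_shape = {"input": [{"name": "feat"}], "output": [{"shape": [15, 5000]}]}

print(first_dim(full, "input"))        # 120
print(first_dim(no_output, "output"))  # 1.0
print(first_dim(no_shape, "input"))    # 1.0
```

Like the expression in the diff, this sketch still assumes that `record[key]` is a non-empty list when the key is present; an empty list would raise `IndexError`.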