Format input examples to match the dataset. (#276)

test=develop

Format input examples to match the dataset. (#276)
test=develop
e1a8d5c6 · mhlwsk · Zeyu Chen · 14755b3b · e1a8d5c6
隐藏空白更改
内联并排

Showing with 11 addition and 0 deletion

demo/sequence-labeling/predict.py demo/sequence-labeling/predict.py +11 -0

未找到文件。
--- a/demo/sequence-labeling/predict.py
+++ b/demo/sequence-labeling/predict.py
@@ -100,6 +100,17 @@ if __name__ == '__main__':
        ["不过重在晋趣，略增明人气息，妙在集古有道、不露痕迹罢了。"],
    ]

+    # Add 0x02 between characters to match the format of training data,
+    # otherwise the length of prediction results will not match the input string
+    # if the input string contains non-Chinese characters.
+    tmp_data = []
+    for example in data:
+        formatted = []
+        for sentence in example:
+            formatted.append('\x02'.join(list(sentence)))
+        tmp_data.append(formatted)
+    data = tmp_data
+
    run_states = seq_label_task.predict(data=data)
    results = [run_state.run_results for run_state in run_states]