未验证 提交 337c8e67 编写于 作者: X Xiaoyao Xi 提交者: GitHub

Merge pull request #22 from xixiaoyao/master

refine logs
...@@ -762,7 +762,10 @@ class MRCReader(BaseReader): ...@@ -762,7 +762,10 @@ class MRCReader(BaseReader):
features = [] features = []
unique_id = 1000000000 unique_id = 1000000000
print('converting examples to features...')
for (example_index, example) in enumerate(examples): for (example_index, example) in enumerate(examples):
if example_index % 1000 == 0:
print('processing {}th example...'.format(example_index))
query_tokens = tokenizer.tokenize(example.question_text) query_tokens = tokenizer.tokenize(example.question_text)
if len(query_tokens) > self.max_query_length: if len(query_tokens) > self.max_query_length:
query_tokens = query_tokens[0:self.max_query_length] query_tokens = query_tokens[0:self.max_query_length]
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册