提交 7da85c6c 编写于 作者: G guosheng

Refine comments of batch_size in Transformer

上级 99437ee9
......@@ -43,9 +43,11 @@ def parse_args():
parser.add_argument(
"--batch_size",
type=int,
default=2000,
default=2048,
help="The number of sequences contained in a mini-batch, or the maximum "
"number of tokens (include paddings) contained in a mini-batch.")
"number of tokens (include paddings) contained in a mini-batch. Note "
"that this represents the number on single device and the actual batch "
"size for multi-devices will multiply the device number.")
parser.add_argument(
"--pool_size",
type=int,
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册