Merge pull request #1574 from velconia/change_transformer_delete_scope_strategy

Change transformer for adapting to new delete scope strategy
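
For context, `num_iteration_per_drop_scope` in Paddle Fluid's `ExecutionStrategy` sets how many iterations run before the temporary variables in the local execution scopes are cleaned up. The sketch below is a plain-Python simulation of that idea, not Paddle code: `run_iterations` and `drop_every` are hypothetical names, and the `fetch_steps` value is made up for illustration. The PR does not state the type of `args.fetch_steps`; the string here only mirrors the `int(...)` cast in the changed line.

```python
# Illustrative simulation only; it does not import or call PaddlePaddle.
def run_iterations(num_iterations, drop_every):
    """Return the number of temporaries alive after each iteration.

    Each iteration creates one temporary; all temporaries are released
    once every `drop_every` iterations.
    """
    if drop_every < 1:
        raise ValueError("drop_every must be >= 1")
    live = 0
    history = []
    for step in range(1, num_iterations + 1):
        live += 1  # temporaries produced by this iteration
        if step % drop_every == 0:
            live = 0  # scope dropped: temporaries released
        history.append(live)
    return history


fetch_steps = "3"  # hypothetical value; assumed to arrive as a string
history = run_iterations(6, int(fetch_steps))
print(history)
```

With a larger interval, temporaries live longer between cleanups; with an interval of 1 they are released after every iteration.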

e5d416a8 · Qiyang Min · GitHub · 1fb1a82f · 3191ccfc · e5d416a8

Showing 1 changed file with 1 addition and 1 deletion

fluid/PaddleNLP/neural_machine_translation/transformer/train.py +1 -1

--- a/fluid/PaddleNLP/neural_machine_translation/transformer/train.py
+++ b/fluid/PaddleNLP/neural_machine_translation/transformer/train.py
@@ -469,7 +469,7 @@ def train_loop(exe,
    # For faster executor
    exec_strategy = fluid.ExecutionStrategy()
    exec_strategy.use_experimental_executor = True
-    # exec_strategy.num_iteration_per_drop_scope = 5
+    exec_strategy.num_iteration_per_drop_scope = int(args.fetch_steps)
    build_strategy = fluid.BuildStrategy()
    # Since the token number differs among devices, customize gradient scale to
    # use token average cost among multi-devices. and the gradient scale is