未验证 提交 ad251cb5 编写于 作者: S sneaxiy 提交者: GitHub

add missing scale parameter (#47519)

上级 fece00d0
......@@ -193,7 +193,7 @@ def fused_allreduce_gradients_with_group(
else _apply_collective_grads
)
with framework.no_grad():
apply_func(parameter_list, group, bucket_size)
apply_func(parameter_list, group, bucket_size, scale)
def fused_allreduce_gradients(parameter_list, hcg):
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册