Created by: danleifeng
In fp16_util.py, the update_role_var_grad function changes the role of cast ops, which takes effect in ParallelExecutor. In Executor, however, it may cause errors because there is no NCCL synchronization.
To solve this problem, we move the Optimize-role ops behind all of the Backward-role ops. This can also speed up Executor training.
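For illustration only, a minimal sketch of the reordering idea; the names `OpRole`, `reorder_ops`, and `get_op_role` below are hypothetical helpers, not the actual Paddle API (in fluid the role is stored in the op attribute named by `core.op_proto_and_checker_maker.kOpRoleAttrName()`):

```python
from enum import Enum


class OpRole(Enum):
    # Hypothetical stand-in for Paddle's op-role markers.
    FORWARD = 0
    BACKWARD = 1
    OPTIMIZE = 2


def reorder_ops(ops, get_op_role):
    """Stable partition of a block's op list.

    Ops keep their original relative order, except that every op marked
    with the Optimize role is moved after all other (forward/backward) ops,
    so no optimize op runs before the backward pass has finished.
    """
    non_optimize = [op for op in ops if get_op_role(op) != OpRole.OPTIMIZE]
    optimize = [op for op in ops if get_op_role(op) == OpRole.OPTIMIZE]
    return non_optimize + optimize
```

Since the partition is stable, dependencies among the backward ops and among the optimize ops are preserved; only the interleaving between the two groups is removed.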