Fork自 PaddlePaddle / Paddle
cherry-pick #30553 fix bug of multicard grad ncclAllReduce, the gradient accumulater of parameters should be keep order, otherwsie, it will influence multicard ncclAllReduce of grad.
拖放文件到此处或点击上传