Remove the CUDA stream synchronization between each operator. (!6284) · 合并请求 · PaddlePaddle / Paddle

Remove the CUDA stream synchronization between each operator. !6284

Created by: qingqing01

Fix https://github.com/PaddlePaddle/Paddle/issues/6283

At first, we add this CUDA stream synchronization in the operator developing period to detect the CUDA error of each CUDA kernel. When the framework is stable, this synchronization should be removed to speed up training.

PaddlePaddle / Paddle 1 年多 前同步成功

Remove the CUDA stream synchronization between each operator. !6284

PaddlePaddle / Paddle
1 年多前同步成功