Merge pull request #13460 from reyoung/fix_data_transform

Wait input when data transform

Merge pull request #13460 from reyoung/fix_data_transform
Wait input when data transform
aa79bccf · Yu Yang · GitHub · 5dc51750 · 922dee3b · aa79bccf
隐藏空白更改
内联并排

Showing with 4 addition and 0 deletion

paddle/fluid/framework/data_device_transform.cc paddle/fluid/framework/data_device_transform.cc +4 -0

未找到文件。
--- a/paddle/fluid/framework/data_device_transform.cc
+++ b/paddle/fluid/framework/data_device_transform.cc
@@ -25,6 +25,10 @@ void TransDataDevice(const Tensor &in, const platform::Place &dst_place,
      in.place().which(), dst_place.which(),
      "Currently, model parallelism is only supported between CPU and CUDA");

+  // NOTE(yy): TransDataDevice should wait for computation of input.
+  platform::DeviceContextPool::Instance().Get(in.place())->Wait();
+  platform::DeviceContextPool::Instance().Get(dst_place)->Wait();
+
  // FIXME(zcd): TransDataDevice is used to transform data from GPU to CPU and
  // the enforced checkings have been done in GetDeviceContext, so the
  // `dev_ctx->Wait()` is necessary. But `dev_ctx->Wait()` will make the program