Created by: chengduoZH
In multi-process mode, the method of getting Place should be :
place = fluid.CUDAPlace(fluid.dygraph.parallel.Env().dev_id) \ if args.use_data_parallel else fluid.CUDAPlace(0)