Fork自 PaddlePaddle / Paddle
* add var grad hook test=develop
implement dygraph.parallel.DataParallel to hook reduce op.
add NCCLParallelContext for parallel dygraph