Fork自 PaddlePaddle / Paddle
* add cuda_device_functions.h * move reduceSum to elementwise_op_function.h