Created by: wangchaochaohu
Fixes https://github.com/PaddlePaddle/Paddle/pull/21643. Uses MeanCUDAGradKernel, which gives about a 20% performance improvement (size 1000000).