Fork自 PaddlePaddle / Paddle
* make CUDA_ARCH_NAME default Auto test=develop * refine warning test=develop