Created by: cjld
Currently, the paddle is compiled by cuda9. It causes RTX series ran slowly(40% slower). We have to support CUDA 10 for Mining the performance of RTX series.