Fork自 PaddlePaddle / Paddle
* refine structure for cuda and rocm * update * update * update * update
* update index sample