Created by: chengduoZH
Related issue: https://github.com/PaddlePaddle/Paddle/issues/8818
This PR contains only the GPU implementation.