Fork自 PaddlePaddle / Paddle
1. Remove 'top 1'(or argmax) from CPU and GPU kernel 2. Add a new test case 3. Refine doc