Forked from PaddlePaddle / PaddleDetection
* fix __shfl_down_sync_ of cross_entropy
* use reduceSum
* "fix ci"