Fork自 PaddlePaddle / Paddle
* add temporal shift and grad *test=kunlun * fix reduce mean grad bug *test=kunlun