Created by: guoshengCS
The calculations for final output are different between gru_unit_op and gru_op:
h_t = dot((1 - u_t), h_{t-1}) + dot(u_t, {h}_t)
h_t = dot((1 - u_t), {h}_t) + dot(u_t, h_{t-1})