Input size in GRU operator (#12698) · Issue · PaddlePaddle / Paddle

Input size in GRU operator

Created by: tpatejko

I'm working on integrating MKLDNN GRU primitive in PaddlePaddle. I would like to clarify my issue understanding format and size of Input tensor.

is total time step a length of the longest sequence in the batch, or is it the sum of lengths of all the sequences in the batch?