Created by: FrostML
【WIP】In dygraph similarity_net, list problems before this fix:
- use
fluid.layers.reshape(size=[-1, 1])
to guarantee that the shape of Embedding's input is [xx, xx, 1], which is NOT recommended and NOT proper for batch input. - the behavior for lstm and gru to compute sequence representation is NOT align with static graph, which is
fluid.layers.reduce_sum
whilesequence_last_step
for static graph.
Due to pre-commit, code format has been changed. Mark changes as follow to help review.