Created by: lcy-seso
- Add a sub_nested_seq layer.
- it accepts two inputs: the first one is a nested sequence in PaddlePaddle; the second one is a learnable score or distribution over each sequence in the nested sequence.
- Then sub_nest_seq_layer selects top k sentences with highest scores or probabilities according to the second input.