TODO list for the ConvS2S.
Created by: lcy-seso
- [ ] prepare the dataset https://arxiv.org/abs/1605.00459. a shared task with Transformer
- [ ] enhance the lookup_table operator to support the special token: padding index. https://github.com/PaddlePaddle/Paddle/issues/7309. a shared task with Transformer
- implement a general purpose normalization operator https://github.com/PaddlePaddle/Paddle/issues/7350. @lcy-seso
-
wrap the weight normalization.
- l2 normalize layer.
- wrap weight normalization.
- serialize weight normalization for inference.
- wrap the positional embedding.
- wrap the dot product attention. https://github.com/PaddlePaddle/Paddle/pull/7602
- wrap GLU unit. https://github.com/PaddlePaddle/Paddle/pull/7525
- wrap deep convolution encoding and decoding block with attention.
- build the entire model.
- enhance the documentation of operators used in ConvS2S.
- add beam search for ConvS2S.
- merge the entire model into the models repo (actually can merge the work part by part).
- clip by norm