Created by: guoshengCS
Decouple the program desc from batch_size in the Transformer model. The inference program has been validated to generate the same sentences for different batch sizes. This PR depends on https://github.com/PaddlePaddle/Paddle/pull/9008 .
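
For readers unfamiliar with this kind of check, here is a minimal sketch of what "same generated sentences for different batch sizes" means. It is plain Python and does not use the Paddle API or the Transformer code from this PR; `greedy_decode`, `decode_in_batches`, and the toy token rule are all hypothetical names made up for illustration. The only point it shows is that the batch size is read from the input at run time rather than baked into the program.

```python
# Hypothetical illustration only; not the Transformer code from this PR.

def greedy_decode(batch, max_len=5):
    """Toy decoder whose batch size is read from the input, never hard-coded."""
    batch_size = len(batch)  # derived at run time, not stored in the program
    outputs = [[] for _ in range(batch_size)]
    for i, src in enumerate(batch):
        token = sum(src) % 7  # made-up deterministic start token
        for _ in range(max_len):
            outputs[i].append(token)
            token = (token * 3 + 1) % 7  # made-up next-token rule
    return outputs


def decode_in_batches(samples, batch_size):
    """Split samples into chunks of batch_size and decode each chunk."""
    results = []
    for start in range(0, len(samples), batch_size):
        results.extend(greedy_decode(samples[start:start + batch_size]))
    return results


samples = [[1, 2, 3], [4, 5], [6], [7, 8, 9, 10]]
reference = decode_in_batches(samples, batch_size=1)
```

Comparing `decode_in_batches(samples, batch_size=n)` against `reference` for several values of `n` is the same style of check described above, reduced to a toy decoder.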