"relu" or "sigmoid" should be added to the end of fc layer
Created by: gmcather
I did a test, and found there's is a minor flaw in the code. https://github.com/PaddlePaddle/models/blob/develop/fluid/neural_machine_translation/transformer/model.py#L169, Add "relu" function might be better.