Created by: pkuyym
Currently, Paddle saves only part of the model; the optimizer state (Ada/Momentum) is dropped. However, this information is necessary if we want to resume training from a pre-trained model.
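
To illustrate why the dropped state matters, here is a minimal sketch in plain Python. It does not use Paddle's API: `momentum_step`, `train`, the checkpoint dictionaries, and the use of `pickle` are all hypothetical stand-ins, and the toy objective f(w) = w**2 is an assumption chosen for brevity. It compares resuming from a checkpoint that includes the momentum buffer with resuming from one that holds parameters only.

```python
import pickle


def momentum_step(w, v, grad, lr=0.1, mu=0.9):
    """One SGD-with-momentum update; returns the new (weight, velocity)."""
    v = mu * v - lr * grad
    return w + v, v


def train(w, v, steps):
    """Run `steps` updates on the toy objective f(w) = w**2 (gradient 2*w)."""
    for _ in range(steps):
        w, v = momentum_step(w, v, 2.0 * w)
    return w, v


# Reference: 10 uninterrupted training steps.
w_ref, _ = train(1.0, 0.0, 10)

# Interrupted run: stop after 5 steps and write two kinds of checkpoint.
w5, v5 = train(1.0, 0.0, 5)
full_ckpt = pickle.dumps({"w": w5, "v": v5})   # parameters + momentum buffer
params_only_ckpt = pickle.dumps({"w": w5})     # parameters only

# Resume with the optimizer state restored.
state = pickle.loads(full_ckpt)
w_full, _ = train(state["w"], state["v"], 5)

# Resume without it: the momentum buffer has to be re-initialized to zero.
state = pickle.loads(params_only_ckpt)
w_partial, _ = train(state["w"], state.get("v", 0.0), 5)

print("uninterrupted:        ", w_ref)
print("resumed, full state:  ", w_full)
print("resumed, params only: ", w_partial)
```

In this sketch the run resumed from the full checkpoint follows the same trajectory as the uninterrupted run, while the parameters-only resume ends at a different weight because the momentum buffer restarts from zero.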