Merge pull request #819 from guoshengCS/refine-transformer-logit
Avoid predicting <pad> by restricting the size of fc_layer in Transformer
Showing
想要评论请 注册 或 登录
Avoid predicting <pad> by restricting the size of fc_layer in Transformer