Make the distillation process not save teacher variables in PaddleSlim (#19633)
* split teacher checkpoints with student checkpoints, test=develop
* add unittest for graph.merge(), test=develop