Make the distillation process not save teacher variables in PaddleSlim (#19633)
* split teacher checkpoints with student checkpoints, test=develop * add unittest for graph.merge(), test=develop
Showing
想要评论请 注册 或 登录
Fork自 PaddlePaddle / Paddle
* split teacher checkpoints with student checkpoints, test=develop * add unittest for graph.merge(), test=develop