<p>fsp_loss adds an FSP (flow of solution procedure) loss between teacher and student variables within a program. It comes from the paper <a href="http://openaccess.thecvf.com/content_cvpr_2017/papers/Yim_A_Gift_From_CVPR_2017_paper.pdf">A Gift from Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning</a>.</p>
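<p>To make the quantity concrete, here is a minimal pure-Python sketch of an FSP loss for a single sample. It is not the library's implementation: the helper names <code>fsp_matrix</code> and <code>fsp_loss_sketch</code> are hypothetical, feature maps are assumed to be nested lists shaped [channels][height][width], and reducing with a mean of squared differences is an assumption of this sketch.</p>

```python
def fsp_matrix(feat_a, feat_b):
    """Return the C1 x C2 FSP matrix of two feature maps.

    feat_a is [C1][H][W] and feat_b is [C2][H][W]; both must share H and W.
    Entry (i, j) is the spatial average of feat_a[i] * feat_b[j].
    """
    height = len(feat_a[0])
    width = len(feat_a[0][0])
    area = height * width
    return [
        [
            sum(chan_a[y][x] * chan_b[y][x]
                for y in range(height) for x in range(width)) / area
            for chan_b in feat_b
        ]
        for chan_a in feat_a
    ]


def fsp_loss_sketch(teacher_a, teacher_b, student_a, student_b):
    """Mean squared difference between teacher and student FSP matrices.

    Assumption: the teacher pair and the student pair yield matrices of the
    same shape, i.e. matching channel counts.
    """
    g_teacher = fsp_matrix(teacher_a, teacher_b)
    g_student = fsp_matrix(student_a, student_b)
    squared = [
        (t - s) ** 2
        for row_t, row_s in zip(g_teacher, g_student)
        for t, s in zip(row_t, row_s)
    ]
    return sum(squared) / len(squared)


# Toy feature maps: one channel and two channels, both 2 x 2.
layer_a = [[[1.0, 2.0], [3.0, 4.0]]]
layer_b = [[[1.0, 1.0], [1.0, 1.0]], [[0.0, 1.0], [0.0, 1.0]]]
zeros_a = [[[0.0, 0.0], [0.0, 0.0]]]

# Identical teacher and student features give a loss of zero.
loss_same = fsp_loss_sketch(layer_a, layer_b, layer_a, layer_b)
# A student whose first layer is all zeros is penalized.
loss_diff = fsp_loss_sketch(layer_a, layer_b, zeros_a, layer_b)
```

<p>Each FSP matrix summarizes how features change between two layers of the same network, so the loss pushes the student to reproduce the teacher's layer-to-layer flow rather than its raw activations.</p>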
<p>soft_label_loss adds a soft label loss between teacher and student variables within a program. It comes from the paper <a href="https://arxiv.org/pdf/1503.02531.pdf">Distilling the Knowledge in a Neural Network</a>.</p>
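<p>As an illustration, the following pure-Python sketch computes a soft label loss for one sample as the cross-entropy between temperature-softened teacher and student distributions. It is not the library's implementation: the function names and the <code>teacher_temperature</code> and <code>student_temperature</code> parameters are assumptions made for this sketch, and the inputs are assumed to be raw logits.</p>

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax of logits divided by a temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def soft_label_loss_sketch(teacher_logits, student_logits,
                           teacher_temperature=1.0, student_temperature=1.0):
    """Cross-entropy of the student distribution against the teacher's soft labels."""
    p_teacher = softmax(teacher_logits, teacher_temperature)
    p_student = softmax(student_logits, student_temperature)
    return -sum(pt * math.log(ps) for pt, ps in zip(p_teacher, p_student))


# A student that matches the teacher scores lower than one that disagrees.
loss_match = soft_label_loss_sketch([2.0, 0.0], [2.0, 0.0])
loss_mismatch = soft_label_loss_sketch([2.0, 0.0], [0.0, 2.0])
```

<p>Raising the temperature flattens both distributions, which exposes more of the teacher's relative preferences among the non-target classes.</p>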