[cherry-pick] some fix (#2920)
* fix model name
* fix: bug during distillation
* modify some default hyperparameters to suit fine-tuning on downstream tasks
  1. disable EMA, because most downstream datasets are relatively small;
  2. use the mean and std of ImageNet (IMN).