fix for clip
1. fix bias_attr to False for conv of PatchEmbed; 2. support return_tokens_mean for Head of CLIP; 3. support remove_cls_token_in_forward for CLIP; 4. support head_init_scale argument for ViT backbone; 5. support get_num_layers() and no_weight_decay() for ViT backbone.
Showing
想要评论请 注册 或 登录