未验证 提交 83f6e924 编写于 作者: W Wenyu 提交者: GitHub

update vitdet model links (#6602)

* update model pretrained links
上级 494f381f
# Vision transformer Detection # Vision Transformer Detection
## Introduction ## Introduction
...@@ -15,8 +15,8 @@ non-trivial when new architectures, such as Vision Transformer (ViT) models, arr ...@@ -15,8 +15,8 @@ non-trivial when new architectures, such as Vision Transformer (ViT) models, arr
| Backbone | Pretrained | Model | Scheduler | Images/GPU | Box AP | Config | Download | | Backbone | Pretrained | Model | Scheduler | Images/GPU | Box AP | Config | Download |
|:------:|:--------:|:--------------:|:--------------:|:--------------:|:------:|:------:|:--------:| |:------:|:--------:|:--------------:|:--------------:|:--------------:|:------:|:------:|:--------:|
| ViT-base | CAE | Cascade RCNN | 1x | 1 | -- | [config](./cascade_rcnn_vit_base_hrfpn_cae_1x_coco.yml) | [coming soon]() | | ViT-base | CAE | Cascade RCNN | 1x | 1 | 52.7 | [config](./cascade_rcnn_vit_base_hrfpn_cae_1x_coco.yml) | [model](https://bj.bcebos.com/v1/paddledet/models/cascade_rcnn_vit_base_hrfpn_cae_1x_coco.pdparams) |
| ViT-large | CAE | Cascade RCNN | 1x | 1 | -- | [config](./cascade_rcnn_vit_large_hrfpn_cae_1x_coco.yml) | [coming soon]() | | ViT-large | CAE | Cascade RCNN | 1x | 1 | 55.7 | [config](./cascade_rcnn_vit_large_hrfpn_cae_1x_coco.yml) | [model](https://bj.bcebos.com/v1/paddledet/models/cascade_rcnn_vit_large_hrfpn_cae_1x_coco.pdparams) |
**Notes:** **Notes:**
- Model is trained on COCO train2017 dataset and evaluated on val2017 results of `mAP(IoU=0.5:0.95)`. - Model is trained on COCO train2017 dataset and evaluated on val2017 results of `mAP(IoU=0.5:0.95)`.
......
...@@ -52,7 +52,7 @@ VisionTransformer: ...@@ -52,7 +52,7 @@ VisionTransformer:
epsilon: 0.000001 # 1e-6 epsilon: 0.000001 # 1e-6
out_indices: [3, 5, 7, 11] out_indices: [3, 5, 7, 11]
with_fpn: True with_fpn: True
pretrained: ~ pretrained: https://bj.bcebos.com/v1/paddledet/models/pretrained/vit_base_cae_pretrained.pdparams
HRFPN: HRFPN:
out_channel: 256 out_channel: 256
......
...@@ -15,7 +15,7 @@ VisionTransformer: ...@@ -15,7 +15,7 @@ VisionTransformer:
num_heads: 16 num_heads: 16
drop_path_rate: 0.25 drop_path_rate: 0.25
out_indices: [7, 11, 15, 23] out_indices: [7, 11, 15, 23]
pretrained: ~ pretrained: https://bj.bcebos.com/v1/paddledet/models/pretrained/vit_large_cae_pretrained.pdparams
HRFPN: HRFPN:
in_channels: [*dim, *dim, *dim, *dim] in_channels: [*dim, *dim, *dim, *dim]
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册