README.md 10.0 KB
Newer Older
N
Nikita Manovich 已提交
1 2
# Computer Vision Annotation Tool (CVAT)

3 4 5 6 7 8
[![CI][ci-img]][ci-url]
[![Gitter chat][gitter-img]][gitter-url]
[![Coverage Status][coverage-img]][coverage-url]
[![server pulls][docker-server-pulls-img]][docker-server-image-url]
[![ui pulls][docker-ui-pulls-img]][docker-ui-image-url]
[![DOI][doi-img]][doi-url]
9

10 11 12
CVAT is free, online, interactive video and image annotation
tool for computer vision. It is being used by our team to
annotate million of objects with different properties. Many UI
13 14
and UX decisions are based on feedbacks from professional data
annotation team. Try it online [cvat.org](https://cvat.org).
N
Nikita Manovich 已提交
15

N
Nikita Manovich 已提交
16
![CVAT screenshot](cvat/apps/documentation/static/documentation/images/cvat.jpg)
N
Nikita Manovich 已提交
17 18 19

## Documentation

N
Nikita Manovich 已提交
20 21
- [Installation guide](cvat/apps/documentation/installation.md)
- [User's guide](cvat/apps/documentation/user_guide.md)
22
- [Django REST API documentation](#rest-api)
B
Boris Sekachev 已提交
23
- [Datumaro dataset framework](https://github.com/openvinotoolkit/datumaro/blob/develop/README.md)
24
- [Command line interface](utils/cli/)
N
Nikita Manovich 已提交
25 26
- [XML annotation format](cvat/apps/documentation/xml_format.md)
- [AWS Deployment Guide](cvat/apps/documentation/AWS-Deployment-Guide.md)
27
- [Frequently asked questions](cvat/apps/documentation/faq.md)
N
Nikita Manovich 已提交
28
- [Questions](#questions)
N
Nikita Manovich 已提交
29 30 31

## Screencasts

T
TOsmanov 已提交
32 33 34 35
- [Introduction](https://youtu.be/JERohTFp-NI)
- [Annotation mode](https://youtu.be/vH_639N67HI)
- [Interpolation of bounding boxes](https://youtu.be/Hc3oudNuDsY)
- [Interpolation of polygons](https://youtu.be/K4nis9lk92s)
B
Boris Sekachev 已提交
36
- [Tag annotation video](https://youtu.be/62bI4mF-Xfk)
T
TOsmanov 已提交
37
- [Attribute mode](https://youtu.be/iIkJsOkDzVA)
T
TOsmanov 已提交
38 39 40
- [Segmentation mode](https://youtu.be/9Fe_GzMLo3E)
- [Tutorial for polygons](https://youtu.be/C7-r9lZbjBw)
- [Semi-automatic segmentation](https://youtu.be/9HszWP_qsRQ)
41

42
## Supported annotation formats
A
Andrey Zhavoronkov 已提交
43

A
Anastasia Yasakova 已提交
44 45 46 47 48 49 50
Format selection is possible after clicking on the Upload annotation and Dump
annotation buttons. [Datumaro](https://github.com/openvinotoolkit/datumaro)
dataset framework allows additional dataset transformations via its command
line tool and Python library.

For more information about supported formats look at the
[documentation](cvat/apps/dataset_manager/formats/README.md#formats).
N
Nikita Manovich 已提交
51

V
Vitaliy Nishukov 已提交
52 53 54 55 56 57 58 59 60
| Annotation format                                                             | Import | Export |
| ----------------------------------------------------------------------------- | ------ | ------ |
| [CVAT for images](cvat/apps/documentation/xml_format.md#annotation)           | X      | X      |
| [CVAT for a video](cvat/apps/documentation/xml_format.md#interpolation)       | X      | X      |
| [Datumaro](https://github.com/openvinotoolkit/datumaro)                       |        | X      |
| [PASCAL VOC](http://host.robots.ox.ac.uk/pascal/VOC/)                         | X      | X      |
| Segmentation masks from [PASCAL VOC](http://host.robots.ox.ac.uk/pascal/VOC/) | X      | X      |
| [YOLO](https://pjreddie.com/darknet/yolo/)                                    | X      | X      |
| [MS COCO Object Detection](http://cocodataset.org/#format-data)               | X      | X      |
61
| [TFrecord](https://www.tensorflow.org/tutorials/load_data/tfrecord)           | X      | X      |
V
Vitaliy Nishukov 已提交
62 63
| [MOT](https://motchallenge.net/)                                              | X      | X      |
| [LabelMe 3.0](http://labelme.csail.mit.edu/Release3.0)                        | X      | X      |
A
Anastasia Yasakova 已提交
64 65
| [ImageNet](http://www.image-net.org)                                          | X      | X      |
| [CamVid](http://mi.eng.cam.ac.uk/research/projects/VideoRec/CamVid/)          | X      | X      |
66
| [WIDER Face](http://shuoyang1213.me/WIDERFACE/)                               | X      | X      |
67
| [VGGFace2](https://github.com/ox-vgg/vgg_face2)                               | X      | X      |
68
| [Market-1501](https://www.aitribune.com/dataset/2018051063)                   | X      | X      |
69
| [ICDAR13/15](https://rrc.cvc.uab.es/?ch=2)                                    | X      | X      |
A
Andrey Zhavoronkov 已提交
70

71 72 73
## Deep learning serverless functions for automatic labeling

<!--lint disable maximum-line-length-->
74

A
Ali Jahani 已提交
75 76
| Name                                                                                                    | Type       | Framework  | CPU | GPU |
| ------------------------------------------------------------------------------------------------------- | ---------- | ---------- | --- | --- |
77 78 79 80 81 82 83 84 85 86
| [Deep Extreme Cut](/serverless/openvino/dextr/nuclio)                                                   | interactor | OpenVINO   | X   |     |
| [Faster RCNN](/serverless/openvino/omz/public/faster_rcnn_inception_v2_coco/nuclio)                     | detector   | OpenVINO   | X   |     |
| [Mask RCNN](/serverless/openvino/omz/public/mask_rcnn_inception_resnet_v2_atrous_coco/nuclio)           | detector   | OpenVINO   | X   |     |
| [YOLO v3](/serverless/openvino/omz/public/yolo-v3-tf/nuclio)                                            | detector   | OpenVINO   | X   |     |
| [Object reidentification](/serverless/openvino/omz/intel/person-reidentification-retail-300/nuclio)     | reid       | OpenVINO   | X   |     |
| [Semantic segmentation for ADAS](/serverless/openvino/omz/intel/semantic-segmentation-adas-0001/nuclio) | detector   | OpenVINO   | X   |     |
| [Text detection v4](/serverless/openvino/omz/intel/text-detection-0004/nuclio)                          | detector   | OpenVINO   | X   |     |
| [SiamMask](/serverless/pytorch/foolwood/siammask/nuclio)                                                | tracker    | PyTorch    | X   |     |
| [f-BRS](/serverless/pytorch/saic-vul/fbrs/nuclio)                                                       | interactor | PyTorch    | X   |     |
| [Inside-Outside Guidance](/serverless/pytorch/shiyinzhang/iog/nuclio)                                   | interactor | PyTorch    | X   |     |
A
Ali Jahani 已提交
87
| [Faster RCNN](/serverless/tensorflow/faster_rcnn_inception_v2_coco/nuclio)                              | detector   | TensorFlow | X   | X   |
88
| [Mask RCNN](/serverless/tensorflow/matterport/mask_rcnn/nuclio)                                         | detector   | TensorFlow | X   | X   |
89 90

<!--lint enable maximum-line-length-->
N
Nikita Manovich 已提交
91

92
## Online demo: [cvat.org](https://cvat.org)
93

94 95 96
This is an online demo with the latest version of the annotation tool.
Try it online without local installation. Only own or assigned tasks
are visible to users.
97

98
Disabled features:
V
Vitaliy Nishukov 已提交
99

100 101 102
- [Analytics: management and monitoring of data annotation team](/components/analytics/README.md)

Limitations:
V
Vitaliy Nishukov 已提交
103

104 105
- No more than 10 tasks per user
- Uploaded data is limited to 500Mb
106

107 108 109 110 111 112 113
## Prebuilt Docker images

Prebuilt docker images for CVAT releases are available on Docker Hub:

- [cvat_server](https://hub.docker.com/r/openvino/cvat_server)
- [cvat_ui](https://hub.docker.com/r/openvino/cvat_ui)

114 115
## REST API

116 117
Automatically generated Swagger documentation for Django REST API is available
on `<cvat_origin>/api/swagger`(default: `localhost:8080/api/swagger`).
118

119 120 121
Swagger documentation is visiable on allowed hostes, Update environement
variable in docker-compose.yml file with cvat hosted machine IP or domain
name. Example - `ALLOWED_HOSTS: 'localhost, 127.0.0.1'`.
122

N
Nikita Manovich 已提交
123 124 125 126
## LICENSE

Code released under the [MIT License](https://opensource.org/licenses/MIT).

127 128 129 130 131 132 133 134 135 136
This software uses LGPL licensed libraries from the [FFmpeg](https://www.ffmpeg.org) project.
The exact steps on how FFmpeg was configured and compiled can be found in the [Dockerfile](Dockerfile).

FFmpeg is an open source framework licensed under LGPL and GPL.
See [https://www.ffmpeg.org/legal.html](https://www.ffmpeg.org/legal.html). You are solely responsible
for determining if your use of FFmpeg requires any
additional licenses. Intel is not responsible for obtaining any
such licenses, nor liable for any licensing fees due in
connection with your use of FFmpeg.

I
idriss 已提交
137 138
## Questions

N
Nikita Manovich 已提交
139 140 141
CVAT usage related questions or unclear concepts can be posted in our
[Gitter chat](https://gitter.im/opencv-cvat) for **quick replies** from
contributors and other users.
I
idriss 已提交
142

N
Nikita Manovich 已提交
143 144
However, if you have a feature request or a bug report that can reproduced,
feel free to open an issue (with steps to reproduce the bug if it's a bug
V
Vitaliy Nishukov 已提交
145
report) on [GitHub\* issues](https://github.com/opencv/cvat/issues).
I
idriss 已提交
146

N
Nikita Manovich 已提交
147
If you are not sure or just want to browse other users common questions,
148 149 150
[Gitter chat](https://gitter.im/opencv-cvat) is the way to go.

Other ways to ask questions and get our support:
V
Vitaliy Nishukov 已提交
151 152 153

- [\#cvat](https://stackoverflow.com/search?q=%23cvat) tag on StackOverflow\*
- [Forum on Intel Developer Zone](https://software.intel.com/en-us/forums/computer-vision)
154 155

## Links
V
Vitaliy Nishukov 已提交
156

157 158 159
- [Intel AI blog: New Computer Vision Tool Accelerates Annotation of Digital Images and Video](https://www.intel.ai/introducing-cvat)
- [Intel Software: Computer Vision Annotation Tool: A Universal Approach to Data Annotation](https://software.intel.com/en-us/articles/computer-vision-annotation-tool-a-universal-approach-to-data-annotation)
- [VentureBeat: Intel open-sources CVAT, a toolkit for data labeling](https://venturebeat.com/2019/03/05/intel-open-sources-cvat-a-toolkit-for-data-labeling/)
160 161

## Projects using CVAT
162

163 164 165
- [Onepanel](https://github.com/onepanelio/core) - Onepanel is an open source
  vision AI platform that fully integrates CVAT with scalable data processing
  and parallelized training pipelines.
166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181

<!-- prettier-ignore-start -->
<!-- Badges -->

[docker-server-pulls-img]: https://img.shields.io/docker/pulls/openvino/cvat_server.svg?style=flat-square&label=server%20pulls
[docker-server-image-url]: https://hub.docker.com/r/openvino/cvat_server
[docker-ui-pulls-img]: https://img.shields.io/docker/pulls/openvino/cvat_ui.svg?style=flat-square&label=UI%20pulls
[docker-ui-image-url]: https://hub.docker.com/r/openvino/cvat_ui
[ci-img]: https://github.com/openvinotoolkit/cvat/workflows/CI/badge.svg?branch=develop
[ci-url]: https://github.com/openvinotoolkit/cvat/actions
[gitter-img]: https://badges.gitter.im/opencv-cvat/gitter.png
[gitter-url]: https://gitter.im/opencv-cvat
[coverage-img]: https://coveralls.io/repos/github/openvinotoolkit/cvat/badge.svg?branch=develop
[coverage-url]: https://coveralls.io/github/openvinotoolkit/cvat?branch=develop
[doi-img]: https://zenodo.org/badge/139156354.svg
[doi-url]: https://zenodo.org/badge/latestdoi/139156354