English | [简体中文](README_cn.md)

# PaddleDetection

The goal of PaddleDetection is to provide easy access to a wide range of object
detection models in both industry and research settings. PaddleDetection is
designed to be not only performant and production-ready but also highly
flexible, catering to research needs.

**All models in PaddleDetection now require PaddlePaddle version 1.6 or higher, or a suitable develop version.**
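
As a rough way to check this requirement, the sketch below compares a version string against the 1.6 minimum. The helper names `parse_version` and `meets_requirement` are hypothetical and not part of PaddleDetection. Treating `0.0.0` as a develop build is an assumption about how develop builds report their version, and the sketch cannot tell whether a given develop build is actually suitable.

```python
import re

# Minimum release stated in this README.
MIN_VERSION = (1, 6)


def parse_version(version):
    """Extract up to three leading numeric components, e.g. '1.6.2' -> (1, 6, 2)."""
    parts = re.findall(r"\d+", version)
    return tuple(int(p) for p in parts[:3])


def meets_requirement(version, minimum=MIN_VERSION):
    """Return True if `version` is at least `minimum`.

    Assumption: develop builds report '0.0.0', so they are accepted
    here without any further check.
    """
    if version.startswith("0.0.0"):
        return True
    return parse_version(version) >= minimum


print(meets_requirement("1.6.0"))
print(meets_requirement("1.5.2"))
```

In practice the version string would come from the installed package, for example `paddle.__version__`.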

<div align="center">
  <img src="demo/output/000000570688.jpg" />
</div>


## Introduction

Features:

- Production Ready:

  Key operations are implemented in C++ and CUDA. Together with PaddlePaddle's
  highly efficient inference engine, this enables easy deployment in server environments.

- Highly Flexible:

  Components are designed to be modular. Model architectures, as well as data
  preprocessing pipelines, can be easily customized with simple configuration
  changes.

- Performance Optimized:

  With the help of the underlying PaddlePaddle framework, faster training and
  a reduced GPU memory footprint are achieved. Notably, YOLOv3 training is
  much faster compared to other frameworks. As another example, with Mask R-CNN
  (ResNet50) we managed to fit up to 4 images per GPU (Tesla V100 16GB) during
  multi-GPU training.

Supported Architectures:

|                     | ResNet | ResNet-vd <sup>[1](#vd)</sup> | ResNeXt-vd | SENet | MobileNet | DarkNet | VGG  |
| ------------------- | :----: | ----------------------------: | :--------: | :---: | :-------: | :-----: | :--: |
| Faster R-CNN        |   ✓    |                             ✓ |     ✗      |   ✓   |     ✗     |    ✗    |  ✗   |
| Faster R-CNN + FPN  |   ✓    |                             ✓ |     ✓      |   ✓   |     ✗     |    ✗    |  ✗   |
| Mask R-CNN          |   ✓    |                             ✓ |     ✗      |   ✓   |     ✗     |    ✗    |  ✗   |
| Mask R-CNN + FPN    |   ✓    |                             ✓ |     ✓      |   ✓   |     ✗     |    ✗    |  ✗   |
| Cascade Faster-RCNN |   ✓    |                             ✓ |     ✓      |   ✗   |     ✗     |    ✗    |  ✗   |
| Cascade Mask-RCNN   |   ✓    |                             ✗ |     ✗      |   ✓   |     ✗     |    ✗    |  ✗   |
| RetinaNet           |   ✓    |                             ✗ |     ✗      |   ✗   |     ✗     |    ✗    |  ✗   |
| YOLOv3              |   ✓    |                             ✗ |     ✗      |   ✗   |     ✓     |    ✓    |  ✗   |
| SSD                 |   ✗    |                             ✗ |     ✗      |   ✗   |     ✓     |    ✗    |  ✓   |

<a name="vd">[1]</a> [ResNet-vd](https://arxiv.org/pdf/1812.01187) models offer much improved accuracy with negligible performance cost.
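
The support matrix above can also be expressed as a small lookup, which may be handy for validating an architecture and backbone pair before training. This is only an illustrative sketch transcribed from the table: `SUPPORTED_BACKBONES` and `is_supported` are hypothetical names, not part of PaddleDetection.

```python
# Architecture -> backbones marked as supported in the table above.
# Hypothetical structure for illustration; not a PaddleDetection API.
SUPPORTED_BACKBONES = {
    "Faster R-CNN": {"ResNet", "ResNet-vd", "SENet"},
    "Faster R-CNN + FPN": {"ResNet", "ResNet-vd", "ResNeXt-vd", "SENet"},
    "Mask R-CNN": {"ResNet", "ResNet-vd", "SENet"},
    "Mask R-CNN + FPN": {"ResNet", "ResNet-vd", "ResNeXt-vd", "SENet"},
    "Cascade Faster-RCNN": {"ResNet", "ResNet-vd", "ResNeXt-vd"},
    "Cascade Mask-RCNN": {"ResNet", "SENet"},
    "RetinaNet": {"ResNet"},
    "YOLOv3": {"ResNet", "MobileNet", "DarkNet"},
    "SSD": {"MobileNet", "VGG"},
}


def is_supported(architecture, backbone):
    """Return True if the table lists this combination as supported."""
    return backbone in SUPPORTED_BACKBONES.get(architecture, set())


print(is_supported("YOLOv3", "DarkNet"))  # True
print(is_supported("SSD", "ResNet"))      # False
```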

Advanced Features:

- [x] **Synchronized Batch Norm**: currently used by YOLOv3.
- [x] **Group Norm**
- [x] **Modulated Deformable Convolution**
- [x] **Deformable PSRoI Pooling**

**NOTE:** Synchronized batch normalization can only be used with multiple GPU devices; it cannot be used on CPU devices or on a single GPU device.
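
The constraint in the note can be captured as a simple guard. This is a minimal sketch under stated assumptions: `can_use_sync_bn` is a hypothetical helper, not a PaddleDetection function, and the caller is assumed to already know whether GPUs are used and how many take part in training.

```python
def can_use_sync_bn(use_gpu, num_gpus):
    """Return True only for multi-GPU runs, following the note above.

    Hypothetical helper: CPU runs and single-GPU runs are rejected.
    """
    return bool(use_gpu) and num_gpus > 1


print(can_use_sync_bn(use_gpu=True, num_gpus=8))   # True
print(can_use_sync_bn(use_gpu=True, num_gpus=1))   # False
print(can_use_sync_bn(use_gpu=False, num_gpus=0))  # False
```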

## Get Started

- [Installation guide](docs/INSTALL.md)
- [Quick start on small dataset](docs/QUICK_STARTED.md)
- For detailed training and evaluation workflow, please refer to [GETTING_STARTED](docs/GETTING_STARTED.md)
- [Guide to the preprocessing pipeline and custom datasets](docs/DATA.md)
- [Introduction to the configuration workflow](docs/CONFIG.md)
- [Examples for detailed configuration explanation](docs/config_example/)
- [IPython Notebook demo](demo/mask_rcnn_demo.ipynb)
- [Transfer learning document](docs/TRANSFER_LEARNING.md)

## Model Zoo

- Pretrained models are available in the [PaddleDetection model zoo](docs/MODEL_ZOO.md).
- [Face detection models](configs/face_detection/README.md)
- [Pretrained models for pedestrian and vehicle detection](contrib/README.md)

## Model Compression

- [Quantization-aware training example](slim/quantization)
- [Model pruning example](slim/prune)

## Deployment

- [Export model for inference](docs/EXPORT_MODEL.md)
- [C++ inference](inference/README.md)

## Benchmark

- [Inference benchmark](docs/BENCHMARK_INFER_cn.md)

## Updates

#### 10/2019

- Add enhanced YOLOv3 models, box mAP up to 41.4%.
- Face detection models included: BlazeFace, Faceboxes.
- Enrich COCO models, box mAP up to 51.9%.
- Add CACascade RCNN, one of the best single models of the Objects365 2019 Challenge Full Track champion.
- Add pretrained models for pedestrian and vehicle detection.
- Support mixed-precision training.
- Add C++ inference deployment.
- Add model compression examples.

#### 2/9/2019

- Add retrained models for GroupNorm.
- Add Cascade-Mask-RCNN+FPN.

#### 5/8/2019

- Add a series of models related to Modulated Deformable Convolution.

#### 29/7/2019

- Update Chinese docs for PaddleDetection
- Fix bug in R-CNN models when training and testing at the same time
- Add ResNeXt101-vd + Mask R-CNN + FPN models
- Add YOLOv3 on VOC models

#### 3/7/2019

- Initial release of PaddleDetection and detection model zoo
- Models included: Faster R-CNN, Mask R-CNN, Faster R-CNN+FPN, Mask
  R-CNN+FPN, Cascade-Faster-RCNN+FPN, RetinaNet, YOLOv3, and SSD.


## Contributing

Contributions are highly welcome, and we would really appreciate your feedback!