文件 · github/fork/zimoqingfeng/release/0.4 · s920243400 / PaddleDetection · GitCode


English | 简体中文
Documentation:https://paddledetection.readthedocs.io

PaddleDetection
PaddleDetection is an end-to-end object detection development kit based on PaddlePaddle, which
aims to help developers in the whole development of training models, optimizing performance and
inference speed, and deploying models. PaddleDetection provides varied object detection architectures
in modular design, and wealthy data augmentation methods, network components, loss functions, etc.
PaddleDetection supported practical projects such as industrial quality inspection, remote sensing
image object detection, and automatic inspection with its practical features such as model compression
and multi-platform deployment.
PP-YOLO, which is faster and has higer performance than YOLOv4,
has been released, it reached mAP(0.5:0.95) as 45.2% on COCO test2019 dataset and 72.9 FPS on single
Test V100. Please refer to PP-YOLO for details.
Now all models in PaddleDetection require PaddlePaddle version 1.8 or higher, or suitable develop version.

  
Introduction
Features:


Rich models:
PaddleDetection provides rich of models, including 100+ pre-trained models
such as object detection, instance segmentation, face detection etc. It covers
the champion models, the practical detection models for cloud and edge device.


Production Ready:
Key operations are implemented in C++ and CUDA, together with PaddlePaddle's
highly efficient inference engine, enables easy deployment in server environments.


Highly Flexible:
Components are designed to be modular. Model architectures, as well as data
preprocess pipelines, can be easily customized with simple configuration
changes.


Performance Optimized:
With the help of the underlying PaddlePaddle framework, faster training and
reduced GPU memory footprint is achieved. Notably, YOLOv3 training is
much faster compared to other frameworks. Another example is Mask-RCNN
(ResNet50), we managed to fit up to 4 images per GPU (Tesla V100 16GB) during
multi-GPU training.


Supported Architectures:


ResNet
ResNet-vd ¹

ResNeXt-vd
SENet
MobileNet
HRNet
Res2Net


Faster R-CNN
✓
✓
x
✓
✗
✗
✗


Faster R-CNN + FPN
✓
✓
✓
✓
✗
✓
✓


Mask R-CNN
✓
✓
x
✓
✗
✗
✗


Mask R-CNN + FPN
✓
✓
✓
✓
✗
✗
✓


Cascade Faster-RCNN
✓
✓
✓
✗
✗
✗
✗


Cascade Mask-RCNN
✓
✗
✗
✓
✗
✗
✗


Libra R-CNN
✗
✓
✗
✗
✗
✗
✗


RetinaNet
✓
✗
✗
✗
✗
✗
✗


YOLOv3
✓
✓
✗
✗
✓
✗
✗


SSD
✗
✗
✗
✗
✓
✗
✗


BlazeFace
✗
✗
✗
✗
✗
✗
✗


Faceboxes
✗
✗
✗
✗
✗
✗
✗


[1] ResNet-vd models offer much improved accuracy with negligible performance cost.
NOTE: ✓ for config file and pretrain model provided in Model Zoo, ✗ for not provided but is supported generally.
More models:

EfficientDet
FCOS
CornerNet-Squeeze
YOLOv4
PP-YOLO

More Backbones:

DarkNet
VGG
GCNet
CBNet

Advanced Features:


 Synchronized Batch Norm


 Group Norm


 Modulated Deformable Convolution


 Deformable PSRoI Pooling


 Non-local and GCNet


NOTE: Synchronized batch normalization can only be used on multiple GPU devices, can not be used on CPU devices or single GPU device.
The following is the relationship between COCO mAP and FPS on Tesla V100 of representative models of each architectures and backbones.

  
NOTE:


CBResNet stands for Cascade-Faster-RCNN-CBResNet200vd-FPN, which has highest mAP on COCO as 53.3% in PaddleDetection models

Cascade-Faster-RCNN stands for Cascade-Faster-RCNN-ResNet50vd-DCN, which has been optimized to 20 FPS inference speed when COCO mAP as 47.8%
The enhanced YOLOv3-ResNet50vd-DCN is 10.6 absolute percentage points higher than paper on COCO mAP, and inference speed is nearly 70% faster than the darknet framework
All these models can be get in Model Zoo


The following is the relationship between COCO mAP and FPS on Tesla V100 of SOTA object detecters and PP-YOLO, which is faster and has better performance than YOLOv4, and reached mAP(0.5:0.95) as 45.2% on COCO test2019 dataset and 72.9 FPS on single Test V100. Please refer to PP-YOLO for details.

  
Tutorials

Get Started

Installation guide
Quick start on small dataset
Train/Evaluation/Inference
How to train a custom dataset
FAQ


Advanced Tutorial

Guide to preprocess pipeline and dataset definition
Models technical
Transfer learning document

Parameter configuration:

Introduction to the configuration workflow
Parameter configuration for RCNN model


IPython Notebook demo

Model compression

Model compression benchmark
Quantization
Model pruning
Model distillation
Neural Architecture Search


Deployment

Export model for inference
Python inference
C++ inference
Inference benchmark


Model Zoo

Pretrained models are available in the PaddleDetection model zoo.
Mobile models
Anchor free models
Face detection models
Pretrained models for pedestrian detection
Pretrained models for vehicle detection

YOLOv3 enhanced model: Compared to MAP of 33.0% in paper, enhanced YOLOv3 reaches the MAP of 43.6%, and inference speed is improved as well

PP-YOLO: PP-YOLO reeached mAP as 45.3% on COCO dataset，and 72.9 FPS on single Tesla V100
Objects365 2019 Challenge champion model
Best single model of Open Images 2019-Object Detction

Practical Server-side detection method: Inference speed on single V100 GPU can reach 20FPS when COCO mAP is 47.8%.

Large-scale practical object detection models: Large-scale practical server-side detection pretrained models with 676 categories are provided for most application scenarios, which can be used not only for direct inference but also finetuning on other datasets.


License
PaddleDetection is released under the Apache 2.0 license.

Updates
v0.4.0 was released at 05/2020, add PP-YOLO, TTFNet, HTC, ACFPN, etc. And add BlaceFace face landmark detection model, add a series of optimized SSDLite models on mobile side, add data augmentations GridMask and RandomErasing, add Matrix NMS and EMA training, and improved ease of use, fix many known bugs, etc.
Please refer to 版本更新文档 for details.

Contributing
Contributions are highly welcomed and we would really appreciate your feedback!!