[cherry pick] #1741 to release/0.5 (#1746)

* Update README_cn.md * Correct Format of README_en.md * Correct details in README_en.md Co-authored-by: N Guanghua Yu <742925032@qq.com>

[cherry pick] #1741 to release/0.5 (#1746)
* Update README_cn.md * Correct Format of README_en.md * Correct details in README_en.md Co-authored-by: N Guanghua Yu <742925032@qq.com>
8d4f12bc · mls1999725 · GitHub · 0c59c04f · 8d4f12bc · 8d4f12bc
隐藏空白更改
内联并排

Showing with 225 addition and 138 deletion

README_cn.md README_cn.md +1 -1

README_en.md README_en.md +224 -137

未找到文件。
--- a/README_cn.md
+++ b/README_cn.md
@@ -15,7 +15,7 @@ PaddleDetection模块化地实现了多种主流目标检测算法，提供了
 </div>
 ### 产品动态
+- 2020.10.01: 添加实例分割模型SOLOv2，在Tesla V100上达到38.6 FPS, COCO-val数据集上mask ap达到38.8，预测速度提高24%，mAP提高2.4个百分点。
 - 2020.09.30: 发布[移动端检测demo](deploy/android_demo)，可直接扫码安装体验。
 - 2020.09.21-27: 【目标检测7日打卡课】手把手教你从入门到进阶，深入了解目标检测算法的前世今生。立即加入课程QQ交流群(1136406895)一起学习吧 :)
 - 2020.07.24: 发布**产业最实用**目标检测模型 [PP-YOLO](https://arxiv.org/abs/2007.12099) ，深入考虑产业应用对精度速度的双重面诉求，COCO数据集精度45.2%(最新45.9%)，Tesla V100预测速度72.9 FPS，详细信息见[文档](configs/ppyolo/README_cn.md)。

--- a/README_en.md
+++ b/README_en.md
@@ -2,177 +2,264 @@ English | [简体中文](README_cn.md)
 Documentation:[https://paddledetection.readthedocs.io](https://paddledetection.readthedocs.io)
-# PaddleDetection
+# Introduction
-PaddleDetection is an end-to-end object detection development kit based on PaddlePaddle, which
+PaddleDetection is an end-to-end object detection development kit based on PaddlePaddle, which aims to help developers in the whole development of constructing, training, optimizing and deploying detection models in a faster and better way.
-aims to help developers in the whole development of training models, optimizing performance and
-inference speed, and deploying models. PaddleDetection provides varied object detection architectures
-in modular design, and wealthy data augmentation methods, network components, loss functions, etc.
-PaddleDetection supported practical projects such as industrial quality inspection, remote sensing
-image object detection, and automatic inspection with its practical features such as model compression
-and multi-platform deployment.
-[PP-YOLO](https://arxiv.org/abs/2007.12099), which is faster and has higer performance than YOLOv4,
+PaddleDetection implements varied mainstream object detection algorithms in modular design, and provides wealthy data augmentation methods, network components(such as backbones), loss functions, etc., and integrates abilities of model compression and cross-platform high-performance deployment.
-has been released, it reached mAP(0.5:0.95) as 45.2%(newest 45.9%) on COCO test2019 dataset and
-72.9 FPS on single Test V100. Please refer to [PP-YOLO](configs/ppyolo/README.md) for details.
-**Now all models in PaddleDetection require PaddlePaddle version 1.8 or higher, or suitable develop version.**
+After a long time of industry practice polishing, PaddleDetection has had smooth and excellent user experience, it has been widely used by developers in more than ten industries such as industrial quality inspection, remote sensing image object detection, automatic inspection, new retail, Internet, and scientific research.
 <div align="center">
-  <img src="docs/images/000000570688.jpg" />
+  <img src="docs/images/football.gif" width='800'/>
 </div>
+### Product dynamic
-## Introduction
+- 2020.10.01: Added SOLOv2 as an instance segmentation model, which reached 38.6 FPS on a single Tesla V100, 38.8 mask AP on Coco-Val dataset, and  inference speed increased by 24% and mAP by 2.4 percentage points.
-Features:
+- 2020.09.30: Released the mobile-side detection demo, and you can directly scan the code for installation experience.
+- 2020.09.21-27: [Object detection 7 days of punching class] Hand in hand to teach you from the beginning to the advanced level, in-depth understanding of the object detection algorithm life. Join the course QQ group (1136406895) to study together :)
- Rich models:
+- 2020.07.24: [PP-YOLO](https://arxiv.org/abs/2007.12099), which is **the most practical** object detection model, was released, it deeply considers the double demands of industrial applications for accuracy and speed, and reached accuracy as 45.2% (the latest 45.9%) on COCO dataset, inference speed as 72.9 FPS on a single Test V100. Please refer to [PP-YOLO](https://arxiv.org/abs/2007.12099) for details.
+- 2020.06.11: Publish 676 classes of large-scale server-side practical object detection models that are applicable to most application scenarios and can be used directly for prediction or for fine-tuning other tasks.
-  PaddleDetection provides rich of models, including 100+ pre-trained models
-such as object detection, instance segmentation, face detection etc. It covers
+### Features
-the champion models, the practical detection models for cloud and edge device.
+- **Rich Models**
- Production Ready:
+PaddleDetection provides rich of models, including **100+ pre-trained models** such as **object detection**, **instance segmentation**, **face detection** etc. It covers a variety of **global competition champion** schemes.
-  Key operations are implemented in C++ and CUDA, together with PaddlePaddle's
+- **Use Concisely**
-highly efficient inference engine, enables easy deployment in server environments.
+Modular design, decouple each network component, developers easily build and try various detection models and optimization strategies, quickly get high-performance, customized algorithm.
- Highly Flexible:
+- **Getting Through End to End**
+From data augmentation, constructing models, training, compression, depolyment, get through end to end, and complete support for multi-architecture, multi-device deployment for **cloud and edge device**.
-  Components are designed to be modular. Model architectures, as well as data
-preprocess pipelines, can be easily customized with simple configuration
+- **High Performance:**
-changes.
+Based on the high performance core of PaddlePaddle, advantages of training speed and memory occupation are obvious. Support FP16 training, support multi-machine training.
- Performance Optimized:
+#### Overview of Kit Structures
-  With the help of the underlying PaddlePaddle framework, faster training and
+<table>
-reduced GPU memory footprint is achieved. Notably, YOLOv3 training is
+  <tbody>
-much faster compared to other frameworks. Another example is Mask-RCNN
+    <tr align="center" valign="bottom">
-(ResNet50), we managed to fit up to 4 images per GPU (Tesla V100 16GB) during
+      <td>
-multi-GPU training.
+        <b>Architectures</b>
+      </td>
-Supported Architectures:
+      <td>
+        <b>Backbones</b>
-|                     | ResNet | ResNet-vd <sup>[1](#vd)</sup> | ResNeXt-vd | SENet | MobileNet |  HRNet | Res2Net |
+      </td>
-| ------------------- | :----: | ----------------------------: | :--------: | :---: | :-------: |:------:|:-----:  |
+      <td>
-| Faster R-CNN        |   ✓    |                             ✓ |     x      |   ✓   |     ✗     |   ✗    |  ✗      |
+        <b>Components</b>
-| Faster R-CNN + FPN  |   ✓    |                             ✓ |     ✓      |   ✓   |     ✗     |   ✓    |  ✓      |
+      </td>
-| Mask R-CNN          |   ✓    |                             ✓ |     x      |   ✓   |     ✗     |   ✗    |  ✗      |
+      <td>
-| Mask R-CNN + FPN    |   ✓    |                             ✓ |     ✓      |   ✓   |     ✗     |   ✗    |  ✓      |
+        <b>Data Augmentation</b>
-| Cascade Faster-RCNN |   ✓    |                             ✓ |     ✓      |   ✗   |     ✗     |   ✗    |  ✗      |
+      </td>
-| Cascade Mask-RCNN   |   ✓    |                             ✗ |     ✗      |   ✓   |     ✗     |   ✗    |  ✗      |
+    </tr>
-| Libra R-CNN         |   ✗    |                             ✓ |     ✗      |   ✗   |     ✗     |   ✗    |  ✗      |
+    <tr valign="top">
-| RetinaNet           |   ✓    |                             ✗ |     ✗      |   ✗   |     ✗     |   ✗    |  ✗      |
+      <td>
-| YOLOv3              |   ✓    |                             ✓ |     ✗      |   ✗   |     ✓     |   ✗    |  ✗      |
+        <ul><li><b>Two-Stage Detection</b></li>
-| SSD                 |   ✗    |                             ✗ |     ✗      |   ✗   |     ✓     |   ✗    |  ✗      |
+          <ul>
-| BlazeFace           |   ✗    |                             ✗ |     ✗      |   ✗   |     ✗     |   ✗    |  ✗      |
+            <li>Faster RCNN</li>
-| Faceboxes           |   ✗    |                             ✗ |     ✗      |   ✗   |     ✗     |   ✗    |  ✗      |
+            <li>FPN</li>
+            <li>Cascade-RCNN</li>
-<a name="vd">[1]</a> [ResNet-vd](https://arxiv.org/pdf/1812.01187) models offer much improved accuracy with negligible performance cost.
+            <li>Libra RCNN</li>
+            <li>Hybrid Task RCNN</li>
-**NOTE:** ✓ for config file and pretrain model provided in [Model Zoo](docs/MODEL_ZOO.md), ✗ for not provided but is supported generally.
+            <li>PSS-Det RCNN</li>
+          </ul>
-More models:
+        </ul>
+        <ul><li><b>One-Stage Detection</b></li>
- EfficientDet
+          <ul>
- FCOS
+            <li>RetinaNet</li>
- CornerNet-Squeeze
+            <li>YOLOv3</li>
- YOLOv4
+            <li>YOLOv4</li>  
- PP-YOLO
+            <li>PP-YOLO</li>
+            <li>SSD</li>
-More Backbones:
+          </ul>
+        </ul>
- DarkNet
+        <ul><li><b>Anchor Free</b></li>
- VGG
+          <ul>
- GCNet
+            <li>CornerNet-Squeeze</li>
- CBNet
+            <li>FCOS</li>  
+            <li>TTFNet</li>
-Advanced Features:
+          </ul>
+        </ul>
- [x] **Synchronized Batch Norm**
+        <ul>
- [x] **Group Norm**
+          <li><b>Instance Segmentation</b></li>
- [x] **Modulated Deformable Convolution**
+            <ul>
- [x] **Deformable PSRoI Pooling**
+             <li>Mask RCNN</li>
- [x] **Non-local and GCNet**
+             <li>SOLOv2</li>
+            </ul>
-**NOTE:** Synchronized batch normalization can only be used on multiple GPU devices, can not be used on CPU devices or single GPU device.
+        </ul>
+        <ul>
-The following is the relationship between COCO mAP and FPS on Tesla V100 of representative models of each architectures and backbones.
+          <li><b>Face-Detction</b></li>
+            <ul>
+             <li>FaceBoxes</li>
+             <li>BlazeFace</li>
+             <li>BlazeFace-NAS</li>
+            </ul>
+        </ul>
+      </td>
+      <td>
+        <ul>
+          <li>ResNet(&vd)</li>
+          <li>ResNeXt(&vd)</li>
+          <li>SENet</li>
+          <li>Res2Net</li>
+          <li>HRNet</li>
+          <li>Hourglass</li>
+          <li>CBNet</li>
+          <li>GCNet</li>
+          <li>DarkNet</li>
+          <li>CSPDarkNet</li>
+          <li>VGG</li>
+          <li>MobileNetv1/v3</li>  
+          <li>GhostNet</li>
+          <li>Efficientnet</li>  
+        </ul>
+      </td>
+      <td>
+        <ul><li><b>Common</b></li>
+          <ul>
+            <li>Sync-BN</li>
+            <li>Group Norm</li>
+            <li>DCNv2</li>
+            <li>Non-local</li>
+          </ul>  
+        </ul>
+        <ul><li><b>FPN</b></li>
+          <ul>
+            <li>BiFPN</li>
+            <li>BFP</li>  
+            <li>HRFPN</li>
+            <li>ACFPN</li>
+          </ul>  
+        </ul>  
+        <ul><li><b>Loss</b></li>
+          <ul>
+            <li>Smooth-L1</li>
+            <li>GIoU/DIoU/CIoU</li>  
+            <li>IoUAware</li>
+          </ul>  
+        </ul>  
+        <ul><li><b>Post-processing</b></li>
+          <ul>
+            <li>SoftNMS</li>
+            <li>MatrixNMS</li>  
+          </ul>  
+        </ul>
+        <ul><li><b>Speed</b></li>
+          <ul>
+            <li>FP16 training</li>
+            <li>Multi-machine training </li>  
+          </ul>  
+        </ul>  
+      </td>
+      <td>
+        <ul>
+          <li>Resize</li>  
+          <li>Flipping</li>  
+          <li>Expand</li>
+          <li>Crop</li>
+          <li>Color Distort</li>  
+          <li>Random Erasing</li>  
+          <li>Mixup </li>
+          <li>Cutmix </li>
+          <li>Grid Mask</li>
+          <li>Auto Augment</li>  
+        </ul>  
+      </td>  
+    </tr>
+</td>
+    </tr>
+  </tbody>
+</table>
+#### Overview of Model Performance
+The relationship between COCO mAP and FPS on Tesla V100 of representative models of each architectures and backbones.
 <div align="center">
-  <img src="docs/images/map_fps.png" width=800 />
+  <img src="docs/images/map_fps.png" />
 </div>
 **NOTE:**
- `CBResNet` stands for `Cascade-Faster-RCNN-CBResNet200vd-FPN`, which has highest mAP on COCO as 53.3% in PaddleDetection models
- `Cascade-Faster-RCNN` stands for `Cascade-Faster-RCNN-ResNet50vd-DCN`, which has been optimized to 20 FPS inference speed when COCO mAP as 47.8%
- The enhanced `YOLOv3-ResNet50vd-DCN` is 10.6 absolute percentage points higher than paper on COCO mAP, and inference speed is nearly 70% faster than the darknet framework
- All these models can be get in [Model Zoo](#Model-Zoo)
-The following is the relationship between COCO mAP and FPS on Tesla V100 of SOTA object detecters and PP-YOLO, which is faster and has better performance than YOLOv4, and reached mAP(0.5:0.95) as 45.2% on COCO test2019 dataset and 72.9 FPS on single Test V100. Please refer to [PP-YOLO](configs/ppyolo/README.md) for details.
+- `CBResNet stands` for `Cascade-Faster-RCNN-CBResNet200vd-FPN`, which has highest mAP on COCO as 53.3%
-<div align="center">
+- `Cascade-Faster-RCNN` stands for `Cascade-Faster-RCNN-ResNet50vd-DCN`, which has been optimized to 20 FPS inference speed when COCO mAP as 47.8% in PaddleDetection models
-  <img src="docs/images/ppyolo_map_fps.png" width=600 />
-</div>
+- The enhanced PaddleDetection model `YOLOv3-ResNet50vd-DCN` is 10.6 absolute percentage points higher than paper on COCO mAP, and inference speed is 61.3 fps, nearly 70% faster than the darknet framework.
+All these models can be get in [Model Zoo](#ModelZoo)
-## Tutorials
+## Tutorials
 ### Get Started
- [Installation guide](docs/tutorials/INSTALL.md)
+- [Installation guide](docs/tutorials/INSTALL_cn.md)
- [Quick start on small dataset](docs/tutorials/QUICK_STARTED.md)
+- [Quick start on small dataset](docs/tutorials/QUICK_STARTED_cn.md)
 - [Prepare dataset](docs/tutorials/PrepareDataSet.md)
 - [Train/Evaluation/Inference/Deploy](docs/tutorials/DetectionPipeline.md)
 - [How to train a custom dataset](docs/tutorials/Custom_DataSet.md)
 - [FAQ](docs/FAQ.md)
-### Advanced Tutorial
+### Advanced Tutorials
- [Guide to preprocess pipeline and dataset definition](docs/advanced_tutorials/READER.md)
+- Parameter configuration
- [Models technical](docs/advanced_tutorials/MODEL_TECHNICAL.md)
+  - [Introduction to the configuration workflow](docs/advanced_tutorials/config_doc/CONFIG_cn.md)
- [Transfer learning document](docs/advanced_tutorials/TRANSFER_LEARNING.md)
- [Parameter configuration](docs/advanced_tutorials/config_doc):
-  - [Introduction to the configuration workflow](docs/advanced_tutorials/config_doc/CONFIG.md)
  - [Parameter configuration for RCNN model](docs/advanced_tutorials/config_doc/RCNN_PARAMS_DOC.md)
- [IPython Notebook demo](demo/mask_rcnn_demo.ipynb)
+  - [Parameter configuration for YOLOv3 model](docs/advanced_tutorials/config_doc/yolov3_mobilenet_v1.md)
- [Model compression](slim)
-    - [Model compression benchmark](slim)
+- Tansfer learning
-    - [Quantization](slim/quantization)
+  - [How to load pretrained model](docs/advanced_tutorials/TRANSFER_LEARNING_cn.md)
-    - [Model pruning](slim/prune)
-    - [Model distillation](slim/distillation)
+- Model Compression(Based on [PaddleSlim](https://github.com/PaddlePaddle/PaddleSlim))
-    - [Neural Architecture Search](slim/nas)
+  - [Model compression benchmark](slim)
- [Deployment](deploy)
+  - [Quantization](slim/quantization)
-    - [Export model for inference](docs/advanced_tutorials/deploy/EXPORT_MODEL.md)
+  - [Model pruning](slim/prune)
-    - [Python inference](deploy/python)
+  - [Model distillation](slim/distillation)
-    - [C++ inference](deploy/cpp)
+  - [Neural Architecture Search](slim/nas)
-    - [Mobile](https://github.com/PaddlePaddle/Paddle-Lite-Demo)
-    - [Serving](deploy/serving)
+- Inference and deployment
-    - [Inference benchmark](docs/advanced_tutorials/deploy/BENCHMARK_INFER_cn.md)
+  - [Export model for inference](docs/advanced_tutorials/deploy/EXPORT_MODEL.md)
+  - [Python inference](deploy/python)
+  - [C++ inference](deploy/cpp)
+  - [Mobile](https://github.com/PaddlePaddle/Paddle-Lite-Demo)
+  - [Serving](deploy/serving)
+  - [Inference benchmark](docs/advanced_tutorials/deploy/BENCHMARK_INFER_cn.md)
+- Advanced development
+  - [New data augmentations](docs/advanced_tutorials/READER.md)
+  - [New detection algorithms](docs/advanced_tutorials/MODEL_TECHNICAL.md)
 ## Model Zoo
- Pretrained models are available in the [PaddleDetection model zoo](docs/MODEL_ZOO.md).
+- Universal object detection
- [Mobile models](configs/mobile/README.md)
+  - [Model library and baselines](docs/MODEL_ZOO_cn.md)
- [Anchor free models](configs/anchor_free/README.md)
+  - [Mobile models](configs/mobile/README.md)
- [Face detection models](docs/featured_model/FACE_DETECTION_en.md)
+  - [Anchor free models](configs/anchor_free/README.md)
- [Pretrained models for pedestrian detection](docs/featured_model/CONTRIB.md)
+  - [PP-YOLO](configs/ppyolo/README_cn.md)
- [Pretrained models for vehicle detection](docs/featured_model/CONTRIB.md)
+  - [676 classes of object detection](docs/featured_model/LARGE_SCALE_DET_MODEL.md)
- [YOLOv3 enhanced model](docs/featured_model/YOLOv3_ENHANCEMENT.md): Compared to MAP of 33.0% in paper, enhanced YOLOv3 reaches the MAP of 43.6%, and inference speed is improved as well
+  - [Two-stage practical PSS-Det](configs/rcnn_enhance/README.md)
- [PP-YOLO](configs/ppyolo/README.md): PP-YOLO reeached mAP as 45.3% on COCO dataset，and 72.9 FPS on single Tesla V100
+- Vertical field
- [Objects365 2019 Challenge champion model](docs/featured_model/champion_model/CACascadeRCNN.md)
+  - [Face detection](docs/featured_model/FACE_DETECTION.md)
- [Best single model of Open Images 2019-Object Detction](docs/featured_model/champion_model/OIDV5_BASELINE_MODEL.md)
+  - [Pedestrian detection](docs/featured_model/CONTRIB_cn.md)
- [Practical Server-side detection method](configs/rcnn_enhance/README_en.md): Inference speed on single V100 GPU can reach 20FPS when COCO mAP is 47.8%.
+  - [Vehicle detection](docs/featured_model/CONTRIB_cn.md)
- [Large-scale practical object detection models](docs/featured_model/LARGE_SCALE_DET_MODEL_en.md): Large-scale practical server-side detection pretrained models with 676 categories are provided for most application scenarios, which can be used not only for direct inference but also finetuning on other datasets.
+- Competition Plan
+  - [Objects365 2019 Challenge champion model](docs/featured_model/champion_model/CACascadeRCNN.md)
+  - [Best single model of Open Images 2019-Object Detction](docs/featured_model/champion_model/OIDV5_BASELINE_MODEL.md)
+## Updates
+v0.4.0 was released at `07/2020`, add PP-YOLO, TTFNet, HTC, ACFPN, etc. And add BlaceFace face landmark detection model, add a series of optimized SSDLite models on mobile side, add data augmentations GridMask and RandomErasing, add Matrix NMS and EMA training, and improved ease of use, fix many known bugs, etc. Please refer to [change log](docs/CHANGELOG.md) for details.
 ## License
 PaddleDetection is released under the [Apache 2.0 license](LICENSE).
-## Updates
-v0.4.0 was released at `05/2020`, add PP-YOLO, TTFNet, HTC, ACFPN, etc. And add BlaceFace face landmark detection model, add a series of optimized SSDLite models on mobile side, add data augmentations GridMask and RandomErasing, add Matrix NMS and EMA training, and improved ease of use, fix many known bugs, etc.
-Please refer to [版本更新文档](docs/CHANGELOG.md) for details.
 ## Contributing