README.md 12.5 KB
Newer Older
G
gineshidalgo99 已提交
1 2 3
OpenPose
====================================

4 5 6 7 8 9 10 11
## Latest News
- Apr 2017: Body released!
- May 2017: Windows version released!
- Jun 2017: Face released!
- Check all the [release notes](doc/release_notes.md).



G
gineshidalgo99 已提交
12 13
## Introduction

G
gineshidalgo99 已提交
14
OpenPose is a **library for real-time multi-person keypoint detection and multi-threading written in C++** using OpenCV and Caffe*, authored by [Gines Hidalgo](https://www.linkedin.com/in/gineshidalgo/), [Zhe Cao](http://www.andrew.cmu.edu/user/zhecao), [Tomas Simon](http://www.cs.cmu.edu/~tsimon/), [Shih-En Wei](https://scholar.google.com/citations?user=sFQD3k4AAAAJ&hl=en), [Hanbyul Joo](http://www.cs.cmu.edu/~hanbyulj/) and [Yaser Sheikh](http://www.cs.cmu.edu/~yaser/).
G
gineshidalgo99 已提交
15

16
\* It uses Caffe, but the code is ready to be ported to other frameworks (Tensorflow, Torch, etc.). If you implement any of those, feel free to make a pull request!
G
gineshidalgo99 已提交
17

G
gineshidalgo99 已提交
18 19
OpenPose represents the **first real-time system to jointly detect human body, hand and facial keypoints (in total 130 keypoints) on single images**. In addition, the system computational performance on body keypoint estimation is invariant to the number of detected people in the image.

G
gineshidalgo99 已提交
20 21 22 23 24 25
OpenPose is freely available for free non-commercial use, and may be redistributed under these conditions. Please, see the [license](LICENSE) for further details. Contact us for commercial purposes.



Library main functionality:

26
* Multi-person 15 or **18-keypoint body pose** estimation and rendering. **Running time invariant of number of people** on the image.
G
gineshidalgo99 已提交
27

28
* Multi-person **2x21-keypoint hand** estimation and rendering. Note: In this initial version, **running time** linearly **depends** on the **number of people** on the image. **Coming soon (in around 1-5 weeks)!**
G
gineshidalgo99 已提交
29

30
* Multi-person **70-keypoint face** estimation and rendering. Note: In this initial version, **running time** linearly **depends** on the **number of people** on the image.
G
gineshidalgo99 已提交
31 32

* Flexible and easy-to-configure **multi-threading** module.
G
gineshidalgo99 已提交
33

T
tsimon 已提交
34
* Image, video, and webcam reader.
G
gineshidalgo99 已提交
35

G
gineshidalgo99 已提交
36
* Able to save and load the results in various formats (JSON, XML, PNG, JPG, ...).
G
gineshidalgo99 已提交
37 38 39

* Small display and GUI for simple result visualization.

G
gineshidalgo99 已提交
40
* All the functionality is wrapped into a **simple-to-use OpenPose Wrapper class**.
G
gineshidalgo99 已提交
41

Z
Zhe Cao 已提交
42
The pose estimation work is based on the C++ code from [the ECCV 2016 demo](https://github.com/CMU-Perceptual-Computing-Lab/caffe_rtpose), "Realtime Multiperson Pose Estimation", [Zhe Cao](http://www.andrew.cmu.edu/user/zhecao), [Tomas Simon](http://www.cs.cmu.edu/~tsimon/), [Shih-En Wei](https://scholar.google.com/citations?user=sFQD3k4AAAAJ&hl=en), [Yaser Sheikh](http://www.cs.cmu.edu/~yaser/). The [full project repo](https://github.com/ZheC/Multi-Person-Pose-Estimation) includes Matlab and Python version, as well as training code.
G
gineshidalgo99 已提交
43 44 45 46



## Results
G
gineshidalgo99 已提交
47

G
gineshidalgo99 已提交
48
### Body Estimation
G
gineshidalgo99 已提交
49
<p align="center">
G
gineshidalgo99 已提交
50
    <img src="doc/media/dance.gif", width="480">
G
gineshidalgo99 已提交
51 52
</p>

53 54 55 56 57
### Body + Face Estimation
<p align="center">
    <img src="doc/media/pose_face.gif", width="480">
</p>

G
gineshidalgo99 已提交
58 59
## Coming Soon (But Already Working!)

G
gineshidalgo99 已提交
60
### Body + Hands + Face Estimation
G
gineshidalgo99 已提交
61
<p align="center">
G
gineshidalgo99 已提交
62
    <img src="doc/media/pose_face_hands.gif", width="480">
G
gineshidalgo99 已提交
63 64
</p>

G
gineshidalgo99 已提交
65
### Body + Hands
G
gineshidalgo99 已提交
66
<p align="center">
G
gineshidalgo99 已提交
67
    <img src="doc/media/pose_hands.gif", width="480">
G
gineshidalgo99 已提交
68 69 70 71
</p>


## Contents
G
gineshidalgo99 已提交
72
1. [Installation, Reinstallation and Uninstallation](#installation-reinstallation-and-uninstallation)
73 74
2. [Custom Caffe](#custom-caffe)
3. [Quick Start](#quick-start)
G
gineshidalgo99 已提交
75 76 77
    1. [Demo](#demo)
    2. [OpenPose Wrapper](#openpose-wrapper)
    3. [OpenPose Library](#openpose-library)
78
4. [Output](#output)
G
gineshidalgo99 已提交
79 80
    1. [Output Format](#output-format)
    2. [Reading Saved Results](#reading-saved-results)
81 82 83
5. [OpenPose Benchmark](#openpose-benchmark)
6. [Send Us Your Feedback!](#send-us-your-feedback)
7. [Citation](#citation)
G
Gines Hidalgo 已提交
84
8. [Other Contributors](#other-contributors)
G
gineshidalgo99 已提交
85 86 87



G
gineshidalgo99 已提交
88 89
## Installation, Reinstallation and Uninstallation
You can find the installation, reinstallation and uninstallation steps on: [doc/installation.md](doc/installation.md).
G
gineshidalgo99 已提交
90 91 92



93 94 95 96 97 98 99 100 101 102 103 104 105
## Custom Caffe
We only modified some Caffe compilation flags and minor details. You can use use your own Caffe distribution, these are the files we added and modified:

1. Added files: `install_caffe.sh`; as well as `Makefile.config.Ubuntu14.example`, `Makefile.config.Ubuntu16.example`, `Makefile.config.Ubuntu14_cuda_7.example` and `Makefile.config.Ubuntu16_cuda_7.example` (extracted from `Makefile.config.example`). Basically, you must enable cuDNN.
2. Edited file: Makefile. Search for "# OpenPose: " to find the edited code. We basically added the C++11 flag to avoid issues in some old computers.
3. Optional - deleted Caffe file: `Makefile.config.example`.
4. In order to link it to OpenPose:
    1. Run `make all && make distribute` in your Caffe version.
    2. Open the OpenPose Makefile config file: `./Makefile.config.UbuntuX.example` (where X depends on your OS and CUDA version).
    3. Modify the Caffe folder directory variable (`CAFFE_DIR`) to your custom Caffe `distribute` folder location in the previous OpenPose Makefile config file.



G
gineshidalgo99 已提交
106
## Quick Start
T
tsimon 已提交
107
Most users cases should not need to dive deep into the library, they might just be able to use the [Demo](#demo) or the simple [OpenPose Wrapper](#openpose-wrapper). So you can most probably skip the library details in [OpenPose Library](#openpose-library).
G
gineshidalgo99 已提交
108 109 110 111 112 113



#### Demo
Your case if you just want to process a folder of images or video or webcam and display or save the pose results.

G
gineshidalgo99 已提交
114
Forget about the OpenPose library details and just read the [doc/demo_overview.md](doc/demo_overview.md) 1-page section.
G
gineshidalgo99 已提交
115 116 117 118 119 120 121 122 123

#### OpenPose Wrapper
Your case if you want to read a specific format of image source and/or add a specific post-processing function and/or implement your own display/saving.

(Almost) forget about the library, just take a look to the `Wrapper` tutorial on [examples/tutorial_wrapper/](examples/tutorial_wrapper/).

Note: you should not need to modify OpenPose source code or examples, so that you can directly upgrade the OpenPose library anytime in the future without changing your code. You might create your custom code on [examples/user_code/](examples/user_code/) and compile it by using `make all` in the OpenPose folder.

#### OpenPose Library
T
tsimon 已提交
124
Your case if you want to change internal functions and/or extend its functionality. First, take a look at the [Demo](#demo) and [OpenPose Wrapper](#openpose-wrapper). Second, read the 2 following subsections: OpenPose Overview and Extending Functionality.
G
gineshidalgo99 已提交
125

T
tsimon 已提交
126
1. OpenPose Overview: Learn the basics about the library source code in [doc/library_overview.md](doc/library_overview.md).
G
gineshidalgo99 已提交
127

T
tsimon 已提交
128
2. Extending Functionality: Learn how to extend the library in [doc/library_extend_functionality.md](doc/library_extend_functionality.md).
G
gineshidalgo99 已提交
129

T
tsimon 已提交
130
3. Adding An Extra Module: Learn how to add an extra module in [doc/library_add_new_module.md](doc/library_add_new_module.md).
G
gineshidalgo99 已提交
131 132

#### Doxygen Documentation Autogeneration
T
tsimon 已提交
133
You can generate the documentation by running the following command. The documentation will be generated in `doc/doxygen/html/index.html`. You can simply open it with double click (your default browser should automatically display it).
G
gineshidalgo99 已提交
134
```
G
gineshidalgo99 已提交
135 136
cd doc/
doxygen doc_autogeneration.doxygen
G
gineshidalgo99 已提交
137 138 139 140 141 142
```



## Output
#### Output Format
143
There are 2 alternatives to save the **(x,y,score) body part locations**. The `write_keypoint` flag uses the OpenCV cv::FileStorage default formats (JSON, XML and YML). However, the JSON format is only available after OpenCV 3.0. Hence, `write_keypoint_json` saves the people pose data using a custom JSON writer. For the latter, each JSON file has a `people` array of objects, where each object has an array `body_parts` containing the body part locations and detection confidence formatted as `x1,y1,c1,x2,y2,c2,...`. The coordinates `x` and `y` can be normalized to the range [0,1], [-1,1], [0, source size], [0, output size], etc., depending on the flag `keypoint_scale`. In addition, `c` is the confidence in the range [0,1].
G
gineshidalgo99 已提交
144 145 146 147 148 149 150 151 152 153 154

```
{
    "version":0.1,
    "people":[
        {"body_parts":[1114.15,160.396,0.846207,...]},
        {"body_parts":[...]},
    ]
}
```

T
tsimon 已提交
155
The body part order of the COCO (18 body parts) and MPI (15 body parts) keypoints is described in `POSE_BODY_PART_MAPPING` in [include/openpose/pose/poseParameters.hpp](include/openpose/pose/poseParameters.hpp). E.g., for COCO:
G
gineshidalgo99 已提交
156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179
```
    POSE_COCO_BODY_PARTS {
        {0,  "Nose"},
        {1,  "Neck"},
        {2,  "RShoulder"},
        {3,  "RElbow"},
        {4,  "RWrist"},
        {5,  "LShoulder"},
        {6,  "LElbow"},
        {7,  "LWrist"},
        {8,  "RHip"},
        {9,  "RKnee"},
        {10, "RAnkle"},
        {11, "LHip"},
        {12, "LKnee"},
        {13, "LAnkle"},
        {14, "REye"},
        {15, "LEye"},
        {16, "REar"},
        {17, "LEar"},
        {18, "Bkg"},
    }
```

T
tsimon 已提交
180
For the **heat maps storing format**, instead of individually saving each of the 67 heatmaps (18 body parts + background + 2 x 19 PAFs) individually, the library concatenates them into a huge (width x #heat maps) x (height) matrix, i.e. it concats the heat maps by columns. E.g., columns [0, individual heat map width] contains the first heat map, columns [individual heat map width + 1, 2 * individual heat map width] contains the second heat map, etc. Note that some image viewers are not able to display the resulting images due to the size. However, Chrome and Firefox are able to properly open them.
G
gineshidalgo99 已提交
181

T
tsimon 已提交
182
The saving order is body parts + background + PAFs. Any of them can be disabled with program flags. If background is disabled, then the final image will be body parts + PAFs. The body parts and background follow the order of `POSE_COCO_BODY_PARTS` or `POSE_MPI_BODY_PARTS`, while the PAFs follow the order specified on POSE_BODY_PART_PAIRS in `poseParameters.hpp`. E.g., for COCO:
G
gineshidalgo99 已提交
183 184 185 186
```
    POSE_COCO_PAIRS    {1,2,   1,5,   2,3,   3,4,   5,6,   6,7,   1,8,   8,9,   9,10, 1,11,  11,12, 12,13,  1,0,   0,14, 14,16,  0,15, 15,17,   2,16,  5,17};
```

T
tsimon 已提交
187
Where each index is the key value corresponding to each body part in `POSE_COCO_BODY_PARTS`, e.g., 0 for "Neck", 1 for "RShoulder", etc.
G
gineshidalgo99 已提交
188 189

#### Reading Saved Results
T
tsimon 已提交
190
We use standard formats (JSON, XML, PNG, JPG, ...) to save our results, so there will be lots of frameworks to read them later, but you might also directly use our functions in [include/openpose/filestream.hpp](include/openpose/filestream.hpp). In particular, `loadData` (for JSON, XML and YML files) and `loadImage` (for image formats such as PNG or JPG) to load the data into cv::Mat format.
G
gineshidalgo99 已提交
191

192 193
#### Pose Output Format
<p align="center">
G
gineshidalgo99 已提交
194
    <img src="doc/media/keypoints_pose.png", width="480">
195 196 197 198
</p>

#### Face Output Format
<p align="center">
199
    <img src="doc/media/keypoints_face.png", width="480">
200 201
</p>

G
gineshidalgo99 已提交
202 203


G
gineshidalgo99 已提交
204
## OpenPose Benchmark
T
tsimon 已提交
205
Initial library running time benchmark on [OpenPose Benchmark](https://docs.google.com/spreadsheets/d/1-DynFGvoScvfWDA1P4jDInCkbD4lg0IKOYbXgEq0sK0/edit#gid=0). You can comment in that document with your graphics card model and running time for that model, and we will add your results to the benchmark!
G
gineshidalgo99 已提交
206 207 208



T
tsimon 已提交
209
## Send Us Your Feedback!
G
gineshidalgo99 已提交
210 211 212 213
Our library is open source for research purposes, and we want to continuously improve it! So please, let us know if...

1. ... you find any bug (in functionality or speed).

T
tsimon 已提交
214
2. ... you added some functionality to some class or some new Worker<T> subclass which we might potentially incorporate.
G
gineshidalgo99 已提交
215

T
tsimon 已提交
216
3. ... you know how to speed up or improve any part of the library.
G
gineshidalgo99 已提交
217

T
tsimon 已提交
218
4. ... you have a request about possible functionality.
G
gineshidalgo99 已提交
219 220 221

5. ... etc.

T
tsimon 已提交
222
Just comment on GibHub or make a pull request and we will answer as soon as possible! Send us an email if you use the library to make a cool demo or YouTube video!
G
gineshidalgo99 已提交
223 224 225 226



## Citation
T
tsimon 已提交
227
Please cite the papers in your publications if it helps your research:
G
gineshidalgo99 已提交
228 229 230 231 232 233 234 235

    @inproceedings{cao2017realtime,
      author = {Zhe Cao and Tomas Simon and Shih-En Wei and Yaser Sheikh},
      booktitle = {CVPR},
      title = {Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields},
      year = {2017}
      }

G
gineshidalgo99 已提交
236 237 238 239 240 241 242
    @inproceedings{simon2017hand,
      author = {Tomas Simon and Hanbyul Joo and Iain Matthews and Yaser Sheikh},
      booktitle = {CVPR},
      title = {Hand Keypoint Detection in Single Images using Multiview Bootstrapping},
      year = {2017}
      }

G
gineshidalgo99 已提交
243 244 245 246 247 248
    @inproceedings{wei2016cpm,
      author = {Shih-En Wei and Varun Ramakrishna and Takeo Kanade and Yaser Sheikh},
      booktitle = {CVPR},
      title = {Convolutional pose machines},
      year = {2016}
      }
249 250 251 252



## Other Contributors
G
Gines Hidalgo 已提交
253
We would like to thank the following people who also contributed to OpenPose:
254 255

1. [Helen Medina](https://github.com/helen-medina): For moving OpenPose to Windows (Visual Studio), making it work there and creating the Windows branch.