<div align="center">
    <img src=".github/Logo_main_black.png" width="300">
</div>

-----------------

|                  |`Default Config`  |`CUDA (+Python)`  |`CPU (+Python)`   |`OpenCL (+Python)`| `Debug`          | `Unity`          |
| :---:            | :---:            | :---:            | :---:            | :---:            | :---:            | :---:            |
| **`Linux`**   | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/1)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/2)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/3)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/4)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/5)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/6)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) |
| **`MacOS`**   | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/7)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/7)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/8)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/9)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/10)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/11)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) |
| **`Windows`** | [![Status](https://ci.appveyor.com/api/projects/status/5leescxxdwen77kg/branch/master?svg=true)](https://ci.appveyor.com/project/gineshidalgo99/openpose/branch/master) | | | | |
<!--
Note: Currently using [travis-matrix-badges](https://github.com/bjfish/travis-matrix-badges) vs. traditional [![Build Status](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose.svg?branch=master)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose)
-->

[**OpenPose**](https://github.com/CMU-Perceptual-Computing-Lab/openpose) represents the **first real-time multi-person system to jointly detect human body, hand, facial, and foot keypoints (in total 135 keypoints) on single images**.

It is **authored by [Gines Hidalgo](https://www.gineshidalgo.com), [Zhe Cao](https://people.eecs.berkeley.edu/~zhecao), [Tomas Simon](http://www.cs.cmu.edu/~tsimon), [Shih-En Wei](https://scholar.google.com/citations?user=sFQD3k4AAAAJ&hl=en), [Hanbyul Joo](https://jhugestar.github.io), and [Yaser Sheikh](http://www.cs.cmu.edu/~yaser)**. Currently, it is being **maintained by [Gines Hidalgo](https://www.gineshidalgo.com) and [Yaadhav Raaj](https://www.raaj.tech)**. In addition, OpenPose would not be possible without the [**CMU Panoptic Studio dataset**](http://domedb.perception.cs.cmu.edu). We would also like to thank all the people who helped OpenPose in any way. The main contributors are listed in [doc/contributors.md](doc/contributors.md).

<!-- The [original CVPR 2017 repo](https://github.com/ZheC/Multi-Person-Pose-Estimation) includes Matlab and Python versions, as well as the training code. The body pose estimation work is based on [the original ECCV 2016 demo](https://github.com/CMU-Perceptual-Computing-Lab/caffe_rtpose). -->


<p align="center">
    <img src="doc/media/pose_face_hands.gif" width="480">
    <br>
    <sup>Authors <a href="https://www.gineshidalgo.com" target="_blank">Gines Hidalgo</a> (left) and <a href="https://jhugestar.github.io" target="_blank">Hanbyul Joo</a> (right) in front of the <a href="http://domedb.perception.cs.cmu.edu" target="_blank">CMU Panoptic Studio</a></sup>
</p>

## Features
- **Functionality**:
    - **2D real-time multi-person keypoint detection**:
        - 15-, 18-, or **25-keypoint body/foot keypoint estimation**. **Running time is invariant to the number of detected people**.
        - **6-keypoint foot keypoint estimation**. Integrated with the 25-keypoint body/foot keypoint detector.
        - **2x21-keypoint hand keypoint estimation**. Currently, **running time depends** on the **number of detected people**.
        - **70-keypoint face keypoint estimation**. Currently, **running time depends** on the **number of detected people**.
    - **3D real-time single-person keypoint detection**:
        - 3-D triangulation from multiple single views.
        - Synchronization of Flir cameras is handled.
        - Compatible with Flir/Point Grey cameras; C++ demos are provided for adding your custom input.
    - **Calibration toolbox**:
        - Easy estimation of distortion, intrinsic, and extrinsic camera parameters.
    - **Single-person tracking** for further speed up or visual smoothing.
- **Input**: Image, video, webcam, Flir/Point Grey and IP camera. C++ demos are included for adding your custom input.
- **Output**: Basic image + keypoint display/saving (PNG, JPG, AVI, ...), keypoint saving (JSON, XML, YML, ...), and/or keypoints as array class.
- **OS**: Ubuntu (14, 16), Windows (8, 10), Mac OSX, Nvidia TX2.
- **Training and datasets**:
    - [**OpenPose Training**](https://github.com/CMU-Perceptual-Computing-Lab/openpose_train).
    - [**Foot dataset website**](https://cmu-perceptual-computing-lab.github.io/foot_keypoint_dataset/).
- **Others**:
    - Available: command-line demo, C++ wrapper, and C++ API.
    - [**Python API**](doc/modules/python_module.md).
    - [**Unity Plugin**](https://github.com/CMU-Perceptual-Computing-Lab/openpose_unity_plugin).
    - CUDA (Nvidia GPU), OpenCL (AMD GPU), and CPU-only (no GPU) versions.



## Latest Features
- Sep 2019: [**Training code released**](https://github.com/CMU-Perceptual-Computing-Lab/openpose_train)!
- Jan 2019: [**Unity plugin released**](https://github.com/CMU-Perceptual-Computing-Lab/openpose_unity_plugin)!
- Jan 2019: [**Improved Python API**](doc/modules/python_module.md) released, including body, face, hands, and all the functionality of the C++ API!
- Dec 2018: [**Foot dataset released**](https://cmu-perceptual-computing-lab.github.io/foot_keypoint_dataset) and [**new paper released**](https://arxiv.org/abs/1812.08008)!

For further details, check [all released features](doc/released_features.md) and [release notes](doc/release_notes.md).



## Results
### Body and Foot Estimation
<p align="center">
    <img src="doc/media/dance_foot.gif" width="360">
    <br>
    <sup>Testing the <a href="https://www.youtube.com/watch?v=2DiQUX11YaY" target="_blank"><i>Crazy Uptown Funk flashmob in Sydney</i></a> video sequence with OpenPose</sup>
</p>

### 3-D Reconstruction Module (Body, Foot, Face, and Hands)
<p align="center">
    <img src="doc/media/openpose3d.gif" width="360">
    <br>
    <sup>Testing the 3D Reconstruction Module of OpenPose</sup>
</p>

### Body, Foot, Face, and Hands Estimation
<p align="center">
    <img src="doc/media/pose_face.gif" width="360">
    <img src="doc/media/pose_hands.gif" width="360">
    <br>
    <sup>Authors <a href="https://www.gineshidalgo.com" target="_blank">Gines Hidalgo</a> (left image) and <a href="http://www.cs.cmu.edu/~tsimon" target="_blank">Tomas Simon</a> (right image) testing OpenPose</sup>
</p>

### Unity Plugin
<p align="center">
    <img src="doc/media/unity_main.png" width="240">
    <img src="doc/media/unity_body_foot.png" width="240">
    <img src="doc/media/unity_hand_face.png" width="240">
    <br>
    <sup><a href="http://tianyizhao.com" target="_blank">Tianyi Zhao</a> and <a href="https://www.gineshidalgo.com" target="_blank">Gines Hidalgo</a> testing their <a href="https://github.com/CMU-Perceptual-Computing-Lab/openpose_unity_plugin" target="_blank">OpenPose Unity Plugin</a></sup>
</p>

### Runtime Analysis
Inference time comparison between the 3 available pose estimation libraries: OpenPose, Alpha-Pose (fast Pytorch version), and Mask R-CNN:
<p align="center">
    <img src="doc/media/openpose_vs_competition.png" width="360">
</p>
This analysis was performed using the same images for each algorithm and a batch size of 1. Each analysis was repeated 1000 times and then averaged. All runs were performed on a system with an Nvidia 1080 Ti and CUDA 8. The Megvii (Face++) and MSRA GitHub repositories were excluded because they only provide pose estimation results given a cropped person. However, they suffer from the same problem as Alpha-Pose and Mask R-CNN: their runtimes grow linearly with the number of people.
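
The averaging procedure described above (batch size of 1, many repetitions, mean runtime) can be sketched as follows. This is only an illustration, not the script used for the published figure: `average_runtime` and `dummy_infer` are hypothetical names, and the dummy workload stands in for a real pose estimator.

```python
import time
from statistics import mean


def average_runtime(infer, image, repeats=1000):
    """Return the mean wall-clock time (seconds) of infer(image) over `repeats` runs."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        infer(image)  # one image per call, i.e. a batch size of 1
        timings.append(time.perf_counter() - start)
    return mean(timings)


def dummy_infer(image):
    # Hypothetical stand-in for a pose estimator; it only sums the input values.
    return sum(image)


avg = average_runtime(dummy_infer, list(range(1000)), repeats=1000)
print(f"mean runtime: {avg * 1e3:.4f} ms")
```

For GPU workloads, a real benchmark would also need warm-up runs and device synchronization before reading the clock; both are omitted here for brevity.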



## Contents
1. [Features](#features)
2. [Latest Features](#latest-features)
3. [Results](#results)
4. [Installation, Reinstallation and Uninstallation](#installation-reinstallation-and-uninstallation)
5. [Quick Start](#quick-start)
6. [Output](#output)
7. [Speeding Up OpenPose and Benchmark](#speeding-up-openpose-and-benchmark)
8. [Training Code and Foot Dataset](#training-code-and-foot-dataset)
9. [Send Us Failure Cases and Feedback!](#send-us-failure-cases-and-feedback)
10. [Citation](#citation)
11. [License](#license)



## Installation, Reinstallation and Uninstallation
**Windows portable version**: Simply download and use the latest version from the [Releases](https://github.com/CMU-Perceptual-Computing-Lab/openpose/releases) section.

Otherwise, check [doc/installation.md](doc/installation.md) for instructions on how to build OpenPose from source.



## Quick Start
Most users do not need the OpenPose C++/Python API, but can simply use the OpenPose Demo:

- **OpenPose Demo**: To easily process images/video/webcam and display/save the results. See [doc/demo_overview.md](doc/demo_overview.md). E.g., run OpenPose on a video with:
```
# Ubuntu
./build/examples/openpose/openpose.bin --video examples/media/video.avi
:: Windows - Portable Demo
bin\OpenPoseDemo.exe --video examples\media\video.avi
```

- **Calibration toolbox**: To easily calibrate your cameras for 3-D OpenPose or any other stereo vision task. See [doc/modules/calibration_module.md](doc/modules/calibration_module.md).

- **OpenPose C++ API**: If you want to read a specific input, add your custom post-processing function, and/or implement your own display/saving, check the C++ API tutorial in [examples/tutorial_api_cpp/](examples/tutorial_api_cpp/) and [doc/library_introduction.md](doc/library_introduction.md). You can **add your custom code** in [examples/user_code/](examples/user_code/) and quickly compile it with CMake when compiling the whole OpenPose project. See [examples/user_code/README.md](examples/user_code/README.md) for further details.

- **OpenPose Python API**: Analogously to the C++ API, find the tutorial for the Python API in [examples/tutorial_api_python/](examples/tutorial_api_python/).

- **Adding an extra module**: Check [doc/library_add_new_module.md](./doc/library_add_new_module.md).

- **Standalone face or hand detector**:
    - **Face** keypoint detection **without body** keypoint detection: If you want to speed it up (at the cost of reducing the number of detected faces), check the OpenCV-face-detector approach in [doc/standalone_face_or_hand_keypoint_detector.md](doc/standalone_face_or_hand_keypoint_detector.md).
    - **Use your own face/hand detector**: You can use the hand and/or face keypoint detectors with your own face or hand detectors, rather than using the body detector. E.g., this is useful for camera views in which the hands are visible but not the body (the OpenPose detector would fail). See [doc/standalone_face_or_hand_keypoint_detector.md](doc/standalone_face_or_hand_keypoint_detector.md).
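
When the demo is driven from a script, the command line shown in the OpenPose Demo item above can be assembled programmatically. The following is a minimal sketch: `build_demo_command` is a hypothetical helper, not part of OpenPose, and every flag other than `--video` (`--image_dir`, `--face`, `--hand`, `--write_json`, `--display`) is an assumption that should be checked against [doc/demo_overview.md](doc/demo_overview.md) for your version.

```python
def build_demo_command(binary, video=None, image_dir=None,
                       write_json=None, face=False, hand=False, display=True):
    """Return the demo invocation as an argument list suitable for subprocess.run."""
    # Exactly one input source must be given.
    if (video is None) == (image_dir is None):
        raise ValueError("pass exactly one of video or image_dir")
    cmd = [binary]
    if video is not None:
        cmd += ["--video", video]
    else:
        cmd += ["--image_dir", image_dir]  # assumed flag, not shown in this README
    if face:
        cmd.append("--face")               # assumed flag
    if hand:
        cmd.append("--hand")               # assumed flag
    if write_json is not None:
        cmd += ["--write_json", write_json]  # assumed flag
    if not display:
        cmd += ["--display", "0"]          # assumed flag
    return cmd


cmd = build_demo_command("./build/examples/openpose/openpose.bin",
                         video="examples/media/video.avi",
                         hand=True, write_json="output_json/")
print(" ".join(cmd))
# To actually run it from the OpenPose root folder:
#   import subprocess; subprocess.run(cmd, check=True)
```

Building an argument list rather than a single shell string avoids quoting problems with paths that contain spaces.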



## Output
The output (format, keypoint index ordering, etc.) is described in [doc/output.md](doc/output.md).
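
As a rough illustration of consuming the saved JSON keypoints, the sketch below groups a flat keypoint array into `(x, y, confidence)` triples per person. It assumes a layout with a top-level `people` list whose entries hold a flat `pose_keypoints_2d` array; the key names may differ between OpenPose versions, so confirm them in [doc/output.md](doc/output.md). `SAMPLE` is made-up data and `keypoints_per_person` is a hypothetical helper.

```python
import json

# Made-up sample with one person and three keypoints (x, y, confidence each).
SAMPLE = """
{"version": 1.3,
 "people": [{"pose_keypoints_2d": [100.0, 200.0, 0.9,
                                   110.0, 210.0, 0.8,
                                   0.0, 0.0, 0.0]}]}
"""


def keypoints_per_person(json_text, key="pose_keypoints_2d"):
    """Return one list of (x, y, confidence) tuples per detected person."""
    data = json.loads(json_text)
    people = []
    for person in data.get("people", []):
        flat = person.get(key, [])
        usable = len(flat) - len(flat) % 3  # ignore a trailing partial triple
        people.append([tuple(flat[i:i + 3]) for i in range(0, usable, 3)])
    return people


for index, keypoints in enumerate(keypoints_per_person(SAMPLE)):
    detected = [kp for kp in keypoints if kp[2] > 0]
    print(f"person {index}: {len(detected)} of {len(keypoints)} keypoints detected")
```

Here a confidence of 0 is treated as "not detected", which is a convention assumed for this sketch.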



## Speeding Up OpenPose and Benchmark
Check the OpenPose Benchmark, as well as some hints to speed up OpenPose and/or reduce its memory requirements, in [doc/speed_up_openpose.md](doc/speed_up_openpose.md).



## Training Code and Foot Dataset
For training OpenPose, check [github.com/CMU-Perceptual-Computing-Lab/openpose_train](https://github.com/CMU-Perceptual-Computing-Lab/openpose_train).

For the foot dataset, check the [foot dataset website](https://cmu-perceptual-computing-lab.github.io/foot_keypoint_dataset/) and new [OpenPose paper](https://arxiv.org/abs/1812.08008) for more information.



## Send Us Failure Cases and Feedback!
Our library is open source for research purposes, and we want to continuously improve it! So please, let us know if...

1. ... you find videos or images where OpenPose does not seem to work well. Feel free to send them to openposecmu@gmail.com (email only for failure cases!); we will use them to improve the quality of the algorithm!
2. ... you find any bug (in functionality or speed).
3. ... you added some functionality to some class or some new `Worker<T>` subclass which we might potentially incorporate.
4. ... you know how to speed up or improve any part of the library.
5. ... you have a request about possible functionality.
6. ... etc.

Just comment on GitHub or make a pull request and we will answer as soon as possible! Send us an email if you use the library to make a cool demo or YouTube video!



## Citation
Please cite these papers in your publications if they help your research. For standard OpenPose, cite `[Cao et al. 2018]`. If you also use the hand and face keypoint detectors, then cite `[Cao et al. 2018]` and `[Simon et al. 2017]` (the face detector was trained using the same procedure as the hand detector).

    @article{8765346,
      author={Z. {Cao} and G. {Hidalgo Martinez} and T. {Simon} and S. {Wei} and Y. A. {Sheikh}},
      journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
      title={OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields},
      year={2019}, volume={}, number={}, pages={1-1}
    }

    @inproceedings{simon2017hand,
      author = {Tomas Simon and Hanbyul Joo and Iain Matthews and Yaser Sheikh},
      booktitle = {CVPR},
      title = {Hand Keypoint Detection in Single Images using Multiview Bootstrapping},
      year = {2017}
    }

    @inproceedings{cao2017realtime,
      author = {Zhe Cao and Tomas Simon and Shih-En Wei and Yaser Sheikh},
      booktitle = {CVPR},
      title = {Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields},
      year = {2017}
    }

    @inproceedings{wei2016cpm,
      author = {Shih-En Wei and Varun Ramakrishna and Takeo Kanade and Yaser Sheikh},
      booktitle = {CVPR},
      title = {Convolutional pose machines},
      year = {2016}
    }

Links to the papers:

- OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields:
    - [IEEE TPAMI](https://ieeexplore.ieee.org/document/8765346)
    - [ArXiv](https://arxiv.org/abs/1812.08008)
- [Hand Keypoint Detection in Single Images using Multiview Bootstrapping](https://arxiv.org/abs/1704.07809)
- [Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields](https://arxiv.org/abs/1611.08050)
- [Convolutional Pose Machines](https://arxiv.org/abs/1602.00134)



## License
OpenPose is freely available for non-commercial use, and may be redistributed under these conditions. Please see the [license](LICENSE) for further details. Interested in a commercial license? Check this [FlintBox link](https://flintbox.com/public/project/47343/). For commercial queries, use the `Directly Contact Organization` section of the [FlintBox link](https://flintbox.com/public/project/47343/) and also send a copy of that message to [Yaser Sheikh](mailto:yaser@cs.cmu.edu).