README.md 13.7 KB
Newer Older
G
gineshidalgo99 已提交
1
<div align="center">
G
gineshidalgo99 已提交
2
    <img src=".github/Logo_main_black.png", width="300">
G
gineshidalgo99 已提交
3
</div>
G
gineshidalgo99 已提交
4

G
gineshidalgo99 已提交
5
-----------------
G
gineshidalgo99 已提交
6

G
gineshidalgo99 已提交
7 8 9 10 11
|                 | `Python (CUDA GPU)` | `Python (CPU)` | `CUDA GPU` | `CPU`  | `Debug mode` |
| :---:           | :---:               | :---:          | :---:      |:---:   | :---:        |
| **`Linux`**     | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/1)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/2)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/3)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/4)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/5)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) |
| **`MacOS`**     | | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/6)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/7)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) | [![Status](https://travis-matrix-badges.herokuapp.com/repos/CMU-Perceptual-Computing-Lab/openpose/branches/master/8)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose) |
<!-- | **`Windows`**   | | | | | | -->
G
gineshidalgo99 已提交
12 13 14
<!--
Note: Currently using [travis-matrix-badges](https://github.com/bjfish/travis-matrix-badges) vs. traditional [![Build Status](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose.svg?branch=master)](https://travis-ci.org/CMU-Perceptual-Computing-Lab/openpose)
-->
B
Bikramjot Hanzra 已提交
15

G
gineshidalgo99 已提交
16
[OpenPose](https://github.com/CMU-Perceptual-Computing-Lab/openpose) represents the **first real-time multi-person system to jointly detect human body, hand, facial, and foot keypoints (in total 135 keypoints) on single images**.
17

G
Gines 已提交
18 19 20
<p align="center">
    <img src="doc/media/pose_face_hands.gif", width="480">
</p>
21

22 23 24
## Features
- **Functionality**:
    - **2D real-time multi-person keypoint detection**:
G
gineshidalgo99 已提交
25
        - 15 or 18 or **25-keypoint body/foot keypoint estimation**. **Running time invariant to number of detected people**.
26 27
        - **2x21-keypoint hand keypoint estimation**. Currently, **running time depends** on **number of detected people**.
        - **70-keypoint face keypoint estimation**. Currently, **running time depends** on **number of detected people**.
28
    - **3D real-time single-person keypoint detection**:
29 30 31 32 33
        - 3-D triangulation from multiple single views.
        - Synchronization of Flir cameras handled.
        - Compatible with Flir/Point Grey cameras, but provided C++ demos to add your custom input.
    - **Calibration toolbox**:
        - Easy estimation of distortion, intrinsic, and extrinsic camera parameters.
G
gineshidalgo99 已提交
34
    - **Single-person tracking** for further speed up or visual smoothing.
35
- **Input**: Image, video, webcam, Flir/Point Grey and IP camera. Included C++ demos to add your custom input.
G
gineshidalgo99 已提交
36
- **Output**: Basic image + keypoint display/saving (PNG, JPG, AVI, ...), keypoint saving (JSON, XML, YML, ...), and/or keypoints as array class.
37
- **OS**: Ubuntu (14, 16), Windows (8, 10), Mac OSX, Nvidia TX2.
38 39
- **Others**:
    - Available: command-line demo, C++ wrapper, and C++ API.
40 41
    - [**Python API**](doc/modules/python_module.md).
    - [**Unity Plugin**](https://github.com/CMU-Perceptual-Computing-Lab/openpose_unity_plugin).
42
    - CUDA (Nvidia GPU), OpenCL (AMD GPU), and CPU versions.
G
gineshidalgo99 已提交
43

G
gineshidalgo99 已提交
44 45


46
## Latest Features
47 48 49
- Jan 2018: [**Unity plugin released**](https://github.com/CMU-Perceptual-Computing-Lab/openpose_unity_plugin)!
- Jan 2018: [**Improved Python API**](doc/modules/python_module.md) released! Including body, face, hands, and all the functionality of the C++ API!
- Dec 2018: [**Foot dataset**](https://cmu-perceptual-computing-lab.github.io/foot_keypoint_dataset) and [**new paper released**](https://arxiv.org/abs/1812.08008)!
G
gineshidalgo99 已提交
50
- Sep 2018: [**Experimental single-person tracker**](doc/quick_start.md#tracking) for further speed up or visual smoothing!
G
gineshidalgo99 已提交
51
- Jun 2018: [**Combined body-foot model released! 40% faster and 5% more accurate**](doc/installation.md)!
52
- Jun 2018: [**OpenCL/AMD graphic card version**](doc/installation.md) released!
G
gineshidalgo99 已提交
53
- Jun 2018: [**Calibration toolbox**](doc/modules/calibration_module.md) released!
Z
Zhe Cao 已提交
54

55
For further details, check [all released features](doc/released_features.md) and [release notes](doc/release_notes.md).
G
gineshidalgo99 已提交
56

G
gineshidalgo99 已提交
57 58


G
gineshidalgo99 已提交
59
## Results
60
### Body and Foot Estimation
61
<p align="center">
G
gineshidalgo99 已提交
62
    <img src="doc/media/dance_foot.gif", width="360">
63 64
</p>

65
### 3-D Reconstruction Module (Body, Foot, Face, and Hands)
G
gineshidalgo99 已提交
66
<p align="center">
G
gineshidalgo99 已提交
67
    <img src="doc/media/openpose3d.gif", width="360">
G
gineshidalgo99 已提交
68
</p>
G
gineshidalgo99 已提交
69

70
### Body, Foot, Face, and Hands Estimation
G
gineshidalgo99 已提交
71
<p align="center">
72
    <img src="doc/media/pose_face.gif", width="360">
G
gineshidalgo99 已提交
73 74
    <img src="doc/media/pose_hands.gif", width="360">
</p>
G
Gines 已提交
75

76
### Unity Plugin
G
gineshidalgo99 已提交
77
<p align="center">
78 79 80
    <img src="doc/media/unity_main.png", width="240">
    <img src="doc/media/unity_body_foot.png", width="240">
    <img src="doc/media/unity_hand_face.png", width="240">
G
gineshidalgo99 已提交
81 82
</p>

G
gineshidalgo99 已提交
83
### Runtime Analysis
84
Inference time comparison between the 3 available pose estimation libraries: OpenPose, Alpha-Pose (fast Pytorch version), and Mask R-CNN:
G
gineshidalgo99 已提交
85
<p align="center">
G
gineshidalgo99 已提交
86
    <img src="doc/media/openpose_vs_competition.png", width="360">
G
gineshidalgo99 已提交
87 88 89
</p>
This analysis was performed using the same images for each algorithm and a batch size of 1. Each analysis was repeated 1000 times and then averaged. This was all performed on a system with a Nvidia 1080 Ti and CUDA 8. Megvii (Face++) and MSRA GitHub repositories were excluded because they only provide pose estimation results given a cropped person. However, they suffer the same problem than Alpha-Pose and Mask R-CNN, their runtimes grow linearly with the number of people.

G
Gines 已提交
90 91


G
gineshidalgo99 已提交
92 93 94 95 96 97 98 99
## Contents
1. [Features](#features)
2. [Latest Features](#latest-features)
3. [Results](#results)
4. [Installation, Reinstallation and Uninstallation](#installation-reinstallation-and-uninstallation)
5. [Quick Start](#quick-start)
6. [Output](#output)
7. [Speeding Up OpenPose and Benchmark](#speeding-up-openpose-and-benchmark)
100 101 102 103 104
8. [Foot Dataset](#foot-dataset)
9. [Send Us Failure Cases and Feedback!](#send-us-failure-cases-and-feedback)
10. [Authors and Contributors](#authors-and-contributors)
11. [Citation](#citation)
12. [License](#license)
G
gineshidalgo99 已提交
105 106 107



G
gineshidalgo99 已提交
108
## Installation, Reinstallation and Uninstallation
109 110 111
**Windows portable version**: Simply download and use the latest version from the [Releases](https://github.com/CMU-Perceptual-Computing-Lab/openpose/releases) section.

Otherwise, check [doc/installation.md](doc/installation.md) for instructions on how to build OpenPose from source.
G
gineshidalgo99 已提交
112 113 114 115



## Quick Start
116
Most users do not need the OpenPose C++/Python API, but can simply use the OpenPose Demo:
G
gineshidalgo99 已提交
117

G
gineshidalgo99 已提交
118
- **OpenPose Demo**: To easily process images/video/webcam and display/save the results. See [doc/demo_overview.md](doc/demo_overview.md). E.g., run OpenPose in a video with:
G
gineshidalgo99 已提交
119 120 121 122 123 124
```
# Ubuntu
./build/examples/openpose/openpose.bin --video examples/media/video.avi
:: Windows - Portable Demo
bin\OpenPoseDemo.exe --video examples\media\video.avi
```
125

G
gineshidalgo99 已提交
126
- **Calibration toolbox**: To easily calibrate your cameras for 3-D OpenPose or any other stereo vision task. See [doc/modules/calibration_module.md](doc/modules/calibration_module.md).
127

G
gineshidalgo99 已提交
128
- **OpenPose C++ API**: If you want to read a specific input, and/or add your custom post-processing function, and/or implement your own display/saving, check the C++ API tutorial on [examples/tutorial_api_cpp/](examples/tutorial_api_cpp/) and [doc/library_introduction.md](doc/library_introduction.md). You can create your custom code on [examples/user_code/](examples/user_code/) and quickly compile it with CMake when compiling the whole OpenPose project. Quickly **add your custom code**: See [examples/user_code/README.md](examples/user_code/README.md) for further details.
G
gineshidalgo99 已提交
129

G
gineshidalgo99 已提交
130
- **OpenPose Python API**: Analogously to the C++ API, find the tutorial for the Python API on [examples/tutorial_api_python/](examples/tutorial_api_python/).
131

132
- **Adding an extra module**: Check [doc/library_add_new_module.md](./doc/library_add_new_module.md).
G
Gines 已提交
133 134 135

- **Standalone face or hand detector**:
    - **Face** keypoint detection **without body** keypoint detection: If you want to speed it up (but also reduce amount of detected faces), check the OpenCV-face-detector approach in [doc/standalone_face_or_hand_keypoint_detector.md](doc/standalone_face_or_hand_keypoint_detector.md).
G
gineshidalgo99 已提交
136
    - **Use your own face/hand detector**: You can use the hand and/or face keypoint detectors with your own face or hand detectors, rather than using the body detector. E.g., useful for camera views at which the hands are visible but not the body (OpenPose detector would fail). See [doc/standalone_face_or_hand_keypoint_detector.md](doc/standalone_face_or_hand_keypoint_detector.md).
137 138


G
gineshidalgo99 已提交
139 140 141

## Output
Output (format, keypoint index ordering, etc.) in [doc/output.md](doc/output.md).
G
gineshidalgo99 已提交
142 143 144



G
gineshidalgo99 已提交
145
## Speeding Up OpenPose and Benchmark
G
gineshidalgo99 已提交
146
Check the OpenPose Benchmark as well as some hints to speed up and/or reduce the memory requirements for OpenPose on [doc/speed_up_preserving_accuracy.md](doc/speed_up_preserving_accuracy.md).
G
Gines 已提交
147 148 149



150
## Foot Dataset
G
gineshidalgo99 已提交
151
Check the [foot dataset website](https://cmu-perceptual-computing-lab.github.io/foot_keypoint_dataset/) and new [OpenPose paper](https://arxiv.org/abs/1812.08008) for more information.
152 153 154



G
gineshidalgo99 已提交
155
## Send Us Failure Cases and Feedback!
G
gineshidalgo99 已提交
156 157
Our library is open source for research purposes, and we want to continuously improve it! So please, let us know if...

G
gineshidalgo99 已提交
158 159 160 161 162 163
1. ... you find videos or images where OpenPose does not seems to work well. Feel free to send them to openposecmu@gmail.com (email only for failure cases!), we will use them to improve the quality of the algorithm!
2. ... you find any bug (in functionality or speed).
3. ... you added some functionality to some class or some new Worker<T> subclass which we might potentially incorporate.
4. ... you know how to speed up or improve any part of the library.
5. ... you have a request about possible functionality.
6. ... etc.
G
gineshidalgo99 已提交
164

G
gineshidalgo99 已提交
165
Just comment on GitHub or make a pull request and we will answer as soon as possible! Send us an email if you use the library to make a cool demo or YouTube video!
G
gineshidalgo99 已提交
166 167 168



G
gineshidalgo99 已提交
169
## Authors and Contributors
170
OpenPose is authored by [Gines Hidalgo](https://www.gineshidalgo.com/), [Zhe Cao](http://www.andrew.cmu.edu/user/zhecao), [Tomas Simon](http://www.cs.cmu.edu/~tsimon), [Shih-En Wei](https://scholar.google.com/citations?user=sFQD3k4AAAAJ&hl=en), [Hanbyul Joo](http://www.cs.cmu.edu/~hanbyulj), and [Yaser Sheikh](http://www.cs.cmu.edu/~yaser). Currently, it is being maintained by [Gines Hidalgo](https://www.gineshidalgo.com/) and [Yaadhav Raaj](https://www.linkedin.com/in/yaadhavraaj). The [original CVPR 2017 repo](https://github.com/ZheC/Multi-Person-Pose-Estimation) includes Matlab and Python versions, as well as the training code. The body pose estimation work is based on [the original ECCV 2016 demo](https://github.com/CMU-Perceptual-Computing-Lab/caffe_rtpose).
G
gineshidalgo99 已提交
171

G
gineshidalgo99 已提交
172 173 174
In addition, OpenPose would not be possible without the [CMU Panoptic Studio dataset](http://domedb.perception.cs.cmu.edu/).

We would also like to thank all the people who helped OpenPose in any way. The main contributors are listed in [doc/contributors.md](doc/contributors.md).
G
gineshidalgo99 已提交
175 176 177 178



## Citation
H
Harshal Mittal 已提交
179
Please cite these papers in your publications if it helps your research (the face keypoint detector was trained using the procedure described in [Simon et al. 2017] for hands):
G
gineshidalgo99 已提交
180

181 182
    @inproceedings{cao2018openpose,
      author = {Zhe Cao and Gines Hidalgo and Tomas Simon and Shih-En Wei and Yaser Sheikh},
G
gineshidalgo99 已提交
183
      booktitle = {arXiv preprint arXiv:1812.08008},
184 185 186 187
      title = {Open{P}ose: realtime multi-person 2{D} pose estimation using {P}art {A}ffinity {F}ields},
      year = {2018}
    }

G
gineshidalgo99 已提交
188 189 190 191 192
    @inproceedings{cao2017realtime,
      author = {Zhe Cao and Tomas Simon and Shih-En Wei and Yaser Sheikh},
      booktitle = {CVPR},
      title = {Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields},
      year = {2017}
193
    }
G
gineshidalgo99 已提交
194

G
gineshidalgo99 已提交
195 196 197 198 199
    @inproceedings{simon2017hand,
      author = {Tomas Simon and Hanbyul Joo and Iain Matthews and Yaser Sheikh},
      booktitle = {CVPR},
      title = {Hand Keypoint Detection in Single Images using Multiview Bootstrapping},
      year = {2017}
200
    }
G
gineshidalgo99 已提交
201

G
gineshidalgo99 已提交
202 203 204 205 206
    @inproceedings{wei2016cpm,
      author = {Shih-En Wei and Varun Ramakrishna and Takeo Kanade and Yaser Sheikh},
      booktitle = {CVPR},
      title = {Convolutional pose machines},
      year = {2016}
207
    }
208

H
Harshal Mittal 已提交
209 210
Links to the papers:

G
gineshidalgo99 已提交
211
- [OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields](https://arxiv.org/abs/1812.08008)
H
Harshal Mittal 已提交
212 213 214 215
- [Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields](https://arxiv.org/abs/1611.08050)
- [Hand Keypoint Detection in Single Images using Multiview Bootstrapping](https://arxiv.org/abs/1704.07809)
- [Convolutional Pose Machines](https://arxiv.org/abs/1602.00134)

216 217


G
gineshidalgo99 已提交
218
## License
G
gineshidalgo99 已提交
219
OpenPose is freely available for free non-commercial use, and may be redistributed under these conditions. Please, see the [license](LICENSE) for further details. Interested in a commercial license? Check this [FlintBox link](https://flintbox.com/public/project/47343/). For commercial queries, use the `Directly Contact Organization` section from the [FlintBox link](https://flintbox.com/public/project/47343/) and also send a copy of that message to [Yaser Sheikh](http://www.cs.cmu.edu/~yaser/).