Recurrent Human Pose Estimation
About
We propose a novel ConvNet model for predicting 2D human body poses in an image. The model regresses a heatmap representation for each body keypoint, and is able to learn and represent both the part appearances and the context of the part configuration. We make the following three contributions: (i) an architecture combining a feed forward module with a recurrent module, where the recurrent module can be run iteratively to improve the performance, (ii) the model can be trained end-to-end and from scratch, with auxiliary losses incorporated to improve performance, (iii) we investigate whether keypoint visibility can also be predicted. The model is evaluated on two benchmark datasets. The result is a simple architecture that achieves performance on par with the state of the art, but without the complexity of a graphical model stage (or layers).
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Human Pose Estimation | MPII (test) | Shoulder PCK95 | 314 | |
| Human Pose Estimation | LSP (test) | Head Accuracy95.2 | 102 | |
| Human Pose Estimation | MPII | Head Accuracy97.7 | 32 | |
| Human Pose Estimation | LSP PC annotations (test) | Torso Accuracy0.96 | 16 | |
| Human Pose Estimation | MPII pose 03/15/2018 (full) | Head Accuracy97.7 | 11 | |
| Human Pose Estimation | LSP person-centric (test) | Head Accuracy95.2 | 9 | |
| Human Pose Estimation | LSP extended (test) | -- | 8 | |
| Keypoint Detection | MPII (test) | Mean@0.588.1 | 5 |