Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation
About
This paper proposes a new hybrid architecture that consists of a deep Convolutional Network and a Markov Random Field. We show how this architecture is successfully applied to the challenging problem of articulated human pose estimation in monocular images. The architecture can exploit structural domain constraints such as geometric relationships between body joint locations. We show that joint training of these two model paradigms improves performance and allows us to significantly outperform existing state-of-the-art techniques.
Jonathan Tompson, Arjun Jain, Yann LeCun, Christoph Bregler• 2014
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Human Pose Estimation | MPII (test) | Shoulder PCK91.9 | 314 | |
| Human Pose Estimation | LSP (test) | Head Accuracy90.6 | 102 | |
| Human Pose Estimation | MPII | Head Accuracy96.1 | 32 | |
| Articulated Human Pose Estimation | LSP (test) | Upper Arms Accuracy63 | 28 | |
| Human Pose Estimation | FLIC (test) | Elbow Acc50.2 | 17 | |
| Human Pose Estimation | LSP PC annotations (test) | Torso Accuracy0.903 | 16 | |
| Human Pose Estimation | FLIC | Elbow Accuracy93.1 | 4 |
Showing 7 of 7 rows