Articulated Pose Estimation by a Graphical Model with Image Dependent Pairwise Relations
About
We present a method for estimating articulated human pose from a single static image based on a graphical model with novel pairwise relations that make adaptive use of local image measurements. More precisely, we specify a graphical model for human pose which exploits the fact that local image measurements can be used both to detect parts (or joints) and to predict the spatial relationships between them (Image Dependent Pairwise Relations). These spatial relationships are represented by a mixture model. We use Deep Convolutional Neural Networks (DCNNs) to learn conditional probabilities for the presence of parts and their spatial relationships within image patches. Hence, our model combines the representational flexibility of graphical models with the efficiency and statistical power of DCNNs. Our method significantly outperforms state-of-the-art methods on the LSP and FLIC datasets and also performs very well on the Buffy dataset without any training.
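To make the idea of image-dependent pairwise relations concrete, here is a minimal sketch for two connected parts. It is not the authors' implementation: the function names, the quadratic deformation cost, the `weight` parameter, and the brute-force search over candidates are all assumptions made for illustration. The unary scores and the per-patch relation-type log-probabilities stand in for DCNN outputs.

```python
import itertools
import numpy as np

def pairwise_score(loc_i, loc_j, type_logprob, type_offsets, weight=1.0):
    """Best score over mixture types for the relation between parts i and j.

    type_logprob[t]: image-dependent log-probability of relation type t
                     (assumed to be predicted from the patch around part i).
    type_offsets[t]: mean (dx, dy) offset of part j relative to part i
                     for mixture component t (hypothetical values).
    """
    offset = np.asarray(loc_j, float) - np.asarray(loc_i, float)
    # Assumed quadratic penalty for deviating from each type's mean offset.
    deform = -weight * np.sum((offset - np.asarray(type_offsets, float)) ** 2, axis=1)
    return float(np.max(np.asarray(type_logprob, float) + deform))

def best_pose(cands_i, cands_j, unary_i, unary_j, type_logprob_i, type_offsets):
    """Exhaustively pick the best-scoring pair of candidate locations."""
    best_score, best_pair = -np.inf, None
    for a, b in itertools.product(range(len(cands_i)), range(len(cands_j))):
        score = (unary_i[a] + unary_j[b]
                 + pairwise_score(cands_i[a], cands_j[b],
                                  type_logprob_i[a], type_offsets))
        if score > best_score:
            best_score, best_pair = score, (cands_i[a], cands_j[b])
    return best_score, best_pair

# Toy example: one candidate for part i, two for part j.
cands_i = [(0, 0)]
cands_j = [(1, 0), (0, 1)]
unary_i = [0.0]
unary_j = [0.0, 0.0]
type_offsets = [[1, 0], [0, 1]]            # type 0: "to the right", type 1: "below"
type_logprob_i = [np.log([0.9, 0.1])]      # the patch at part i favors type 0
score, pose = best_pose(cands_i, cands_j, unary_i, unary_j,
                        type_logprob_i, type_offsets)
```

In this toy setup both candidates for part j have the same unary score, so the choice is decided entirely by the relation type predicted from the patch at part i. A full model would use a tree over all parts and dynamic programming rather than exhaustive enumeration.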
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Human Pose Estimation | LSP (test) | Head Accuracy | 91.8 | 102 |
| Human Pose Estimation | J-HMDB sub | Head Accuracy | 78.7 | 49 |
| Articulated Human Pose Estimation | LSP (test) | Upper Arms Accuracy | 69.7 | 28 |
| Human Pose Estimation | FLIC (test) | -- | -- | 17 |
| Human Pose Estimation | LSP PC annotations (test) | Torso Accuracy | 0.96 | 16 |
| Multi-person Pose Estimation | MPII Multi-Person Pose subset of 288 images | Head Accuracy | 65 | 13 |
| Human Pose Estimation | LSP original (test) | Head Acc | 91.5 | 9 |
| Human Pose Estimation | Buffy (test) | U.arms Score | 96.8 | 7 |