Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation

About

This paper proposes a new hybrid architecture that consists of a deep Convolutional Network and a Markov Random Field. We show how this architecture is successfully applied to the challenging problem of articulated human pose estimation in monocular images. The architecture can exploit structural domain constraints such as geometric relationships between body joint locations. We show that joint training of these two model paradigms improves performance and allows us to significantly outperform existing state-of-the-art techniques.

Jonathan Tompson, Arjun Jain, Yann LeCun, Christoph Bregler• 2014

Related benchmarks

Task	Dataset	Result
Human Pose Estimation	MPII (test)	Shoulder PCK91.9	350
Human Pose Estimation	LSP (test)	Head Accuracy90.6	102
Human Pose Estimation	MPII	Head Accuracy96.1	32
Articulated Human Pose Estimation	LSP (test)	Upper Arms Accuracy63	28
Human Pose Estimation	FLIC (test)	Elbow Acc50.2	17
Human Pose Estimation	LSP PC annotations (test)	Torso Accuracy0.903	16
Human Pose Estimation	FLIC	Elbow Accuracy93.1	4

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord