Human Pose Estimation with Iterative Error Feedback

About

Hierarchical feature extractors such as Convolutional Networks (ConvNets) have achieved impressive performance on a variety of classification tasks using purely feedforward processing. Feedforward architectures can learn rich representations of the input space but do not explicitly model dependencies in the output spaces, that are quite structured for tasks such as articulated human pose estimation or object segmentation. Here we propose a framework that expands the expressive power of hierarchical feature extractors to encompass both input and output spaces, by introducing top-down feedback. Instead of directly predicting the outputs in one go, we use a self-correcting model that progressively changes an initial solution by feeding back error predictions, in a process we call Iterative Error Feedback (IEF). IEF shows excellent performance on the task of articulated pose estimation in the challenging MPII and LSP benchmarks, matching the state-of-the-art without requiring ground truth scale annotation.

Joao Carreira, Pulkit Agrawal, Katerina Fragkiadaki, Jitendra Malik• 2015

Related benchmarks

Task	Dataset	Result
Human Pose Estimation	MPII (test)	Shoulder PCK91.7	350
Human Pose Estimation	LSP (test)	Head Accuracy90.5	102
Human Pose Estimation	MPII	Head Accuracy96.3	32
Articulated Human Pose Estimation	LSP (test)	Upper Arms Accuracy66.7	28
3D Human Pose Estimation	ITOP top-view	Head Accuracy83.8	23
3D Human Pose Estimation	ITOP front-view	Head Joint Accuracy96.2	22
Human Pose Estimation	LSP PC annotations (test)	Torso Accuracy0.953	16
3D Human Pose Estimation	ITOP front-view 1.0	Head Accuracy96.2	4
Body Part Detection	Viewpoint Transfer Task Dataset (test)	Head Detection Rate47.9	4
3D Human Pose Estimation	ITOP top-view 1.0	Head83.8	4

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord