Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild

About

Human pose estimation from single images is a challenging problem in computer vision that requires large amounts of labeled training data to be solved accurately. Unfortunately, for many human activities (\eg outdoor sports) such training data does not exist and is hard or even impossible to acquire with traditional motion capture systems. We propose a self-supervised approach that learns a single image 3D pose estimator from unlabeled multi-view data. To this end, we exploit multi-view consistency constraints to disentangle the observed 2D pose into the underlying 3D pose and camera rotation. In contrast to most existing methods, we do not require calibrated cameras and can therefore learn from moving cameras. Nevertheless, in the case of a static camera setup, we present an optional extension to include constant relative camera rotations over multiple views into our framework. Key to the success are new, unbiased reconstruction objectives that mix information across views and training samples. The proposed approach is evaluated on two benchmark datasets (Human3.6M and MPII-INF-3DHP) and on the in-the-wild SkiPose dataset.

Bastian Wandt, Marco Rudolph, Petrissa Zell, Helge Rhodin, Bodo Rosenhahn• 2020

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationMPI-INF-3DHP (test)--
559
3D Human Pose EstimationHuman3.6M (test)--
547
3D Human Pose EstimationHuman3.6M (subjects 9 and 11)--
180
3D Human Pose EstimationH36M (S9 and S11)
PA-MPJPE (P2)53
24
3D Human Pose EstimationSkiPose
PMPJPE89.6
12
3D Pose EstimationSkiPose (test)
PMPJPE90
12
3D Whole-body Pose EstimationH3WB (test)
MPJPE (All)0.117
9
3D Human Pose EstimationSkiPose-PTZ (test)
NMPJPE (mm)128
8
3D Human Pose EstimationSki-Pose (cross-dataset evaluation)
PA-MPJPE89.6
7
3D Human Pose EstimationSki
PA-MPJPE89.6
5
Showing 10 of 10 rows

Other info

Follow for update