Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Motion Guided 3D Pose Estimation from Videos

About

We propose a new loss function, called motion loss, for the problem of monocular 3D Human pose estimation from 2D pose. In computing motion loss, a simple yet effective representation for keypoint motion, called pairwise motion encoding, is introduced. We design a new graph convolutional network architecture, U-shaped GCN (UGCN). It captures both short-term and long-term motion information to fully leverage the additional supervision from the motion loss. We experiment training UGCN with the motion loss on two large scale benchmarks: Human3.6M and MPI-INF-3DHP. Our model surpasses other state-of-the-art models by a large margin. It also demonstrates strong capacity in producing smooth 3D sequences and recovering keypoint motion.

Jingbo Wang, Sijie Yan, Yuanjun Xiong, Dahua Lin• 2020

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationMPI-INF-3DHP (test)
PCK86.9
559
3D Human Pose EstimationHuman3.6M (test)
MPJPE (Average)1.4
547
3D Human Pose EstimationHuman3.6M (Protocol #1)
MPJPE (Avg.)25.6
440
3D Human Pose EstimationHuman3.6M (Protocol 2)
Average MPJPE32.7
315
3D Human Pose EstimationHuman3.6M Protocol 1 (test)
Dir. Error (Protocol 1)23
183
3D Human Pose EstimationHuman3.6M (subjects 9 and 11)
Average Error1.4
180
3D Human Pose EstimationHuman3.6M
MPJPE44.5
160
3D Human Pose EstimationHuman3.6M Protocol #2 (test)
Average Error35.5
140
3D Human Pose EstimationMPI-INF-3DHP
PCK86.9
108
3D Human Pose EstimationHuman3.6M (S9, S11)
Average Error (MPJPE Avg)44.5
94
Showing 10 of 15 rows

Other info

Code

Follow for update