Motion Guided 3D Pose Estimation from Videos

About

We propose a new loss function, called motion loss, for the problem of monocular 3D Human pose estimation from 2D pose. In computing motion loss, a simple yet effective representation for keypoint motion, called pairwise motion encoding, is introduced. We design a new graph convolutional network architecture, U-shaped GCN (UGCN). It captures both short-term and long-term motion information to fully leverage the additional supervision from the motion loss. We experiment training UGCN with the motion loss on two large scale benchmarks: Human3.6M and MPI-INF-3DHP. Our model surpasses other state-of-the-art models by a large margin. It also demonstrates strong capacity in producing smooth 3D sequences and recovering keypoint motion.

Jingbo Wang, Sijie Yan, Yuanjun Xiong, Dahua Lin• 2020

Related benchmarks

Task	Dataset	Result
3D Human Pose Estimation	MPI-INF-3DHP (test)	PCK86.9	606
3D Human Pose Estimation	Human3.6M (test)	MPJPE (Average)1.4	570
3D Human Pose Estimation	Human3.6M (Protocol #1)	MPJPE (Avg.)25.6	457
3D Human Pose Estimation	Human3.6M (Protocol 2)	Average MPJPE32.7	315
3D Human Pose Estimation	Human3.6M	MPJPE44.5	197
3D Human Pose Estimation	Human3.6M Protocol 1 (test)	Dir. Error (Protocol 1)23	183
3D Human Pose Estimation	Human3.6M (subjects 9 and 11)	Average Error1.4	180
3D Human Pose Estimation	Human3.6M Protocol #2 (test)	Average Error35.5	140
3D Human Pose Estimation	MPI-INF-3DHP	MPJPE68.1	122
3D Human Pose Estimation	Human3.6M (S9, S11)	Average Error (MPJPE Avg)44.5	94

Showing 10 of 15 rows

Other info

Code

Follow for update

@wizwand_team Discord