Motion Guided 3D Pose Estimation from Videos
About
We propose a new loss function, called motion loss, for the problem of monocular 3D Human pose estimation from 2D pose. In computing motion loss, a simple yet effective representation for keypoint motion, called pairwise motion encoding, is introduced. We design a new graph convolutional network architecture, U-shaped GCN (UGCN). It captures both short-term and long-term motion information to fully leverage the additional supervision from the motion loss. We experiment training UGCN with the motion loss on two large scale benchmarks: Human3.6M and MPI-INF-3DHP. Our model surpasses other state-of-the-art models by a large margin. It also demonstrates strong capacity in producing smooth 3D sequences and recovering keypoint motion.
Jingbo Wang, Sijie Yan, Yuanjun Xiong, Dahua Lin• 2020
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Human Pose Estimation | MPI-INF-3DHP (test) | PCK86.9 | 559 | |
| 3D Human Pose Estimation | Human3.6M (test) | MPJPE (Average)1.4 | 547 | |
| 3D Human Pose Estimation | Human3.6M (Protocol #1) | MPJPE (Avg.)25.6 | 440 | |
| 3D Human Pose Estimation | Human3.6M (Protocol 2) | Average MPJPE32.7 | 315 | |
| 3D Human Pose Estimation | Human3.6M Protocol 1 (test) | Dir. Error (Protocol 1)23 | 183 | |
| 3D Human Pose Estimation | Human3.6M (subjects 9 and 11) | Average Error1.4 | 180 | |
| 3D Human Pose Estimation | Human3.6M | MPJPE44.5 | 160 | |
| 3D Human Pose Estimation | Human3.6M Protocol #2 (test) | Average Error35.5 | 140 | |
| 3D Human Pose Estimation | MPI-INF-3DHP | PCK86.9 | 108 | |
| 3D Human Pose Estimation | Human3.6M (S9, S11) | Average Error (MPJPE Avg)44.5 | 94 |
Showing 10 of 15 rows