Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

3D Human Motion Estimation via Motion Compression and Refinement

About

We develop a technique for generating smooth and accurate 3D human pose and motion estimates from RGB video sequences. Our method, which we call Motion Estimation via Variational Autoencoder (MEVA), decomposes a temporal sequence of human motion into a smooth motion representation using auto-encoder-based motion compression and a residual representation learned through motion refinement. This two-step encoding of human motion captures human motion in two stages: a general human motion estimation step that captures the coarse overall motion, and a residual estimation that adds back person-specific motion details. Experiments show that our method produces both smooth and accurate 3D human pose and motion estimates.

Zhengyi Luo, S. Alireza Golestaneh, Kris M. Kitani• 2020

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationMPI-INF-3DHP (test)--
559
3D Human Pose EstimationHuman3.6M (test)--
547
3D Human Pose Estimation3DPW (test)
PA-MPJPE54.7
505
3D Human Pose and Shape Estimation3DPW (test)
MPJPE-PA54.7
158
3D Human Mesh RecoveryHuman3.6M (test)
PA-MPJPE53.2
120
3D Human Pose and Shape EstimationHuman3.6M (test)
PA-MPJPE53.2
119
3D Human Pose and Shape Estimation3DPW
PA-MPJPE54.7
74
3D Human Pose and Shape EstimationMPI-INF-3DHP (test)
MPJPE96.4
46
3D Human Mesh Estimation3DPW (test)
PA-MPJPE54.7
44
3D Human Pose and Mesh Recovery3DPW
PA-MPJPE54.7
40
Showing 10 of 19 rows

Other info

Follow for update