Estimating Body and Hand Motion in an Ego-sensed World
About
We present EgoAllo, a system for human motion estimation from a head-mounted device. Using only egocentric SLAM poses and images, EgoAllo guides sampling from a conditional diffusion model to estimate 3D body pose, height, and hand parameters that capture a device wearer's actions in the allocentric coordinate frame of the scene. To achieve this, our key insight is in representation: we propose spatial and temporal invariance criteria for improving model performance, from which we derive a head motion conditioning parameterization that improves estimation by up to 18%. We also show how the bodies estimated by our system can improve hand estimation: the resulting kinematic and temporal constraints can reduce world-frame errors in single-frame estimates by 40%. Project page: https://egoallo.github.io/
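The exact conditioning parameterization is defined in the paper; as a minimal sketch of what the spatial and temporal invariance criteria mean in practice, the snippet below computes head-motion features from world-frame SE(3) head poses that are unchanged under a global yaw rotation and horizontal translation of the scene. All function names and the z-up convention are assumptions for illustration, not EgoAllo's actual API.

```python
import numpy as np

def rotz(a):
    """Rotation about the world z (up) axis."""
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

def se3(R, p):
    """Assemble a 4x4 rigid transform from rotation R and translation p."""
    T = np.eye(4)
    T[:3, :3] = R
    T[:3, 3] = p
    return T

def head_motion_features(poses):
    """poses: (T, 4, 4) world-frame head poses, z-up convention (assumed).

    Returns (rel, heights):
      rel     -- (T-1, 4, 4) relative transforms T_t^{-1} T_{t+1}; invariant
                 to any global rigid transform of the trajectory (a temporal
                 invariance criterion).
      heights -- (T,) head height above the floor; invariant to horizontal
                 translation and yaw of the world frame (a spatial
                 invariance criterion).
    """
    rel = np.linalg.inv(poses[:-1]) @ poses[1:]
    heights = poses[:, 2, 3]
    return rel, heights

# Build a toy 5-frame head trajectory.
poses = np.stack(
    [se3(rotz(0.3 * t), [0.5 * t, np.sin(t), 1.6 + 0.05 * t]) for t in range(5)]
)

# Re-express the same motion in a different world frame (yaw + horizontal shift):
# the features do not change.
G = se3(rotz(1.2), [3.0, -2.0, 0.0])
rel_a, h_a = head_motion_features(poses)
rel_b, h_b = head_motion_features(G @ poses)
assert np.allclose(rel_a, rel_b) and np.allclose(h_a, h_b)
```

Conditioning a diffusion model on such frame-invariant features, rather than on raw world-frame poses, means the model does not have to learn that two trajectories differing only by a global yaw or horizontal offset describe the same motion.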
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Hand Pose Estimation | EgoExo4D 1.0 (test) | PA-MPJPE (mm) | 14.38 | 13 |
| Body Estimation | AMASS (test) | MPJPE (mm) | 119.7 | 8 |
| Body Estimation | RICH | MPJPE (mm) | 176.2 | 8 |
| Body Estimation | Aria Digital Twins | MPJPE (mm) | 155.1 | 8 |
| Egocentric Pose Estimation | HMD Setting 90° FoV | MPJPE (mm) | 113.8 | 6 |
| Egocentric Pose Estimation | HMD Setting 180° FoV | MPJPE (mm) | 90.67 | 6 |
| Egocentric Motion Reconstruction | AMASS | MPJPE (mm) | 99.32 | 4 |
| Motion Reconstruction | EgoExo4D real-world | Jerk | 0.065 | 4 |