Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Long-term Human Motion Prediction with Scene Context

About

Human movement is goal-directed and influenced by the spatial layout of the objects in the scene. To plan future human motion, it is crucial to perceive the environment -- imagine how hard it is to navigate a new room with lights off. Existing works on predicting human motion do not pay attention to the scene context and thus struggle in long-term prediction. In this work, we propose a novel three-stage framework that exploits scene context to tackle this task. Given a single scene image and 2D pose histories, our method first samples multiple human motion goals, then plans 3D human paths towards each goal, and finally predicts 3D human pose sequences following each path. For stable training and rigorous evaluation, we contribute a diverse synthetic dataset with clean annotations. In both synthetic and real datasets, our method shows consistent quantitative and qualitative improvements over existing methods.

Zhe Cao, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, Jitendra Malik• 2020

Related benchmarks

TaskDatasetResultRank
3D Path PredictionPROX
Path Error (0.5s)189
5
3D Path PredictionGTA-IM (test)
3D Path Error (0.5s, mm)104
5
3D Pose PredictionPROX
3D Pose Error (0.5s)190
5
3D Pose PredictionGTA-IM (test)
3D Pose Error (0.5s) (mm)91
5
Showing 4 of 4 rows

Other info

Follow for update