Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OnlineHMR: Video-based Online World-Grounded Human Mesh Recovery

About

Human mesh recovery (HMR) models 3D human body from monocular videos, with recent works extending it to world-coordinate human trajectory and motion reconstruction. However, most existing methods remain offline, relying on future frames or global optimization, which limits their applicability in interactive feedback and perception-action loop scenarios such as AR/VR and telepresence. To address this, we propose OnlineHMR, a fully online framework that jointly satisfies four essential criteria of online processing, including system-level causality, faithfulness, temporal consistency, and efficiency. Built upon a two-branch architecture, OnlineHMR enables streaming inference via a causal key-value cache design and a curated sliding-window learning strategy. Meanwhile, a human-centric incremental SLAM provides online world-grounded alignment under physically plausible trajectory correction. Experimental results show that our method achieves performance comparable to existing chunk-based approaches on the standard EMDB benchmark and highly dynamic custom videos, while uniquely supporting online processing. Page and code are available at https://tsukasane.github.io/Video-OnlineHMR/.

Yiwen Zhao, Ce Zheng, Yufu Wang, Hsueh-Han Daniel Yang, Liting Wen, Laszlo A. Jeni• 2026

Related benchmarks

TaskDatasetResultRank
3D Human Mesh Recovery3DPW (test)
MPJPE69.9
299
Human global trajectory and motion reconstructionEMDB 2
WA-MPJPE10093.5
17
Camera-coordinate Human Mesh RecoveryEMDB-1 (test)
PA-MPJPE46
13
World-coordinate Human Mesh RecoveryEMDB-2 v1.0 (test)
FPS3.3
8
Showing 4 of 4 rows

Other info

Follow for update