HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations
About
It is especially challenging to achieve real-time human motion tracking on a standalone VR Head-Mounted Display (HMD) such as Meta Quest and PICO. In this paper, we propose HMD-Poser, the first unified approach to recover full-body motions using scalable sparse observations from HMD and body-worn IMUs. In particular, it can support a variety of input scenarios, such as HMD, HMD+2IMUs, HMD+3IMUs, etc. The scalability of inputs may accommodate users' choices for both high tracking accuracy and easy-to-wear. A lightweight temporal-spatial feature learning network is proposed in HMD-Poser to guarantee that the model runs in real-time on HMDs. Furthermore, HMD-Poser presents online body shape estimation to improve the position accuracy of body joints. Extensive experimental results on the challenging AMASS dataset show that HMD-Poser achieves new state-of-the-art results in both accuracy and real-time performance. We also build a new free-dancing motion dataset to evaluate HMD-Poser's on-device performance and investigate the performance gap between synthetic data and real-captured sensor data. Finally, we demonstrate our HMD-Poser with a real-time Avatar-driving application on a commercial HMD. Our code and free-dancing motion dataset are available https://pico-ai-team.github.io/hmd-poser
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Motion generation from hand-tracking signal | GORP (Real HT) | MPJPE6.02 | 16 | |
| Full-body motion generation | GORP (Real MC) | MPJPE6.84 | 16 | |
| Full-body motion estimation (Hand Tracking) | A-P2 (test) | MPJRE5.6 | 8 | |
| Hand Tracking (HT) | A-P1 v1 (test) | MPJRE3.34 | 8 | |
| Motion Controllers (MC) Tracking | A-P1 v1 (test) | MPJRE2.77 | 8 | |
| Motion generation from hand-tracking signal | GORP Simulated HT | MPJPE6.05 | 8 | |
| Full-body motion estimation (Motion Controllers) | A-P2 (test) | MPJRE5.17 | 8 | |
| Human Motion Reconstruction | AMASS (Protocol 1) | MPJRE2.32 | 8 | |
| Full-body motion generation | GORP (Simulated MC) | MPJPE5.89 | 8 |