Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ReconPhys: Reconstruct Appearance and Physical Attributes from Single Video

About

Reconstructing non-rigid objects with physical plausibility remains a significant challenge. Existing approaches leverage differentiable rendering for per-scene optimization, recovering geometry and dynamics but requiring expensive tuning or manual annotation, which limits practicality and generalizability. To address this, we propose ReconPhys, the first feedforward framework that jointly learns physical attribute estimation and 3D Gaussian Splatting reconstruction from a single monocular video. Our method employs a dual-branch architecture trained via a self-supervised strategy, eliminating the need for ground-truth physics labels. Given a video sequence, ReconPhys simultaneously infers geometry, appearance, and physical attributes. Experiments on a large-scale synthetic dataset demonstrate superior performance: our method achieves 21.64 PSNR in future prediction compared to 13.27 by state-of-the-art optimization baselines, while reducing Chamfer Distance from 0.349 to 0.004. Crucially, ReconPhys enables fast inference (<1 second) versus hours required by existing methods, facilitating rapid generation of simulation-ready assets for robotics and graphics.

Boyuan Wang, Xiaofeng Wang, Yongkang Li, Zheng Zhu, Yifan Chang, Angen Ye, Guosheng Zhao, Chaojun Ni, Guan Huang, Yijie Ren, Yueqi Duan, Xingang Wang• 2026

Related benchmarks

TaskDatasetResultRank
Dynamic ReconstructionSynthesized objects 46 unseen objects (test)
PSNR33.84
3
Future PredictionSynthesized objects 46 unseen objects (test)
PSNR21.64
2
Physical Property RecoveryReconPhys Dataset
Stiffness297.3
2
Showing 3 of 3 rows

Other info

GitHub

Follow for update