Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

sim2art: Accurate Articulated Object Modeling from a Single Video using Synthetic Training Data Only

About

Understanding articulated objects is a fundamental challenge in robotics and digital twin creation. To effectively model such objects, it is essential to recover both part segmentation and the underlying joint parameters. Despite the importance of this task, previous work has largely focused on setups like multi-view systems, object scanning, or static cameras. In this paper, we present the first data-driven approach that jointly predicts part segmentation and joint parameters from monocular video captured with a freely moving camera. Trained solely on synthetic data, our method demonstrates strong generalization to real-world objects, offering a scalable and practical solution for articulated object understanding. Our approach operates directly on casually recorded video, making it suitable for real-time applications in dynamic environments. Project webpage: https://aartykov.github.io/sim2art/

Arslan Artykov, Corentin Sautier, Vincent Lepetit• 2025

Related benchmarks

TaskDatasetResultRank
Joint Axis PredictionSynthetic dataset
Axis Angle (°)4.54
4
Joint Type ClassificationSynthetic dataset
Type Accuracy0.992
4
Part SegmentationSynthetic dataset
mIoU91
3
Part Motion PredictionSynthetic dataset
Part Rotation (Deg)3.48
2
Showing 4 of 4 rows

Other info

Follow for update