Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

iTACO: Interactable Digital Twins of Articulated Objects from Casually Captured RGBD Videos

About

Articulated objects are prevalent in daily life. Interactable digital twins of such objects have numerous applications in embodied AI and robotics. Unfortunately, current methods to digitize articulated real-world objects require carefully captured data, preventing practical, scalable, and generalizable acquisition. We focus on motion analysis and part-level segmentation of an articulated object from a casually captured RGBD video shot with a hand-held camera. A casually captured video of an interaction with an articulated object is easy to obtain at scale using smartphones. However, this setting is challenging due to simultaneous object and camera motion and significant occlusions as the person interacts with the object. To tackle these challenges, we introduce iTACO: a coarse-to-fine framework that infers joint parameters and segments movable parts of the object from a dynamic RGBD video. To evaluate our method under this new setting, we build a dataset of 784 videos containing 284 objects across 11 categories that is 20$\times$ larger than available in prior work. We then compare our approach with existing methods that also take video as input. Our experiments show that iTACO outperforms existing articulated object digital twin methods on both synthetic and real casually captured RGBD videos.

Weikun Peng, Jun Lv, Cewu Lu, Manolis Savva• 2025

Related benchmarks

TaskDatasetResultRank
ReconstructionVideo2Articulation-S
CD-w0.01
8
Free-moving Articulated Object ReconstructionFreeArt-21 Prismatic 1.0 (test)
Axis Error (deg)18.47
7
Free-moving Articulated Object ReconstructionFreeArt-21 Revolute 1.0 (test)
Axis Alignment Error (deg)20
7
TrackingArticulat3D-Sim
EPE0.14
5
Joint EstimationVideo2Articulation-S
Axis Error16.05
4
Joint Estimation (Prismatic)Video2Articulation-S
Axis Angle (deg)13.75
4
Joint Estimation (Revolute)Video2Articulation-S
Axis Error (deg)18.34
4
ReconstructionArticulat3D-Sim
CD-w3.13
4
Joint EstimationArticulat3D-Sim
Axis Error48.5
4
Camera pose estimationVideo2Articulation-S
Rotational Error (deg)4.621
2
Showing 10 of 17 rows

Other info

Follow for update