Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OPD: Single-view 3D Openable Part Detection

About

We address the task of predicting what parts of an object can open and how they move when they do so. The input is a single image of an object, and as output we detect what parts of the object can open, and the motion parameters describing the articulation of each openable part. To tackle this task, we create two datasets of 3D objects: OPDSynth based on existing synthetic objects, and OPDReal based on RGBD reconstructions of real objects. We then design OPDRCNN, a neural architecture that detects openable parts and predicts their motion parameters. Our experiments show that this is a challenging task especially when considering generalization across object categories, and the limited amount of information in a single image. Our architecture outperforms baselines and prior work especially for RGB image inputs. Short video summary at https://www.youtube.com/watch?v=P85iCaD0rfc

Hanxiao Jiang, Yongsen Mao, Manolis Savva, Angel X. Chang• 2022

Related benchmarks

TaskDatasetResultRank
Articulated Object ManipulationReal-robot manipulation trials Right Hinge
OSR30
9
Articulated Object ManipulationReal-robot manipulation trials Textured Hinge
OSR30
9
Articulated Object ManipulationReal-robot manipulation trials Mean across 50 tasks
Overall Success Rate (OSR)35
9
Articulated Object ManipulationReal-robot manipulation trials Prismatic Hinge
OSR50
9
Articulated Object ManipulationReal-robot manipulation trials Left Hinge
OSR30
9
Articulated Object Manipulation50 tasks in campus environments
Right Hinge Time (s)39.3
9
Motion Axis EstimationOPD real 12
Motion Axis Error6.67
6
Articulated Object Axis EstimationCampus-scale 50 tasks (test)
Right Hinge Axis EA-Score34.1
4
3D Object Articulation PredictionMultiScan (evaluation)
mIoU53.8
4
Motion Parameter EstimationACD curated (test)
MAE (Armoire)12.37
3
Showing 10 of 12 rows

Other info

Follow for update