GAPartNet: Cross-Category Domain-Generalizable Object Perception and Manipulation via Generalizable and Actionable Parts
About
For years, researchers have been devoted to generalizable object perception and manipulation, where cross-category generalizability is highly desired yet underexplored. In this work, we propose to learn such cross-category skills via Generalizable and Actionable Parts (GAParts). By identifying and defining 9 GAPart classes (lids, handles, etc.) in 27 object categories, we construct a large-scale part-centric interactive dataset, GAPartNet, where we provide rich, part-level annotations (semantics, poses) for 8,489 part instances on 1,166 objects. Based on GAPartNet, we investigate three cross-category tasks: part segmentation, part pose estimation, and part-based object manipulation. Given the significant domain gaps between seen and unseen object categories, we propose a robust 3D segmentation method from the perspective of domain generalization by integrating adversarial learning techniques. Our method outperforms all existing methods by a large margin, no matter on seen or unseen categories. Furthermore, with part segmentation and pose estimation results, we leverage the GAPart pose definition to design part-based manipulation heuristics that can generalize well to unseen object categories in both the simulator and the real world. Our dataset, code, and demos are available on our project page.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Articulated Object Joint Estimation | PartNet-Mobility Simulated Dataset Seen Categories v0 (test) | Orientation Error (°)7.28 | 33 | |
| Articulated Object Joint Estimation | PartNet-Mobility Simulated Dataset Unseen Categories v0 (test) | Orientation Error (deg)10.63 | 12 | |
| Articulated Object Joint Estimation | Real world dataset Seen categories 1.0 (seen_objects) | Axis Orientation Error (°)18.62 | 12 | |
| Part Pose Estimation | GAPartNet Seen Object Categories | Rotation Error (Re)9.9 | 7 | |
| Motion Axis Estimation | OPD real 12 | Motion Axis Error6.31 | 6 | |
| Articulated Object Joint Estimation | Real world dataset Unseen categories 1.0 | Axis Orientation Error (°)19.17 | 6 | |
| Robot Manipulation | FrankaKitchen, PartManip, and ManiSkill simulation benchmarks (test) | T01 Success Rate73.3 | 6 | |
| Part Segmentation | GAPartNet (Seen Categories) | AP50 (Ln.F.Hl.)89.2 | 4 | |
| Part Segmentation | GAPartNet (Unseen Categories) | AP50 (Ln.F.Hl.)45.6 | 4 | |
| Part Pose Estimation | RGBD-Art (unseen object categories) | Re90.81 | 4 |