Multi-Part Object Representations via Graph Structures and Co-Part Discovery
About
Discovering object-centric representations from images can significantly enhance the robustness, sample efficiency and generalizability of vision models. Works on images with multi-part objects typically follow an implicit object representation approach, which fail to recognize these learned objects in occluded or out-of-distribution contexts. This is due to the assumption that object part-whole relations are implicitly encoded into the representations through indirect training objectives. We address this limitation by proposing a novel method that leverages on explicit graph representations for parts and present a co-part object discovery algorithm. We then introduce three benchmarks to evaluate the robustness of object-centric methods in recognizing multi-part objects within occluded and out-of-distribution settings. Experimental results on simulated, realistic, and real-world images show marked improvements in the quality of discovered objects compared to state-of-the-art methods, as well as the accurate recognition of multi-part objects in occluded and out-of-distribution contexts. We also show that the discovered object-centric representations can more accurately predict key object properties in a downstream task, highlighting the potential of our method to advance the field of object-centric representations.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Foreground Object Discovery | GSO (test) | ARI99.1 | 10 | |
| Foreground Object Discovery | SKU-110K (test) | ARI7.8 | 10 | |
| Foreground Object Discovery | Tetrominoes | ARI100 | 10 | |
| Foreground Object Discovery | AbsScene | ARI97.7 | 10 | |
| Foreground Object Discovery | Pascal VOC | ARI19 | 10 | |
| Foreground Object Discovery | MS-COCO | ARI38.9 | 10 | |
| Object Discovery | AbsScene-O | mDice93.1 | 10 | |
| Object Discovery | GSO-O | mDice90.3 | 10 | |
| Object Discovery | AbsScene-C (test) | ARI93.5 | 10 | |
| Object Property Prediction | Tetrominoes | Accuracy (Red)100 | 6 |