Learning Discrete Abstractions for Visual Rearrangement Tasks Using Vision-Guided Graph Coloring
About
Learning abstractions directly from data is a core challenge in robotics. Humans naturally operate at an abstract level, reasoning over high-level subgoals while delegating execution to low-level motor skills -- an ability that enables efficient problem solving in complex environments. In robotics, abstractions and hierarchical reasoning have long been central to planning, yet they are typically hand-engineered, demanding significant human effort and limiting scalability. Automating the discovery of useful abstractions directly from visual data would make planning frameworks more scalable and more applicable to real-world robotic domains. In this work, we focus on rearrangement tasks where the state is represented with raw images, and propose a method to induce discrete, graph-structured abstractions by combining structural constraints with an attention-guided visual distance. Our approach leverages the inherent bipartite structure of rearrangement problems, integrating structural constraints and visual embeddings into a unified framework. This enables the autonomous discovery of abstractions from vision alone, which can subsequently support high-level planning. We evaluate our method on two rearrangement tasks in simulation and show that it consistently identifies meaningful abstractions that facilitate effective planning and outperform existing approaches.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Visual Task Planning and Graph Learning | Multi Fruit 2x3 | Opt66 | 3 | |
| Visual Task Planning and Graph Learning | Blocks 2 | Opt93.7 | 3 | |
| Visual Task Planning and Graph Learning | Blocks 3 | Opt72 | 3 | |
| Visual Task Planning and Graph Learning | Fruit-4x6 | Opt Rate78.1 | 3 | |
| Visual Task Planning and Graph Learning | Multi Fruit 4x6 | Opt71.8 | 3 | |
| Visual Task Planning and Graph Learning | Fruit 6x8 | Opt64.8 | 3 | |
| Visual Task Planning and Graph Learning | Fruit-2x3 | Optimality Score100 | 3 |