Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SYMBOLIZER: Symbolic Model-free Task Planning with VLMs

About

Traditional Task and Motion Planning (TAMP) systems depend on physics models for motion planning and discrete symbolic models for task planning. Although physics model are often available, symbolic models (consisting of symbolic state interpretation and action models) must be meticulously handcrafted or learned from labeled data. This process is both resource-intensive and constrains the solution to the specific domain, limiting scalability and adaptability. On the other hand, Visual Language Models (VLMs) show desirable zero-shot visual understanding (due to their extensive training on heterogeneous data), but still achieve limited planning capabilities. Therefore, integrating VLMs with classical planning for long-horizon reasoning in TAMP problems offers high potential. Recent works in this direction still lack generality and depend on handcrafted, task-specific solutions, e.g. describing all possible objects in advance, or using symbolic action models. We propose a framework that generalizes well to unseen problem instances. The method requires only lifted predicates describing relations among objects and uses VLMs to ground them from images to obtain the symbolic state. Planning is performed with domain-independent heuristic search using goal-count and width-based heuristics, without need for action models. Symbolic search over VLM-grounded state-space outperforms direct VLM-based planning and performs on par with approaches that use a VLM-derived heuristic. This shows that domain-independent search can effectively solve problems across domains with large combinatorial state spaces. We extensively evaluate on extensively evaluate our method and achieve state-of-the-art results on the ProDG and ViPlan benchmarks.

Sami Azirar, Zlatan Ajanovic, Hermann Blum• 2026

Related benchmarks

TaskDatasetResultRank
Predicate GroundingProDG Blocks
F1 Score100
26
Predicate GroundingProDG Cooking
F1 Score100
21
Predicate GroundingProDG Hanoi
F1 Score97.3
19
Predicate GroundingPyBullet Blocks
F1 Score100
15
Predicate GroundingReal Images Blocksworld
F1 Score100
15
Object GroundingPDDLGym Blocksworld
F1 Score100
10
Predicate GroundingPyBullet Hanoi
F1 Score100
10
Predicate GroundingPDDLGym Hanoi Color
F1 Score100
10
Predicate GroundingKitchen-Worlds
F1 Score100
9
Goal GroundingProDG Hanoi
F171
7
Showing 10 of 27 rows

Other info

Follow for update