Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

On Sample-Efficient Generalized Planning via Learned Transition Models

About

Generalized planning studies the construction of solution strategies that generalize across families of planning problems sharing a common domain model, formally defined by a transition function $\gamma : S \times A \rightarrow S$. Classical approaches achieve such generalization through symbolic abstractions and explicit reasoning over $\gamma$. In contrast, recent Transformer-based planners, such as PlanGPT and Plansformer, largely cast generalized planning as direct action-sequence prediction, bypassing explicit transition modeling. While effective on in-distribution instances, these approaches typically require large datasets and model sizes, and often suffer from state drift in long-horizon settings due to the absence of explicit world-state evolution. In this work, we formulate generalized planning as a transition-model learning problem, in which a neural model explicitly approximates the successor-state function $\hat{\gamma} \approx \gamma$ and generates plans by rolling out symbolic state trajectories. Instead of predicting actions directly, the model autoregressively predicts intermediate world states, thereby learning the domain dynamics as an implicit world model. To study size-invariant generalization and sample efficiency, we systematically evaluate multiple state representations and neural architectures, including relational graph encodings. Our results show that learning explicit transition models yields higher out-of-distribution satisficing-plan success than direct action-sequence prediction in multiple domains, while achieving these gains with significantly fewer training instances and smaller models. This is an extended version of a short paper accepted at ICAPS 2026 under the same title.

Nitin Gupta, Vishal Pallagani, John A. Aydin, Biplav Srivastava• 2026

Related benchmarks

TaskDatasetResultRank
Generalized PlanningVisitAll (Extrapolation)
Coverage87
17
Generalized PlanningBlocks (Extrapolation)
Coverage45
17
Generalized PlanningGripper (Extrapolation)
Coverage25
17
Generalized PlanningLogistics (val)
Coverage17
17
Generalized PlanningLogistics (Interpolation)
Coverage78
17
Generalized PlanningVisitAll (val)
Coverage100
17
Generalized PlanningGripper (Interpolation)
Coverage100
17
Generalized PlanningVisitAll (Interpolation)
Coverage100
17
Generalized PlanningGripper (val)
Coverage100
17
Generalized PlanningBlocks (val)
Coverage100
17
Showing 10 of 12 rows

Other info

Follow for update