On Sample-Efficient Generalized Planning via Learned Transition Models

About

Generalized planning studies the construction of solution strategies that generalize across families of planning problems sharing a common domain model, formally defined by a transition function $\gamma : S \times A \rightarrow S$. Classical approaches achieve such generalization through symbolic abstractions and explicit reasoning over $\gamma$. In contrast, recent Transformer-based planners, such as PlanGPT and Plansformer, largely cast generalized planning as direct action-sequence prediction, bypassing explicit transition modeling. While effective on in-distribution instances, these approaches typically require large datasets and model sizes, and often suffer from state drift in long-horizon settings due to the absence of explicit world-state evolution. In this work, we formulate generalized planning as a transition-model learning problem, in which a neural model explicitly approximates the successor-state function $\hat{\gamma} \approx \gamma$ and generates plans by rolling out symbolic state trajectories. Instead of predicting actions directly, the model autoregressively predicts intermediate world states, thereby learning the domain dynamics as an implicit world model. To study size-invariant generalization and sample efficiency, we systematically evaluate multiple state representations and neural architectures, including relational graph encodings. Our results show that learning explicit transition models yields higher out-of-distribution satisficing-plan success than direct action-sequence prediction in multiple domains, while achieving these gains with significantly fewer training instances and smaller models. This is an extended version of a short paper accepted at ICAPS 2026 under the same title.

Nitin Gupta, Vishal Pallagani, John A. Aydin, Biplav Srivastava• 2026

Related benchmarks

Task	Dataset	Result
Generalized Planning	VisitAll (Extrapolation)	Coverage87	17
Generalized Planning	Blocks (Extrapolation)	Coverage45	17
Generalized Planning	Gripper (Extrapolation)	Coverage25	17
Generalized Planning	Logistics (val)	Coverage17	17
Generalized Planning	Logistics (Interpolation)	Coverage78	17
Generalized Planning	VisitAll (val)	Coverage100	17
Generalized Planning	Gripper (Interpolation)	Coverage100	17
Generalized Planning	VisitAll (Interpolation)	Coverage100	17
Generalized Planning	Gripper (val)	Coverage100	17
Generalized Planning	Blocks (val)	Coverage100	17

Showing 10 of 12 rows

Other info

Follow for update

@wizwand_team Discord