Inference-Time Enhancement of Generative Robot Policies via Predictive World Modeling

About

We present Generative Predictive Control (GPC), an inference-time method for improving pretrained behavior-cloning policies without retraining. GPC augments a frozen diffusion policy at deployment with an action-conditioned world model trained on expert demonstrations and random exploration rollouts. The world model predicts the consequences of action proposals generated by the diffusion policy and enables lightweight online planning that ranks and refines these proposals through model-based look-ahead. By combining a generative prior with predictive foresight, GPC enables test-time adaptation while keeping the original policy fixed. Across diverse robotic manipulation tasks, including state- and vision-based settings in both simulation and real-world experiments, GPC consistently outperforms standard behavior cloning and compares favorably with other inference-time adaptation baselines.

Han Qi, Haocheng Yin, Aris Zhu, Yilun Du, Heng Yang• 2025

Related benchmarks

Task	Dataset	Result
Robot Manipulation	LIBERO	Spatial Success Rate98	223
Robot Manipulation	SimplerEnv Fractal Dataset Google Robot (test)	Success Rate: Pick Coke Can97.5	4
Robot Manipulation	SimplerEnv Bridge Dataset WidowX Robot (test)	WidowX Spoon on Towel Success Rate64.5	4

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord