Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Inference-Time Enhancement of Generative Robot Policies via Predictive World Modeling

About

We present Generative Predictive Control (GPC), an inference-time method for improving pretrained behavior-cloning policies without retraining. GPC augments a frozen diffusion policy at deployment with an action-conditioned world model trained on expert demonstrations and random exploration rollouts. The world model predicts the consequences of action proposals generated by the diffusion policy and enables lightweight online planning that ranks and refines these proposals through model-based look-ahead. By combining a generative prior with predictive foresight, GPC enables test-time adaptation while keeping the original policy fixed. Across diverse robotic manipulation tasks, including state- and vision-based settings in both simulation and real-world experiments, GPC consistently outperforms standard behavior cloning and compares favorably with other inference-time adaptation baselines.

Han Qi, Haocheng Yin, Aris Zhu, Yilun Du, Heng Yang• 2025

Related benchmarks

TaskDatasetResultRank
Robot ManipulationLIBERO
Spatial Success Rate98
116
Robot ManipulationSimplerEnv Fractal Dataset Google Robot (test)
Success Rate: Pick Coke Can97.5
4
Robot ManipulationSimplerEnv Bridge Dataset WidowX Robot (test)
WidowX Spoon on Towel Success Rate64.5
4
Showing 3 of 3 rows

Other info

Follow for update