Inference-Time Enhancement of Generative Robot Policies via Predictive World Modeling
About
We present Generative Predictive Control (GPC), an inference-time method for improving pretrained behavior-cloning policies without retraining. GPC augments a frozen diffusion policy at deployment with an action-conditioned world model trained on expert demonstrations and random exploration rollouts. The world model predicts the consequences of action proposals generated by the diffusion policy and enables lightweight online planning that ranks and refines these proposals through model-based look-ahead. By combining a generative prior with predictive foresight, GPC enables test-time adaptation while keeping the original policy fixed. Across diverse robotic manipulation tasks, including state- and vision-based settings in both simulation and real-world experiments, GPC consistently outperforms standard behavior cloning and compares favorably with other inference-time adaptation baselines.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Robot Manipulation | LIBERO | Spatial Success Rate98 | 116 | |
| Robot Manipulation | SimplerEnv Fractal Dataset Google Robot (test) | Success Rate: Pick Coke Can97.5 | 4 | |
| Robot Manipulation | SimplerEnv Bridge Dataset WidowX Robot (test) | WidowX Spoon on Towel Success Rate64.5 | 4 |