DreamGen: Unlocking Generalization in Robot Learning through Video World Models

About

We introduce DreamGen, a simple yet highly effective 4-stage pipeline for training robot policies that generalize across behaviors and environments through neural trajectories: synthetic robot data generated from video world models. DreamGen leverages state-of-the-art image-to-video generative models, adapting them to the target robot embodiment to produce photorealistic synthetic videos of familiar or novel tasks in diverse environments. Since these models generate only videos, we recover pseudo-action sequences using either a latent action model or an inverse-dynamics model (IDM). Despite its simplicity, DreamGen unlocks strong behavior and environment generalization: a humanoid robot can perform 22 new behaviors in both seen and unseen environments, while requiring teleoperation data from only a single pick-and-place task in one environment. To evaluate the pipeline systematically, we introduce DreamGen Bench, a video generation benchmark that shows a strong correlation between benchmark performance and downstream policy success. Our work establishes a promising new axis for scaling robot learning well beyond manual data collection. Code is available at https://github.com/NVIDIA/GR00T-Dreams.

Joel Jang, Seonghyeon Ye, Zongyu Lin, Jiannan Xiang, Johan Bjorck, Yu Fang, Fengyuan Hu, Spencer Huang, Kaushil Kundalia, Yen-Chen Lin, Loic Magne, Ajay Mandlekar, Avnish Narayan, You Liang Tan, Guanzhi Wang, Jing Wang, Qi Wang, Yinzhen Xu, Xiaohui Zeng, Kaiyuan Zheng, Ruijie Zheng, Ming-Yu Liu, Luke Zettlemoyer, Dieter Fox, Jan Kautz, Scott Reed, Yuke Zhu, Linxi Fan • 2025
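
The middle of the pipeline described in the abstract (generate videos, then recover pseudo-actions from them) can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration and is not taken from the GR00T-Dreams code: the names `NeuralTrajectory`, `label_pseudo_actions`, `generate_neural_trajectories`, `toy_video_model`, and `toy_idm` are hypothetical, frames are toy lists of floats instead of images, and the toy inverse-dynamics model simply returns the difference between consecutive frames. Fine-tuning the video model and training the downstream policy are left out.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

Frame = List[float]   # toy stand-in for an image (assumption)
Action = List[float]  # toy stand-in for a robot action vector (assumption)


@dataclass
class NeuralTrajectory:
    """A generated video paired with the pseudo-actions recovered from it."""
    instruction: str
    frames: List[Frame]
    pseudo_actions: List[Action]


def label_pseudo_actions(
    frames: Sequence[Frame],
    idm: Callable[[Frame, Frame], Action],
) -> List[Action]:
    """Apply an inverse-dynamics model to each pair of consecutive frames."""
    return [idm(prev, nxt) for prev, nxt in zip(frames, frames[1:])]


def generate_neural_trajectories(
    video_model: Callable[[Frame, str], List[Frame]],
    idm: Callable[[Frame, Frame], Action],
    initial_frame: Frame,
    instructions: Sequence[str],
) -> List[NeuralTrajectory]:
    """Generate one video per instruction, then recover its pseudo-actions."""
    trajectories = []
    for instruction in instructions:
        frames = video_model(initial_frame, instruction)
        actions = label_pseudo_actions(frames, idm)
        trajectories.append(NeuralTrajectory(instruction, frames, actions))
    return trajectories


def toy_video_model(initial_frame: Frame, instruction: str) -> List[Frame]:
    """Hypothetical generator: 4 frames, each shifted by +1.0 from the last."""
    frames = [list(initial_frame)]
    for _ in range(3):
        frames.append([value + 1.0 for value in frames[-1]])
    return frames


def toy_idm(prev: Frame, nxt: Frame) -> Action:
    """Hypothetical IDM: the action is the elementwise frame difference."""
    return [b - a for a, b in zip(prev, nxt)]


if __name__ == "__main__":
    dataset = generate_neural_trajectories(
        toy_video_model, toy_idm, [0.0, 0.0], ["task A", "task B"]
    )
    for traj in dataset:
        print(traj.instruction, len(traj.frames), len(traj.pseudo_actions))
```

In this sketch a video of N frames yields N - 1 pseudo-actions, one per frame transition. A real inverse-dynamics model may consume longer frame windows or predict action chunks, so that alignment is a property of the toy setup, not of DreamGen itself.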

Related benchmarks

Task	Dataset	Metric	Result
Robot Policy Learning	LIBERO	S (Spatial) Rate	99.1	73
Robotic Manipulation	RoboCasa	Average Success Rate	20.6	68
Robotic Video Generation	R-Bench	Average Score	42	44
Robotic Manipulation	RoboCasa Kitchen	Success Rate	57.6	22
Robot Tabletop Manipulation	GR-1 Tabletop	Average Success Rate	32.2	13
Kitchen manipulation	RoboCasa 24 kitchen manipulation tasks	Average Success Rate	57.6	12
Pick-&-Place	Real-robot pick-and-place	Cube → Pad Successful Trials	14	5
Dexterous Manipulation	DexMimicGen (test)	Success Rate (GR-1 Humanoid)	57.1	4
Tabletop manipulation	ALLEX humanoid (seen unseen tasks)	Success Rate (P&P Can, In-distribution)	37.5	3
