
DriveLaW: Unifying Planning and Video Generation in a Latent Driving World

About

World models have become crucial for autonomous driving because they learn how scenarios evolve over time, which helps address the long-tail challenges of the real world. However, current approaches relegate world models to limited roles: they operate within ostensibly unified architectures that still keep world prediction and motion planning as decoupled processes. To bridge this gap, we propose DriveLaW, a novel paradigm that unifies video generation and motion planning. By directly injecting the latent representation from its video generator into the planner, DriveLaW ensures inherent consistency between high-fidelity future generation and reliable trajectory planning. Specifically, DriveLaW consists of two core components: DriveLaW-Video, a world model that generates high-fidelity forecasts with expressive latent representations, and DriveLaW-Act, a diffusion planner that generates consistent and reliable trajectories from the latents of DriveLaW-Video. Both components are optimized with a three-stage progressive training strategy. The unified paradigm achieves new state-of-the-art results on both tasks: DriveLaW surpasses the best-performing prior work by 33.3% in FID and 1.8% in FVD on video prediction, and sets a new record on the NAVSIM planning benchmark.
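FID and FVD are lower-is-better metrics, so a gain such as "33.3% in FID" is most naturally read as a relative reduction against the previous best score. The sketch below shows that arithmetic under this assumption; the function name `relative_improvement` and the example scores are hypothetical and are not values reported for DriveLaW.

```python
def relative_improvement(baseline: float, ours: float) -> float:
    """Percent reduction of a lower-is-better metric (e.g. FID, FVD) versus a baseline."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (baseline - ours) / baseline * 100.0


# Hypothetical scores for illustration only, not numbers from the paper.
example = relative_improvement(baseline=6.0, ours=4.0)
print(f"{example:.1f}% lower")
```

Under this reading, a drop from 6.0 to 4.0 counts as a 33.3% improvement, while the same absolute drop from a larger baseline would count for less.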

Tianze Xia, Yongkang Li, Lijun Zhou, Jingfeng Yao, Kaixin Xiong, Haiyang Sun, Bing Wang, Kun Ma, Guang Chen, Hangjun Ye, Wenyu Liu, Xinggang Wang • 2025

Related benchmarks

Task | Dataset | Result | Rank
Autonomous Driving Planning | NAVSIM v1 (test) | NC 99 | 118
Video Generation | nuScenes (val) | FVD 81.3 | 72
Planning | NAVSIM (test) | PDMS 89.1 | 59
Autonomous Driving Planning | NAVSIM v2 (Navtest) | NC 98.7 | 48
Trajectory Planning | NAVSIM v2 (navhard) | NC Rate 97.3 | 43
Autonomous Driving Planning | NAVSIM v1 (navtest) | NC 99 | 24
Trajectory Planning | NAVSIM v1 (test) | PDMS 89.1 | 24
Closed-loop Planning | NAVSIM v1 (test) | PDMS 89.1 | 20
Closed-loop Planning | NAVSIM Navtest (test) | PDMS 89.1 | 16
Video Generation | nuPlan | FVD 55.6 | 8
