Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LAP: Fast LAtent Diffusion Planner with Fine-Grained Feature Distillation for Autonomous Driving

About

Diffusion models have demonstrated strong capabilities for modeling human-like driving behaviors in autonomous driving, but their iterative sampling process induces substantial latency, and operating directly on raw trajectory points forces the model to spend capacity on low-level kinematics, rather than high-level multi-modal semantics. To address these limitations, we propose LAtent Planner (LAP), a framework that plans in a VAE-learned latent space that disentangles high-level intents from low-level kinematics, enabling our planner to capture rich, multi-modal driving strategies. We further introduce a fine-grained feature distillation mechanism to guide a better interaction and fusion between the high-level semantic planning space and the vectorized scene context. Notably, LAP can produce high-quality plans in one single denoising step, substantially reducing computational overhead. Through extensive evaluations on the large-scale nuPlan benchmark, LAP achieves state-of-the-art closed-loop performance among learning-based planning methods, while demonstrating an inference speed-up of at most 10 times over previous SOTA approaches. Code will be released at: https://github.com/jhz1192/Latent-Planner.

Jinhao Zhang, Wenlong Xia, Zhexuan Zhou, Youmin Gong, Jie Mei• 2025

Related benchmarks

TaskDatasetResultRank
Closed-loop PlanningnuPlan 14 (val)--
66
Closed-loop PlanningnuPlan 14 Hard (test)--
64
Closed-loop PlanningnuPlan 14 (test)--
45
Motion PlanningnuPlan
Score78.52
6
Showing 4 of 4 rows

Other info

Follow for update