Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder

About

Although end-to-end autonomous driving (E2E-AD) technologies have made significant progress in recent years, there remains an unsatisfactory performance on closed-loop evaluation. The potential of leveraging planning in query design and interaction has not yet been fully explored. In this paper, we introduce a multi-granularity planning query representation that integrates heterogeneous waypoints, including spatial, temporal, and driving-style waypoints across various sampling patterns. It provides additional supervision for trajectory prediction, enhancing precise closed-loop control for the ego vehicle. Additionally, we explicitly utilize the geometric properties of planning trajectories to effectively retrieve relevant image features based on physical locations using deformable attention. By combining these strategies, we propose a novel end-to-end autonomous driving framework, termed HiP-AD, which simultaneously performs perception, prediction, and planning within a unified decoder. HiP-AD enables comprehensive interaction by allowing planning queries to iteratively interact with perception queries in the BEV space while dynamically extracting image features from perspective views. Experiments demonstrate that HiP-AD outperforms all existing end-to-end autonomous driving methods on the closed-loop benchmark Bench2Drive and achieves competitive performance on the real-world dataset nuScenes.

Yingqi Tang, Zhuoran Xu, Zhaotie Meng, Erkang Cheng• 2025

Related benchmarks

TaskDatasetResultRank
3D Multi-Object TrackingnuScenes (val)
AMOTA40.6
115
Closed-loop PlanningBench2Drive
Driving Score86.77
90
Open-loop planningnuScenes v1.0 (val)
L2 (1s)0.28
59
Object DetectionnuScenes (val)
mAP42.4
41
MotionnuScenes (val)
minADE0.61
34
Closed-loop PlanningBench2Drive (test)
Driving Score86.77
21
Online MappingnuScenes (val)--
15
PlanningBench2Drive open-loop (val)
Avg L2 Error0.69
11
End-to-end DrivingCARLA Bench2Drive v1 (test)
Driving Score86.8
11
End-to-end DrivingCARLA Longest6 v2 (test)
Driving Score7
11
Showing 10 of 12 rows

Other info

Code

Follow for update