Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder

About

Although end-to-end autonomous driving (E2E-AD) technologies have made significant progress in recent years, there remains an unsatisfactory performance on closed-loop evaluation. The potential of leveraging planning in query design and interaction has not yet been fully explored. In this paper, we introduce a multi-granularity planning query representation that integrates heterogeneous waypoints, including spatial, temporal, and driving-style waypoints across various sampling patterns. It provides additional supervision for trajectory prediction, enhancing precise closed-loop control for the ego vehicle. Additionally, we explicitly utilize the geometric properties of planning trajectories to effectively retrieve relevant image features based on physical locations using deformable attention. By combining these strategies, we propose a novel end-to-end autonomous driving framework, termed HiP-AD, which simultaneously performs perception, prediction, and planning within a unified decoder. HiP-AD enables comprehensive interaction by allowing planning queries to iteratively interact with perception queries in the BEV space while dynamically extracting image features from perspective views. Experiments demonstrate that HiP-AD outperforms all existing end-to-end autonomous driving methods on the closed-loop benchmark Bench2Drive and achieves competitive performance on the real-world dataset nuScenes.

Yingqi Tang, Zhuoran Xu, Zhaotie Meng, Erkang Cheng• 2025

Related benchmarks

TaskDatasetResultRank
3D Multi-Object TrackingnuScenes (val)
AMOTA40.6
139
Closed-loop PlanningBench2Drive
Driving Score86.77
137
Open-loop planningnuScenes v1.0 (val)
L2 (1s)0.28
71
MotionnuScenes (val)
minADE0.61
49
Object DetectionnuScenes (val)
mAP42.4
48
Open-loop planningnuScenes v1.0-trainval (val)
L2 Error (1s)0.28
38
Online MappingnuScenes (val)
mAP57.1
32
Closed-loop PlanningBench2Drive (test)
Driving Score86.77
21
End-to-end Autonomous DrivingBench2Drive v1.0 (Full-data)
Driving Score86.77
14
PlanningBench2Drive open-loop (val)
Avg L2 Error0.69
11
Showing 10 of 13 rows

Other info

Code

Follow for update