Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

iPad: Iterative Proposal-centric End-to-End Autonomous Driving

About

End-to-end (E2E) autonomous driving systems offer a promising alternative to traditional modular pipelines by reducing information loss and error accumulation, with significant potential to enhance both mobility and safety. However, most existing E2E approaches directly generate plans based on dense bird's-eye view (BEV) grid features, leading to inefficiency and limited planning awareness. To address these limitations, we propose iterative Proposal-centric autonomous driving (iPad), a novel framework that places proposals - a set of candidate future plans - at the center of feature extraction and auxiliary tasks. Central to iPad is ProFormer, a BEV encoder that iteratively refines proposals and their associated features through proposal-anchored attention, effectively fusing multi-view image data. Additionally, we introduce two lightweight, proposal-centric auxiliary tasks - mapping and prediction - that improve planning quality with minimal computational overhead. Extensive experiments on the NAVSIM and CARLA Bench2Drive benchmarks demonstrate that iPad achieves state-of-the-art performance while being significantly more efficient than prior leading methods.

Ke Guo, Haochen Liu, Xiaojun Wu, Jia Pan, Chen Lv• 2025

Related benchmarks

TaskDatasetResultRank
Autonomous DrivingNAVSIM v1 (test)
NC98.6
113
Autonomous Driving PlanningNAVSIM v1 (test)
NC98.6
59
PlanningNAVSIM (navtest)
NC98.6
53
End-to-end PlanningNAVSIM v1
NC0.992
32
End-to-end Autonomous DrivingBench2Drive
Driving Score60.52
31
Autonomous DrivingBench2Drive 220 routes across CARLA towns
Efficiency161.3
20
Autonomous DrivingBench2Drive base subset (50 clips for open-loop)
Avg L2 Error0.97
11
Open-loop Autonomous DrivingNAVSIM (val)
NC98.6
10
End-to-end DrivingNAVSIM v2 (test)
NC98.7
9
Showing 9 of 9 rows

Other info

Code

Follow for update