Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning

About

End-to-end autonomous driving unifies tasks in a differentiable framework, enabling planning-oriented optimization and attracting growing attention. Current methods aggregate historical information either through dense historical bird's-eye-view (BEV) features or by querying a sparse memory bank, following paradigms inherited from detection. However, we argue that these paradigms either omit historical information in motion planning or fail to align with its multi-step nature, which requires predicting or planning multiple future time steps. In line with the philosophy of future is a continuation of past, we propose BridgeAD, which reformulates motion and planning queries as multi-step queries to differentiate the queries for each future time step. This design enables the effective use of historical prediction and planning by applying them to the appropriate parts of the end-to-end system based on the time steps, which improves both perception and motion planning. Specifically, historical queries for the current frame are combined with perception, while queries for future frames are integrated with motion planning. In this way, we bridge the gap between past and future by aggregating historical insights at every time step, enhancing the overall coherence and accuracy of the end-to-end autonomous driving pipeline. Extensive experiments on the nuScenes dataset in both open-loop and closed-loop settings demonstrate that BridgeAD achieves state-of-the-art performance.

Bozhou Zhang, Nan Song, Xin Jin, Li Zhang• 2025

Related benchmarks

TaskDatasetResultRank
3D Object DetectionnuScenes (val)
NDS59.4
981
Open-loop planningnuScenes (val)
L2 Error (3s)0.92
177
3D Multi-Object TrackingnuScenes (val)
AMOTA51.2
139
3D Object DetectionnuScenes (val)
mAP42.3
128
Closed-loop Autonomous DrivingBench2Drive
Driving Score (DS)50.06
49
PlanningnuScenes v1.0-trainval (val)
ST-P3 L2 Error (1s)0.28
39
Open-loop planningnuScenes v1.0-trainval (val)
L2 Error (1s)0.28
38
Open-loop planningBench2Drive
Average L2 Error0.71
36
Multi-Object TrackingnuScenes (val)
AMOTA39.8
33
Closed-loop simulationNeuroNCAP
NeuroNCAP Score (Avg)3.06
21
Showing 10 of 17 rows

Other info

Code

Follow for update