Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation

About

End-to-end (E2E) autonomous driving methods still struggle to make correct decisions in interactive closed-loop evaluation due to limited causal reasoning capability. Current methods attempt to leverage the powerful understanding and reasoning abilities of Vision-Language Models (VLMs) to resolve this dilemma. However, the problem is still open that few VLMs for E2E methods perform well in the closed-loop evaluation due to the gap between the semantic reasoning space and the purely numerical trajectory output in the action space. To tackle this issue, we propose ORION, a holistic E2E autonomous driving framework by vision-language instructed action generation. ORION uniquely combines a QT-Former to aggregate long-term history context, a Large Language Model (LLM) for driving scenario reasoning, and a generative planner for precision trajectory prediction. ORION further aligns the reasoning space and the action space to implement a unified E2E optimization for both visual question-answering (VQA) and planning tasks. Our method achieves an impressive closed-loop performance of 77.74 Driving Score (DS) and 54.62% Success Rate (SR) on the challenge Bench2Drive datasets, which outperforms state-of-the-art (SOTA) methods by a large margin of 14.28 DS and 19.61% SR.

Haoyu Fu, Diankun Zhang, Zongchuang Zhao, Jianfeng Cui, Dingkang Liang, Chong Zhang, Dingyuan Zhang, Hongwei Xie, Bing Wang, Xiang Bai• 2025

Related benchmarks

TaskDatasetResultRank
Closed-loop PlanningBench2Drive
Driving Score77.74
90
Open-loop planningnuScenes v1.0 (val)
L2 (1s)0.17
59
PlanningnuScenes (val)
Collision Rate (Avg)37
52
End-to-end Autonomous DrivingBench2Drive base set
Driving Score77.74
46
PlanningnuScenes v1.0-trainval (val)
ST-P3 L2 Error (1s)0.17
39
Open-loop planningNuScenes v1.0 (test)
L2 Error (1s)0.17
28
Closed-loop Autonomous DrivingBench2Drive
Driving Score (DS)77.74
21
Autonomous DrivingBench2Drive base (train)
Driving Score77.74
19
Autonomous DrivingBench2Drive base set closed-loop
Driving Score (DS)0.7774
15
Open-loop planningBench2Drive base set open-loop
Average L2 Error0.68
15
Showing 10 of 13 rows

Other info

Code

Follow for update