LLM4AD: Large Language Models for Autonomous Driving -- Concept, Review, Benchmark, Experiments, and Future Trends

About

With the broader adoption and highly successful development of Large Language Models (LLMs), there has been growing interest and demand for applying LLMs to autonomous driving technology. Driven by their natural language understanding and reasoning capabilities, LLMs have the potential to enhance various aspects of autonomous driving systems, from perception and scene understanding to interactive decision-making. This paper first introduces the novel concept of designing Large Language Models for Autonomous Driving (LLM4AD), followed by a review of existing LLM4AD studies. Then, a comprehensive benchmark is proposed for evaluating the instruction-following and reasoning abilities of LLM4AD systems, which includes LaMPilot-Bench, CARLA Leaderboard 1.0 Benchmark in simulation and NuPlanQA for multi-view visual question answering. Furthermore, extensive real-world experiments are conducted on autonomous vehicle platforms, examining both on-cloud and on-edge LLM deployment for personalized decision-making and motion control. Next, the future trends of integrating language diffusion models into autonomous driving are explored, exemplified by the proposed ViLaD (Vision-Language Diffusion) framework. Finally, the main challenges of LLM4AD are discussed, including latency, deployment, security and privacy, safety, trust and transparency, and personalization.

Can Cui, Yunsheng Ma, Sung-Yeon Park, Zichong Yang, Yupeng Zhou, Peiran Liu, Juanwu Lu, Juntong Peng, Jiaru Zhang, Ruqi Zhang, Lingxi Li, Yaobin Chen, Jitesh H. Panchal, Amr Abdelraouf, Rohit Gupta, Kyungtae Han, Ziran Wang• 2024

Related benchmarks

Task	Dataset	Result
End-to-end Planning	nuScenes	L2 Error (3s)2.69	45
Planning	nuScenes v1.0-trainval (val)	--	39
Autonomous Driving	CARLA Leaderboard official 1.0 (test)	Driving Score65.4	20
Multi-modal Reasoning	NuPlanQA EVAL	Traffic Light Accuracy61.1	18
Autonomous Driving	LaMPilot-Bench	--	18
Driving Performance Validation	Driving Performance Lane Change Scenario (val)	Time to Collision (s)2.15	3
Driving Performance Validation	Driving Performance Acceleration Scenario (val)	Time to Collision (s)2.46	3
Driving Performance Validation	Driving Performance Left Turn Scenario (val)	Sigma X Variance ($m^2/s^2$)0.94	3

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord