LLM4AD: Large Language Models for Autonomous Driving -- Concept, Review, Benchmark, Experiments, and Future Trends
About
With the broader adoption and highly successful development of Large Language Models (LLMs), there has been growing interest and demand for applying LLMs to autonomous driving technology. Driven by their natural language understanding and reasoning capabilities, LLMs have the potential to enhance various aspects of autonomous driving systems, from perception and scene understanding to interactive decision-making. This paper first introduces the novel concept of designing Large Language Models for Autonomous Driving (LLM4AD), followed by a review of existing LLM4AD studies. Then, a comprehensive benchmark is proposed for evaluating the instruction-following and reasoning abilities of LLM4AD systems, which includes LaMPilot-Bench, CARLA Leaderboard 1.0 Benchmark in simulation and NuPlanQA for multi-view visual question answering. Furthermore, extensive real-world experiments are conducted on autonomous vehicle platforms, examining both on-cloud and on-edge LLM deployment for personalized decision-making and motion control. Next, the future trends of integrating language diffusion models into autonomous driving are explored, exemplified by the proposed ViLaD (Vision-Language Diffusion) framework. Finally, the main challenges of LLM4AD are discussed, including latency, deployment, security and privacy, safety, trust and transparency, and personalization.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Planning | nuScenes v1.0-trainval (val) | -- | 39 | |
| End-to-end Planning | nuScenes | L2 Error (3s)2.69 | 34 | |
| Autonomous Driving | CARLA Leaderboard official 1.0 (test) | Driving Score65.4 | 20 | |
| Multi-modal Reasoning | NuPlanQA EVAL | Traffic Light Accuracy61.1 | 18 | |
| Autonomous Driving | LaMPilot-Bench | -- | 18 | |
| Driving Performance Validation | Driving Performance Lane Change Scenario (val) | Time to Collision (s)2.15 | 3 | |
| Driving Performance Validation | Driving Performance Acceleration Scenario (val) | Time to Collision (s)2.46 | 3 | |
| Driving Performance Validation | Driving Performance Left Turn Scenario (val) | Sigma X Variance ($m^2/s^2$)0.94 | 3 |