A Language Agent for Autonomous Driving

About

Human-level driving is an ultimate goal of autonomous driving. Conventional approaches formulate autonomous driving as a perception-prediction-planning framework, yet their systems do not capitalize on the inherent reasoning ability and experiential knowledge of humans. In this paper, we propose a fundamental paradigm shift from current pipelines, exploiting Large Language Models (LLMs) as a cognitive agent to integrate human-like intelligence into autonomous driving systems. Our approach, termed Agent-Driver, transforms the traditional autonomous driving pipeline by introducing a versatile tool library accessible via function calls, a cognitive memory of common sense and experiential knowledge for decision-making, and a reasoning engine capable of chain-of-thought reasoning, task planning, motion planning, and self-reflection. Powered by LLMs, our Agent-Driver is endowed with intuitive common sense and robust reasoning capabilities, thus enabling a more nuanced, human-like approach to autonomous driving. We evaluate our approach on the large-scale nuScenes benchmark, and extensive experiments substantiate that our Agent-Driver significantly outperforms the state-of-the-art driving methods by a large margin. Our approach also demonstrates superior interpretability and few-shot learning ability to these methods.

Jiageng Mao, Junjie Ye, Yuxi Qian, Marco Pavone, Yue Wang• 2023

Related benchmarks

Task	Dataset	Result
Open-loop planning	nuScenes (val)	L2 Error (3s)0.61	225
Trajectory Planning	nuScenes	--	58
End-to-end Planning	nuScenes (open-loop)	L2 Error (1s)0.22	24

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord