GuardAD: Safeguarding Autonomous Driving MLLMs via Markovian Safety Logic

About

Multimodal large language models (MLLMs) are increasingly integrated into autonomous driving (AD) systems; however, they remain vulnerable to diverse safety threats, particularly in accident-prone scenarios. Recent safeguard mechanisms have shown promise by incorporating logical constraints, yet most rely on static formulations that lack temporally grounded safety reasoning over evolving traffic interactions, resulting in limited robustness in dynamic driving environments. To address these limitations, we propose GuardAD, a model-agnostic safeguard that formulates AD safety as an evolving Markovian logical state. GuardAD introduces Neuro-Symbolic Logic Formalization, which represents safety predicates over heterogeneous traffic participants and continuously induces them via n-th order Markovian Logic Induction. This design enables the inference of emerging and latent hazards beyond single-step observations. Rather than simply vetoing unsafe actions, GuardAD performs Logic-Driven Action Revision, where inferred safety states actively guide action refinement without modifying the underlying MLLM. Extensive experiments on multiple benchmarks and AD-MLLMs demonstrate that GuardAD substantially reduces accident rates (-32.07%) while slightly improving task performance (+6.85%). Moreover, closed-loop simulation evaluations, together with physical-world vehicle studies, further validate the effectiveness and potential of GuardAD.

Tianyuan Zhang, Peng Yue, Zihao Peng, Jiangfan Liu, Zonghao Ying, Jiakai Wang, Tianlin Li, Jian Yang, Yaodong Yang, Aishan Liu, Xianglong Liu• 2026

Related benchmarks

Task	Dataset	Result
Autonomous driving reasoning (cross-view risk object perception, action prediction, and planning)	DriveLM	--	25
Autonomous Driving Evaluation	DriveLM VRU-Accident benchmark	GAR43.05	10
Autonomous Driving Reasoning	Dolphins	GS Score48.95	5
Autonomous Driving Reasoning	EM-VLM4AD	GS Score68.42	5
Autonomous Driving Evaluation	Dolphins VRU-Accident benchmark	GAR32.5	5
Autonomous Driving Safety	VRU-Accident (Clean)	GAR35.61	5
Autonomous Driving Safety	VRU-Accident CoA attack	GAR50.61	5
Autonomous Driving Safety	VRU-Accident (ADvLM attack)	GAR50.61	5
Autonomous Driving Safety	VRU-Accident CAD attack	GAR52.61	5

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord