Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GuardAD: Safeguarding Autonomous Driving MLLMs via Markovian Safety Logic

About

Multimodal large language models (MLLMs) are increasingly integrated into autonomous driving (AD) systems; however, they remain vulnerable to diverse safety threats, particularly in accident-prone scenarios. Recent safeguard mechanisms have shown promise by incorporating logical constraints, yet most rely on static formulations that lack temporally grounded safety reasoning over evolving traffic interactions, resulting in limited robustness in dynamic driving environments. To address these limitations, we propose GuardAD, a model-agnostic safeguard that formulates AD safety as an evolving Markovian logical state. GuardAD introduces Neuro-Symbolic Logic Formalization, which represents safety predicates over heterogeneous traffic participants and continuously induces them via n-th order Markovian Logic Induction. This design enables the inference of emerging and latent hazards beyond single-step observations. Rather than simply vetoing unsafe actions, GuardAD performs Logic-Driven Action Revision, where inferred safety states actively guide action refinement without modifying the underlying MLLM. Extensive experiments on multiple benchmarks and AD-MLLMs demonstrate that GuardAD substantially reduces accident rates (-32.07%) while slightly improving task performance (+6.85%). Moreover, closed-loop simulation evaluations, together with physical-world vehicle studies, further validate the effectiveness and potential of GuardAD.

Tianyuan Zhang, Peng Yue, Zihao Peng, Jiangfan Liu, Zonghao Ying, Jiakai Wang, Tianlin Li, Jian Yang, Yaodong Yang, Aishan Liu, Xianglong Liu• 2026

Related benchmarks

TaskDatasetResultRank
Autonomous driving reasoning (cross-view risk object perception, action prediction, and planning)DriveLM--
12
Autonomous Driving EvaluationDriveLM VRU-Accident benchmark
GAR43.05
10
Autonomous Driving ReasoningDolphins
GS Score48.95
5
Autonomous Driving ReasoningEM-VLM4AD
GS Score68.42
5
Autonomous Driving EvaluationDolphins VRU-Accident benchmark
GAR32.5
5
Autonomous Driving SafetyVRU-Accident (Clean)
GAR35.61
5
Autonomous Driving SafetyVRU-Accident CoA attack
GAR50.61
5
Autonomous Driving SafetyVRU-Accident (ADvLM attack)
GAR50.61
5
Autonomous Driving SafetyVRU-Accident CAD attack
GAR52.61
5
Showing 9 of 9 rows

Other info

Follow for update