
Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought

About

Recent advancements in the reasoning capability of Multimodal Large Language Models (MLLMs) demonstrate their effectiveness in tackling complex visual tasks. However, existing MLLM-based Video Anomaly Detection (VAD) methods remain limited to shallow anomaly descriptions without deep reasoning. In this paper, we propose a new task named Video Anomaly Reasoning (VAR), which aims to enable deep analysis and understanding of anomalies in videos by requiring MLLMs to think explicitly before answering. To this end, we propose Vad-R1, an end-to-end MLLM-based framework for VAR. Specifically, we design a Perception-to-Cognition Chain-of-Thought (P2C-CoT) that simulates the human process of recognizing anomalies, guiding the MLLM to reason about anomalies step by step. Based on the structured P2C-CoT, we construct Vad-Reasoning, a dedicated dataset for VAR. Furthermore, we propose an improved reinforcement learning algorithm, AVA-GRPO, which explicitly incentivizes the anomaly reasoning capability of MLLMs through a self-verification mechanism with limited annotations. Experimental results demonstrate that Vad-R1 achieves superior performance, outperforming both open-source and proprietary models on VAD and VAR tasks. Code and datasets will be released at https://github.com/wbfwonderful/Vad-R1.
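The abstract describes AVA-GRPO only at a high level, but it names the GRPO family of reinforcement learning algorithms, whose core step is computing group-relative advantages over a batch of sampled responses. Below is a minimal sketch of that step, with a hypothetical self-verification bonus added to the reward; the function names and the consistency check are illustrative assumptions, not the authors' implementation:

```python
from statistics import mean, pstdev

def grpo_advantages(rewards):
    """Group-relative advantages as in GRPO: normalize each sampled
    response's reward by the group's mean and (population) std, so no
    learned value function is needed."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + 1e-8) for r in rewards]

def self_verified_reward(base_reward, answer_full, answer_reverified, bonus=0.5):
    """Hypothetical self-verification term: add a bonus when the model's
    answer survives a re-check (e.g. re-answering on a modified clip),
    penalize it otherwise -- a way to shape rewards with few labels."""
    return base_reward + (bonus if answer_full == answer_reverified else -bonus)

# Example: three sampled responses for one video, two consistent, one not.
rewards = [
    self_verified_reward(1.0, "anomaly", "anomaly"),
    self_verified_reward(0.0, "anomaly", "normal"),
    self_verified_reward(1.0, "normal", "normal"),
]
advantages = grpo_advantages(rewards)
```

Responses scoring above the group mean receive positive advantages and are reinforced; the inconsistent response receives a negative advantage.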

Chao Huang, Benfeng Wang, Jie Wen, Chengliang Liu, Wei Wang, Li Shen, Xiaochun Cao • 2025

Related benchmarks

Task                             Dataset                          Result           Rank
Multi-choice Question Answering  Vad-Reasoning-Plus               MCQ Score: 92.9  27
Open-ended Question Answering    Vad-Reasoning-Plus               BLEU-3: 0.023    27
Video Anomaly Reasoning          Video Anomaly Reasoning (test)   RR Score: 0.286  27
