Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Chain-of-Anomaly Thoughts with Large Vision-Language Models

About

Automated video surveillance with Large Vision-Language Models is limited by their inherent bias towards normality, often failing to detect crimes. While Chain-of-Thought reasoning strategies show significant potential for improving performance in language tasks, the lack of inductive anomaly biases in their reasoning further steers the models towards normal interpretations. To address this, we propose Chain-of-Anomaly-Thoughts (CoAT), a multi-agent reasoning framework that introduces inductive criminal bias in the reasoning process through a final, anomaly-focused classification layer. Our method significantly improves Anomaly Detection, boosting F1-score by 11.8 p.p. on challenging low-resolution footage and Anomaly Classification by 3.78 p.p. in high-resolution videos.

Pedro Domingos, Jo\~ao Pereira, Vasco Lopes, Jo\~ao Neves, David Semedo• 2025

Related benchmarks

TaskDatasetResultRank
Anomaly DetectionUCF-Crime
F1-Score52.08
10
Anomaly ClassificationUCF-Crime
F1-Score42.68
10
Anomaly ClassificationBetterUCF
F1-Score51.89
6
Anomaly DetectionBetterUCF
F1 Score88.84
6
Showing 4 of 4 rows

Other info

Follow for update