Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Anomize: Better Open Vocabulary Video Anomaly Detection

About

Open Vocabulary Video Anomaly Detection (OVVAD) seeks to detect and classify both base and novel anomalies. However, existing methods face two specific challenges related to novel anomalies. The first challenge is detection ambiguity, where the model struggles to assign accurate anomaly scores to unfamiliar anomalies. The second challenge is categorization confusion, where novel anomalies are often misclassified as visually similar base instances. To address these challenges, we explore supplementary information from multiple sources to mitigate detection ambiguity by leveraging multiple levels of visual data alongside matching textual information. Furthermore, we propose incorporating label relations to guide the encoding of new labels, thereby improving alignment between novel videos and their corresponding labels, which helps reduce categorization confusion. The resulting Anomize framework effectively tackles these issues, achieving superior performance on UCF-Crime and XD-Violence datasets, demonstrating its effectiveness in OVVAD.

Fei Li, Wenxuan Liu, Jingjing Chen, Ruixu Zhang, Yuran Wang, Xian Zhong, Zheng Wang• 2025

Related benchmarks

TaskDatasetResultRank
Video Anomaly DetectionUCF-Crime (test)
AUC84.49
122
Video Anomaly DetectionXD-Violence (test)
AP69.31
119
Video Anomaly DetectionUCF-Crime (frame-level)
AUC84.49
32
Frame-level Video Anomaly DetectionXD-Violence
AP69.31
11
Anomaly CategorizationXD-Violence (test)
ACC90.29
2
Anomaly CategorizationUCF-Crime (test)
ACC47.14
2
Showing 6 of 6 rows

Other info

Follow for update