Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning Prompt-Enhanced Context Features for Weakly-Supervised Video Anomaly Detection

About

Video anomaly detection under weak supervision presents significant challenges, particularly due to the lack of frame-level annotations during training. While prior research has utilized graph convolution networks and self-attention mechanisms alongside multiple instance learning (MIL)-based classification loss to model temporal relations and learn discriminative features, these methods often employ multi-branch architectures to capture local and global dependencies separately, resulting in increased parameters and computational costs. Moreover, the coarse-grained interclass separability provided by the binary constraint of MIL-based loss neglects the fine-grained discriminability within anomalous classes. In response, this paper introduces a weakly supervised anomaly detection framework that focuses on efficient context modeling and enhanced semantic discriminability. We present a Temporal Context Aggregation (TCA) module that captures comprehensive contextual information by reusing the similarity matrix and implementing adaptive fusion. Additionally, we propose a Prompt-Enhanced Learning (PEL) module that integrates semantic priors using knowledge-based prompts to boost the discriminative capacity of context features while ensuring separability between anomaly sub-classes. Extensive experiments validate the effectiveness of our method's components, demonstrating competitive performance with reduced parameters and computational effort on three challenging benchmarks: UCF-Crime, XD-Violence, and ShanghaiTech datasets. Notably, our approach significantly improves the detection accuracy of certain anomaly sub-classes, underscoring its practical value and efficacy. Our code is available at: https://github.com/yujiangpu20/PEL4VAD.

Yujiang Pu, Xiaoyu Wu, Lulu Yang, Shengjin Wang• 2023

Related benchmarks

TaskDatasetResultRank
Video Anomaly DetectionUCF-Crime
AUC86.76
218
Video Anomaly DetectionShanghaiTech (test)
AUC0.9814
211
Video Anomaly DetectionXD-Violence (test)
AP85.59
146
Anomaly DetectionUCF-Crime (test)
AUC0.5452
109
Temporal Anomaly LocalizationXD-Violence (test)
AP (%)43.53
18
Video Anomaly DetectionUCF-Crime 6 (clip-level)
Accuracy87.24
16
Video Anomaly DetectionXD-Violence
AP85.59
14
Temporal binary anomaly detectionTAD (test)
AUC86.27
7
Temporal binary anomaly detectionLAD (test)
AUC69.99
7
Temporal binary anomaly detectionDOTA (test)
AUC53.05
6
Showing 10 of 16 rows

Other info

Code

Follow for update