Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Video Event Restoration Based on Keyframes for Video Anomaly Detection

About

Video anomaly detection (VAD) is a significant computer vision problem. Existing deep neural network (DNN) based VAD methods mostly follow the route of frame reconstruction or frame prediction. However, the lack of mining and learning of higher-level visual features and temporal context relationships in videos limits the further performance of these two approaches. Inspired by video codec theory, we introduce a brand-new VAD paradigm to break through these limitations: First, we propose a new task of video event restoration based on keyframes. Encouraging DNN to infer missing multiple frames based on video keyframes so as to restore a video event, which can more effectively motivate DNN to mine and learn potential higher-level visual features and comprehensive temporal context relationships in the video. To this end, we propose a novel U-shaped Swin Transformer Network with Dual Skip Connections (USTN-DSC) for video event restoration, where a cross-attention and a temporal upsampling residual skip connection are introduced to further assist in restoring complex static and dynamic motion object features in the video. In addition, we propose a simple and effective adjacent frame difference loss to constrain the motion consistency of the video sequence. Extensive experiments on benchmarks demonstrate that USTN-DSC outperforms most existing methods, validating the effectiveness of our method.

Zhiwei Yang, Jing Liu, Zhaoyang Wu, Peng Wu, Xiaotao Liu• 2023

Related benchmarks

TaskDatasetResultRank
Video Anomaly DetectionCUHK Avenue (Ave) (test)
AUC89.9
203
Video Anomaly DetectionShanghaiTech (test)--
194
Abnormal Event DetectionUCSD Ped2 (test)--
146
Abnormal Event DetectionUCSD Ped2
AUC98.1
132
Video Anomaly DetectionAvenue (test)
AUC (Micro)89.9
85
Anomaly DetectionShanghaiTech
AUROC0.738
68
Video Anomaly DetectionCUHK Avenue
Frame AUC89.9
65
Video Anomaly DetectionShanghaiTech standard (test)
Frame-Level AUC73.8
50
Video Anomaly DetectionShanghaiTech (SHT) (test)
Frame-level AUC73.8
44
Video Anomaly DetectionUCSD Ped2 (test)
Frame-level AUC98.1
35
Showing 10 of 15 rows

Other info

Follow for update