An Attribute-based Method for Video Anomaly Detection
About
Video anomaly detection (VAD) identifies suspicious events in videos, which is critical for crime prevention and homeland security. In this paper, we propose a simple but highly effective VAD method that relies on attribute-based representations. The base version of our method represents every object by its velocity and pose, and computes anomaly scores by density estimation. Surprisingly, this simple representation is sufficient to achieve state-of-the-art performance in ShanghaiTech, the most commonly used VAD dataset. Combining our attribute-based representations with an off-the-shelf, pretrained deep representation yields state-of-the-art performance with a $99.1\%, 93.7\%$, and $85.9\%$ AUROC on Ped2, Avenue, and ShanghaiTech, respectively.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Anomaly Detection | ShanghaiTech (test) | -- | 194 | |
| Abnormal Event Detection | UCSD Ped2 (test) | -- | 146 | |
| Abnormal Event Detection | UCSD Ped2 | -- | 132 | |
| Video Anomaly Detection | UCF-Crime | -- | 129 | |
| Video Anomaly Detection | UCF-Crime (test) | -- | 122 | |
| Video Anomaly Detection | Avenue (test) | AUC (Micro)93.7 | 85 | |
| Video Anomaly Detection | CUHK Avenue | Frame AUC93.6 | 65 | |
| Video Anomaly Detection | ShanghaiTech | Micro AUC0.859 | 51 | |
| Video Anomaly Detection | ShanghaiTech standard (test) | Frame-Level AUC76.2 | 50 | |
| Video Anomaly Detection | UBnormal (test) | -- | 37 |