Abnormal Event Detection in Videos using Spatiotemporal Autoencoder
About
We present an efficient method for detecting anomalies in videos. Recent applications of convolutional neural networks have shown promises of convolutional layers for object detection and recognition, especially in images. However, convolutional neural networks are supervised and require labels as learning signals. We propose a spatiotemporal architecture for anomaly detection in videos including crowded scenes. Our architecture includes two main components, one for spatial feature representation, and one for learning the temporal evolution of the spatial features. Experimental results on Avenue, Subway and UCSD benchmarks confirm that the detection accuracy of our method is comparable to state-of-the-art methods at a considerable speed of up to 140 fps.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Abnormal Event Detection | UCSD Ped2 | AUC87.4 | 132 | |
| Abnormal Event Detection | UCSD Ped1 | AUC0.899 | 28 | |
| Abnormal Event Detection | Subway Exit | -- | 14 | |
| Abnormal Event Detection | Subway Entrance | -- | 11 | |
| Anomaly Detection | Subway Entrance GT: 66 (test) | Anomalous Event Detected (%)0.61 | 5 | |
| Anomaly Detection | Subway Exit GT: 19 (test) | Anomalous Event Detected Rate18 | 5 | |
| Abnormal Event Detection | CUHK Avenue Old and New | -- | 3 | |
| Anomaly Detection | Avenue GT: 47 (test) | Detected Anomalies Rate44 | 2 | |
| Anomaly Detection | Avenue GT: 14 (smaller set) | -- | 1 |