Learning Regularity in Skeleton Trajectories for Anomaly Detection in Videos
About
Appearance features have been widely used in video anomaly detection even though they contain complex entangled factors. We propose a new method to model the normal patterns of human movements in surveillance video for anomaly detection using dynamic skeleton features. We decompose the skeletal movements into two sub-components: global body movement and local body posture. We model the dynamics and interaction of the coupled features in our novel Message-Passing Encoder-Decoder Recurrent Network. We observed that the decoupled features collaboratively interact in our spatio-temporal model to accurately identify human-related irregular events from surveillance video sequences. Compared to traditional appearance-based models, our method achieves superior outlier detection performance. Our model also offers "open-box" examination and decision explanation made possible by the semantically understandable features and a network architecture supporting interpretability.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Anomaly Detection | ShanghaiTech (test) | AUC0.734 | 194 | |
| Video Anomaly Detection | ShanghaiTech | Micro AUC0.734 | 51 | |
| Video Anomaly Detection | ShanghaiTech standard (test) | Frame-Level AUC73.4 | 50 | |
| Video Anomaly Detection | Avenue | Frame-AUC86.3 | 29 | |
| Video Anomaly Detection | UBnormal | AUC60.6 | 25 | |
| Video Anomaly Detection | ShanghaiTech (SHTech) (test) | AUROC0.734 | 24 | |
| Anomaly Detection | ShanghaiTech Campus (test) | Micro AUROC73.4 | 22 | |
| Video Anomaly Detection | ShanghaiTech classic (test) | AUC73.4 | 16 | |
| Video Anomaly Detection | HR-Avenue | Frame-AUC86.3 | 15 | |
| Anomaly Detection | Kinetics-250 Few vs. Many Random | AUC57 | 12 |