Learning Normal Dynamics in Videos with Meta Prototype Network
About
Frame reconstruction (current or future frame) based on Auto-Encoder (AE) is a popular method for video anomaly detection. With models trained on the normal data, the reconstruction errors of anomalous scenes are usually much larger than those of normal ones. Previous methods introduced the memory bank into AE, for encoding diverse normal patterns across the training videos. However, they are memory-consuming and cannot cope with unseen new scenarios in the testing data. In this work, we propose a dynamic prototype unit (DPU) to encode the normal dynamics as prototypes in real time, free from extra memory cost. In addition, we introduce meta-learning to our DPU to form a novel few-shot normalcy learner, namely Meta-Prototype Unit (MPU). It enables the fast adaption capability on new scenes by only consuming a few iterations of update. Extensive experiments are conducted on various benchmarks. The superior performance over the state-of-the-art demonstrates the effectiveness of our method.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Anomaly Detection | CUHK Avenue (Ave) (test) | AUC89.5 | 203 | |
| Video Anomaly Detection | ShanghaiTech (test) | AUC0.738 | 194 | |
| Abnormal Event Detection | UCSD Ped2 (test) | AUC96.9 | 146 | |
| Abnormal Event Detection | UCSD Ped2 | -- | 132 | |
| Video Anomaly Detection | Avenue (test) | AUC (Micro)89.5 | 85 | |
| Anomaly Detection | Avenue | Frame AUC (Micro)89.5 | 55 | |
| Video Anomaly Detection | ShanghaiTech | Micro AUC0.738 | 51 | |
| Video Anomaly Detection | ShanghaiTech standard (test) | Frame-Level AUC73.8 | 50 | |
| Abnormal Event Detection | UCSD Ped1 (test) | Frame AUC85.1 | 33 | |
| Video Anomaly Detection | ShanghaiTech (SHTech) (test) | AUROC0.738 | 24 |