Rethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditions
About
Continual anomaly detection (CAD) addresses the need for industrial inspection systems to adapt to evolving production conditions, yet existing methods share three critical gaps: unrealistic evaluation, no systematic comparison, and no consideration of edge deployment constraints. We introduce a unified benchmark combining discrete-task evaluation on structural and logical anomalies, a novel continuous drift protocol, the first head-to-head comparison of all published CAD methods, and computational efficiency profiling on edge hardware. Our results reveal that existing CAD methods do not consistently outperform traditional approaches with simple experience replay. Thus motivated, we propose DINOSaur, a training-free method combining a frozen DINOv3 backbone with spatially-indexed coreset memory and neighborhood-restricted anomaly scoring. DINOSaur achieves zero forgetting by construction, outperforms all evaluated methods across all five protocols, and runs at sub-100\,ms inference on an NVIDIA Jetson Orin Nano, with on-device adaptation to new tasks in under 30 seconds.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image-level Anomaly Detection | MVTec-AD (test) | Overall AUROC96.3 | 92 | |
| Image-level Anomaly Detection | MVTec LOCO AD (test) | AUROC76 | 16 | |
| Continual Anomaly Detection | MTD-Color averaged across all tasks | AUROC0.888 | 6 | |
| Continual Anomaly Detection | MTD-Geo averaged across all tasks | AUROC53.2 | 6 | |
| Continual Anomaly Detection | MTD-Blur averaged across all tasks | AUROC86.4 | 6 | |
| Continual Anomaly Detection | MVTec AD | Inference Time (ms)630.6 | 6 | |
| Continual Anomaly Detection | MVTec AD | Inference Latency (ms)91.6 | 5 |