Seeing the Needle in the Haystack: Towards Weakly-Supervised Log Instance Anomaly Localization via Counterfactual Perturbation
About
Log anomaly detection is a critical task for system operations and security assurance. However, in networked systems at scale, log data are generated at massive scale while instance-level annotations are prohibitively expensive, posing great difficulties to fine-grained anomaly localization. To address this challenge, we propose LogMILP (Log anomaly localization based on Multi-Instance Learning enhanced by prototypes and Perturbation), a weakly supervised framework that enables both bag-level anomaly detection and instance-level anomaly localization using only bag-level labels. Our method guides the model to pinpoint the critical log entries using prototype-guided structural modeling with counterfactual perturbation consistency regularization, thereby improving localization reliability and interpretability under coarse-grained supervision. Experimental results on three public datasets demonstrate that LogMILP achieves competitive detection performance while yielding significantly more reliable instance-level localization. Our code is open-sourced at https://github.com/YUK1207/LogMILP.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Log Anomaly Detection | BGL | F1 Score93.42 | 30 | |
| Bag-level Anomaly Detection | SPIRIT | AUC96.52 | 6 | |
| Bag-level Anomaly Detection | Zookeeper | AUC0.9964 | 6 | |
| Instance-level anomaly localization | SPIRIT | Loc@377.86 | 3 | |
| Instance-level anomaly localization | Zookeeper | Loc@389.17 | 3 | |
| Instance-level anomaly localization | BGL | Loc@30.3488 | 3 |