$K^4$: Online Log Anomaly Detection Via Unsupervised Typicality Learning
About
Existing Log Anomaly Detection (LogAD) methods are often slow, dependent on error-prone parsing, and use unrealistic evaluation protocols. We introduce $K^4$, an unsupervised and parser-independent framework for high-performance online detection. $K^4$ transforms arbitrary log embeddings into compact four-dimensional descriptors (Precision, Recall, Density, Coverage) using efficient k-nearest neighbor (k-NN) statistics. These descriptors enable lightweight detectors to accurately score anomalies without retraining. Using a more realistic online evaluation protocol, $K^4$ sets a new state-of-the-art (AUROC: 0.995-0.999), outperforming baselines by large margins while being orders of magnitude faster, with training under 4 seconds and inference as low as 4 $\mu$s.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Log Anomaly Detection | HDFS (test) | AUROC99.6 | 7 | |
| Log Anomaly Detection | BGL (test) | AUROC99.9 | 7 | |
| Log Anomaly Detection | Thunderbird (TB) (test) | AUROC0.999 | 7 | |
| Log Anomaly Detection | HDFS | -- | 6 | |
| Log Anomaly Detection | BGL | -- | 6 | |
| Log Anomaly Detection | Thunderbird | -- | 6 |