Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Beyond Individual Input for Deep Anomaly Detection on Tabular Data

About

Anomaly detection is vital in many domains, such as finance, healthcare, and cybersecurity. In this paper, we propose a novel deep anomaly detection method for tabular data that leverages Non-Parametric Transformers (NPTs), a model initially proposed for supervised tasks, to capture both feature-feature and sample-sample dependencies. In a reconstruction-based framework, we train an NPT to reconstruct masked features of normal samples. In a non-parametric fashion, we leverage the whole training set during inference and use the model's ability to reconstruct the masked features to generate an anomaly score. To the best of our knowledge, this is the first work to successfully combine feature-feature and sample-sample dependencies for anomaly detection on tabular datasets. Through extensive experiments on 31 benchmark tabular datasets, we demonstrate that our method achieves state-of-the-art performance, outperforming existing methods by 2.4% and 1.2% in terms of F1-score and AUROC, respectively. Our ablation study further proves that modeling both types of dependencies is crucial for anomaly detection on tabular data.

Hugo Thimonier, Fabrice Popineau, Arpad Rimmel, Bich-Li\^en Doan• 2023

Related benchmarks

TaskDatasetResultRank
Anomaly DetectionWBC
ROCAUC0.9619
104
Tabular Anomaly Detectionpima
AUC ROC0.7553
70
Tabular Anomaly DetectionBreastW
AUC-ROC0.9848
67
Anomaly DetectionMammography
AUC-ROC0.8873
64
Anomaly Detectionsatellite
AUC80.6
62
Anomaly DetectionShuttle
AUC0.9931
61
Anomaly DetectionSatimage 2
AUC99.95
58
Tabular Anomaly Detectionpendigits
AUC-ROC99.87
56
Tabular Anomaly DetectionWine
AUC-ROC0.9622
56
Tabular Anomaly DetectionVertebral
AUC-ROC53.47
50
Showing 10 of 34 rows

Other info

Follow for update