Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Calibrating Tabular Anomaly Detection via Optimal Transport

About

Tabular anomaly detection (TAD) remains challenging due to the heterogeneity of tabular data: features lack natural relationships, vary widely in distribution and scale, and exhibit diverse types. Consequently, each TAD method makes implicit assumptions about anomaly patterns that work well on some datasets but fail on others, and no method consistently outperforms across diverse scenarios. We present CTAD (Calibrating Tabular Anomaly Detection), a model-agnostic post-processing framework that enhances any existing TAD detector through sample-specific calibration. Our approach characterizes normal data via two complementary distributions, i.e., an empirical distribution from random sampling and a structural distribution from K-means centroids, and measures how adding a test sample disrupts their compatibility using Optimal Transport (OT) distance. Normal samples maintain low disruption while anomalies cause high disruption, providing a calibration signal to amplify detection. We prove that OT distance has a lower bound proportional to the test sample's distance from centroids, and establish that anomalies systematically receive higher calibration scores than normals in expectation, explaining why the method generalizes across datasets. Extensive experiments on 34 diverse tabular datasets with 7 representative detectors spanning all major TAD categories (density estimation, classification, reconstruction, and isolation-based methods) demonstrate that CTAD consistently improves performance with statistical significance. Remarkably, CTAD enhances even state-of-the-art deep learning methods and shows robust performance across diverse hyperparameter settings, requiring no additional tuning for practical deployment.

Hangting Ye, He Zhao.Wei Fan, Xiaozhuang Song, Dandan Guo, Yi Chang, Hongyuan Zha• 2026

Related benchmarks

TaskDatasetResultRank
Anomaly DetectionWBC
ROCAUC1
104
Anomaly DetectionMNIST
AUC97.36
87
Tabular Anomaly Detectionpima
AUC ROC0.7511
70
Tabular Anomaly DetectionBreastW
AUC-ROC0.9965
67
Anomaly DetectionMammography
AUC-ROC0.9129
64
Anomaly Detectionsatellite
AUC87.04
62
Anomaly DetectionShuttle
AUC0.9993
61
Anomaly DetectionSatimage 2
AUC99.83
58
Tabular Anomaly Detectionpendigits
AUC-ROC99.9
56
Tabular Anomaly DetectionWine
AUC-ROC1
56
Showing 10 of 34 rows

Other info

Follow for update