Disentangling Tabular Data Towards Better One-Class Anomaly Detection

About

Tabular anomaly detection under the one-class classification setting is challenging because the model must learn an accurate concept of "normal" from samples of a single class alone, and then use it to separate anomalies from ordinary variation in normal data. Capturing the intrinsic correlation among attributes within normal samples is one promising way to learn this concept. The most recent effort in this direction relies on a learnable mask strategy with a reconstruction task. However, this strategy risks producing uniform masks, i.e., essentially nothing is masked, which makes correlation learning less effective. To address this issue, we presume that attributes related to others in normal samples can be divided into two non-overlapping and correlated subsets, defined as CorrSets, to capture the intrinsic correlation effectively. Accordingly, we introduce a method that disentangles CorrSets from normal tabular data. To our knowledge, this is a pioneering effort to apply the concept of disentanglement to one-class anomaly detection on tabular data. Extensive experiments on 20 tabular datasets show that our method substantially outperforms state-of-the-art methods, with an average performance improvement of 6.1% on AUC-PR and 2.1% on AUC-ROC. Code is available at https://github.com/yjnanan/Disent-AD.
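The abstract reports its gains in AUC-PR and AUC-ROC. As a rough illustration of what these two metrics measure for anomaly scores, here is a minimal pure-Python sketch. The function names `auc_roc` and `auc_pr` and the toy labels and scores are hypothetical and do not come from the paper or its repository. AUC-PR is computed here as average precision, a common convention that the paper may or may not follow exactly.

```python
def auc_roc(labels, scores):
    """AUC-ROC as the probability that a random anomaly (label 1)
    receives a higher score than a random normal sample (label 0).
    Tied scores count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one anomaly and one normal sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def auc_pr(labels, scores):
    """AUC-PR approximated as average precision: the mean of the
    precision values at each rank where an anomaly is retrieved.
    Assumption: scores have no ties (ties are not grouped here)."""
    ranked = sorted(zip(scores, labels), key=lambda t: t[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            precision_sum += hits / rank
    if hits == 0:
        raise ValueError("need at least one anomaly")
    return precision_sum / hits


# Toy example (made-up values): 1 marks an anomaly, higher score = more anomalous.
labels = [0, 0, 0, 0, 1, 0, 1, 0]
scores = [0.10, 0.20, 0.15, 0.30, 0.90, 0.40, 0.35, 0.05]
print(f"AUC-ROC: {auc_roc(labels, scores):.4f}")
print(f"AUC-PR : {auc_pr(labels, scores):.4f}")
```

In one-class evaluation, the detector is trained on normal samples only, and these metrics are computed on a test set containing both normal and anomalous samples. AUC-PR is usually the more sensitive of the two when anomalies are rare, which may be why the two reported gains differ in size.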

Jianan Ye, Zhaorui Tan, Yijie Hu, Xi Yang, Guangliang Cheng, Kaizhu Huang • 2024

Related benchmarks

Task                       Dataset      Result           Rank
Anomaly Detection          WBC          --               104
Tabular Anomaly Detection  pima         --               70
Anomaly Detection          Shuttle      --               61
Anomaly Detection          Satimage 2   --               58
Outlier Detection          Yeast        AUC-PR 0.5125    49
Anomaly Detection          Lympho       --               40
Anomaly Detection          AMAZON       AUPRC 11.82      33
Anomaly Detection          Fraud        AUC-PR 0.6142    31
Anomaly Detection          Optdigits    AUPRC 84.24      27
Anomaly Detection          Thyroid      AUPRC 86.85      27
Showing 10 of 62 rows
