Outlier detection in mixed-attribute data: a semi-supervised approach with fuzzy approximations and relative entropy

About

Outlier detection is a critical task in data mining, aimed at identifying objects that significantly deviate from the norm. Semi-supervised methods improve detection performance by leveraging partially labeled data but typically overlook the uncertainty and heterogeneity of real-world mixed-attribute data. This paper introduces a semi-supervised outlier detection method, namely fuzzy rough sets-based outlier detection (FROD), to effectively handle these challenges. Specifically, we first utilize a small subset of labeled data to construct fuzzy decision systems, through which we introduce the attribute classification accuracy based on fuzzy approximations to evaluate the contribution of attribute sets in outlier detection. Unlabeled data is then used to compute fuzzy relative entropy, which provides a characterization of outliers from the perspective of uncertainty. Finally, we develop the detection algorithm by combining attribute classification accuracy with fuzzy relative entropy. Experimental results on 16 public datasets show that FROD is comparable to or better than leading detection algorithms. All datasets and source code are available at https://github.com/ChenBaiyang/FROD. This manuscript is the accepted author version of a paper published by Elsevier. The final published version is available at https://doi.org/10.1016/j.ijar.2025.109373
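To make the notion of "fuzzy approximations" on mixed-attribute data more concrete, here is a minimal sketch using the standard min/max fuzzy rough set constructions. It is not FROD's exact formulation: the function names (`fuzzy_relation`, `fuzzy_approximations`, `approximation_accuracy`), the similarity measures, and the accuracy ratio are assumptions chosen for illustration. The authors' definitions are in the linked repository and paper.

```python
import numpy as np

def fuzzy_relation(numeric, nominal):
    """Fuzzy similarity matrix over mixed attributes (assumed form).

    Numeric columns are min-max scaled and compared with 1 - |a - b|;
    nominal columns use exact match (1 if equal, else 0). Per-attribute
    similarities are combined with the minimum.
    """
    numeric = np.asarray(numeric, dtype=float)
    nominal = np.asarray(nominal)
    n = numeric.shape[0]
    rel = np.ones((n, n))
    low = numeric.min(axis=0)
    span = numeric.max(axis=0) - low
    span[span == 0] = 1.0  # avoid dividing by zero on constant columns
    scaled = (numeric - low) / span
    for j in range(scaled.shape[1]):
        col = scaled[:, j]
        rel = np.minimum(rel, 1.0 - np.abs(col[:, None] - col[None, :]))
    for j in range(nominal.shape[1]):
        col = nominal[:, j]
        rel = np.minimum(rel, (col[:, None] == col[None, :]).astype(float))
    return rel

def fuzzy_approximations(rel, member):
    """Standard fuzzy lower and upper approximations of a (crisp) class."""
    lower = np.min(np.maximum(1.0 - rel, member[None, :]), axis=1)
    upper = np.max(np.minimum(rel, member[None, :]), axis=1)
    return lower, upper

def approximation_accuracy(rel, labels):
    """Ratio of summed lower to summed upper memberships over all classes.

    An illustrative stand-in for an attribute-set quality score.
    """
    labels = np.asarray(labels)
    num = den = 0.0
    for c in np.unique(labels):
        lower, upper = fuzzy_approximations(rel, (labels == c).astype(float))
        num += lower.sum()
        den += upper.sum()
    return num / den

# Toy labeled subset: one numeric and one nominal attribute.
numeric = np.array([[0.0], [0.1], [0.9], [1.0]])
nominal = np.array([["a"], ["a"], ["b"], ["b"]])
labels = np.array([0, 0, 1, 1])
rel = fuzzy_relation(numeric, nominal)
acc = approximation_accuracy(rel, labels)
```

In this toy case the two attributes separate the labeled classes cleanly, so the lower and upper approximations coincide and the ratio reaches its maximum. An attribute set that mixes the classes yields a lower ratio, which is the intuition behind using such a score to weight attribute sets.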

Baiyang Chen, Zhong Yuan, Zheng Liu, Dezhong Peng, Yongxiang Li, Chang Liu, Guiduo Duan • 2025

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Tabular Anomaly Detection | pima | AUC ROC 0.6434 | 53 |
| Anomaly Detection | Mammography | AUC-ROC 0.8839 | 47 |
| Anomaly Detection | satellite | AUC 82.08 | 41 |
| Anomaly Detection | Satimage 2 | AUC 97.82 | 41 |
| Outlier Detection | Thyroid | AUC 99.29 | 33 |
| Outlier Detection | Mushroom2 | AUC 0.985 | 33 |
| Anomaly Detection | Pageblocks | AUC-ROC 90.79 | 32 |
| Outlier Detection | musk | AP 100 | 22 |
| Outlier Detection | Pageblocks | AP (%) 58.82 | 22 |
| Outlier Detection | musk | AUC 1 | 22 |
Showing 10 of 29 rows
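
The results above report AUC (area under the ROC curve) and AP (average precision), apparently on both a 0 to 1 scale and a 0 to 100 scale depending on the benchmark. As a reading aid, the sketch below shows how both metrics can be computed from outlier scores. The helper names `auc_roc` and `average_precision` and the toy data are illustrative assumptions, not taken from the FROD code or the benchmark definitions.

```python
import numpy as np

def auc_roc(scores, labels):
    """Probability that a random outlier scores above a random inlier.

    Ties count as one half. labels: 1 = outlier, 0 = inlier.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    diff = pos[:, None] - neg[None, :]
    wins = (diff > 0).sum() + 0.5 * (diff == 0).sum()
    return wins / (len(pos) * len(neg))

def average_precision(scores, labels):
    """Mean of the precision values at the rank of each true outlier."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    order = np.argsort(-scores, kind="stable")  # highest score first
    hits = labels[order] == 1
    precision = np.cumsum(hits) / np.arange(1, len(hits) + 1)
    return precision[hits].mean()

# Toy example: four objects, two of them true outliers.
scores = [0.9, 0.8, 0.3, 0.1]
labels = [1, 0, 1, 0]
auc = auc_roc(scores, labels)            # fraction of correctly ordered pairs
ap = average_precision(scores, labels)   # precision averaged over outlier ranks
auc_percent = 100 * auc                  # the same value on a 0 to 100 scale
```

A detector that ranks every outlier above every inlier scores 1 (or 100 in percent) on both metrics.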
