Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Consistency-guided semi-supervised outlier detection in heterogeneous data using fuzzy rough sets

About

Outlier detection aims to find samples that behave differently from the majority of the data. Semi-supervised detection methods can utilize the supervision of partial labels, thus reducing false positive rates. However, most of the current semi-supervised methods focus on numerical data and neglect the heterogeneity of data information. In this paper, we propose a consistency-guided outlier detection algorithm (COD) for heterogeneous data with the fuzzy rough set theory in a semi-supervised manner. First, a few labeled outliers are leveraged to construct label-informed fuzzy similarity relations. Next, the consistency of the fuzzy decision system is introduced to evaluate attributes' contributions to knowledge classification. Subsequently, we define the outlier factor based on the fuzzy similarity class and predict outliers by integrating the classification consistency and the outlier factor. The proposed algorithm is extensively evaluated on 15 freshly proposed datasets. Experimental results demonstrate that COD is better than or comparable with the leading outlier detectors. This manuscript is the accepted author version of a paper published by Elsevier. The final published version is available at https://doi.org/10.1016/j.asoc.2024.112070

Baiyang Chen, Zhong Yuan, Dezhong Peng, Xiaoliang Chen, Hongmei Chen• 2025

Related benchmarks

TaskDatasetResultRank
Tabular Anomaly DetectionBreastW
AUC-ROC0.989
50
Tabular Anomaly Detectionionosphere
AUC-ROC78.5
50
Anomaly DetectionMammography
AUC-ROC0.855
47
Outlier DetectionMushroom2
AUC0.976
33
Outlier DetectionThyroid
AUC74
33
Anomaly DetectionPageblocks
AUC-ROC0.889
32
Anomaly DetectionWilt
AUC-ROC58.3
27
Outlier DetectionAudiology
AUC-PR0.816
22
Outlier DetectionArrhythmia
AUC-ROC81.1
22
Outlier DetectionCredita
AUC-PR0.845
22
Showing 10 of 38 rows

Other info

Follow for update