Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DCFO: Density-Based Counterfactuals for Outliers -- Additional Material

About

Outlier detection identifies data points that significantly deviate from the majority of the data distribution. Explaining outliers is crucial for understanding the underlying factors that contribute to their detection, validating their significance, and identifying potential biases or errors. Effective explanations provide actionable insights, facilitating preventive measures to avoid similar outliers in the future. Counterfactual explanations clarify why specific data points are classified as outliers by identifying minimal changes required to alter their prediction. Although valuable, most existing counterfactual explanation methods overlook the unique challenges posed by outlier detection, and fail to target classical, widely adopted outlier detection algorithms. Local Outlier Factor (LOF) is one the most popular unsupervised outlier detection methods, quantifying outlierness through relative local density. Despite LOF's widespread use across diverse applications, it lacks interpretability. To address this limitation, we introduce Density-based Counterfactuals for Outliers (DCFO), a novel method specifically designed to generate counterfactual explanations for LOF. DCFO partitions the data space into regions where LOF behaves smoothly, enabling efficient gradient-based optimisation. Extensive experimental validation on 50 OpenML datasets demonstrates that DCFO consistently outperforms benchmarked competitors, offering superior proximity and validity of generated counterfactuals.

Tommaso Amico, Pernille Matthews, Lena Krieger, Arthur Zimek, Ira Assent• 2025

Related benchmarks

TaskDatasetResultRank
Counterfactual ExplanationDiabetes
Validity100
28
Counterfactual Explanationbaskball
Validity1
24
Counterfactual ExplanationGlass
Validity100
24
Counterfactual Explanationchscase census2
Validity1
24
Counterfactual Explanationliver-disorders
Validity100
20
Counterfactual Explanation GenerationautoPrice
Validity100
20
Counterfactual Explanation Generationblood-transfusion
Validity100
20
Counterfactual Explanation Generationbodyfat
Validity100
20
Counterfactual Explanation Generationchscase census6
Validity1
20
Counterfactual Explanation Generationchscase vine1
Validity100
20
Showing 10 of 108 rows
...

Other info

Follow for update