Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ExDBSCAN: Explaining DBSCAN with Counterfactual Reasoning -- Additional Material

About

Clustering is an unsupervised technique for grouping data points by similarity. While explainability methods exist for supervised machine learning, they are not directly applicable to clustering, making it challenging to understand cluster assignments. This interpretability gap is particularly evident in the popular density-based method DBSCAN, which assigns points as inliers (cluster members in dense regions) or outliers (noise points in sparse regions). DBSCAN does not provide insight into why a particular point receives its assignment or whether its assignment is robust to small changes in the data. To address the lack of explainability, we introduce ExDBSCAN, a density-aware, post-hoc explanation method. ExDBSCAN offers actionable counterfactual explanations, with theoretical guarantees for validity. It generates multiple counterfactuals using a density connected weighted graph, adopting a physics-inspired model that repels counterfactual candidates from one another (diversity), while pulling them toward the instance to explain (proximity). Empirical evaluation on 30 tabular datasets comparing against four baselines shows that ExDBSCAN outperforms all baselines while attaining perfect validity and retrieving diverse, proximal counterfactuals.

Pernille Matthews, Lena Krieger, Tommaso Amico, Artur Zimek, Thomas Seidl, Ira Assent• 2026

Related benchmarks

TaskDatasetResultRank
Counterfactual Explanation GenerationIris
L2 Distance1.5
30
Counterfactual Explanation GenerationDiabetes
L2 Error2.9
29
Counterfactual ExplanationDiabetes
Validity100
28
Counterfactual ExplanationGlass
Validity100
24
Counterfactual Explanationbaskball
Validity1
24
Counterfactual Explanationchscase census2
Validity1
24
Counterfactual Explanation Generationblood-transfusion
Validity100
20
Counterfactual Explanation Generationchscase vine1
Validity100
20
Counterfactual Explanation Generationlongley
Validity100
20
Counterfactual Explanation GenerationautoPrice
Validity100
20
Showing 10 of 133 rows
...

Other info

Follow for update