Assigning Confidence: K-partition Ensembles

About

Clustering is widely used for unsupervised structure discovery, yet it offers limited insight into how reliable each individual assignment is. Diagnostics, such as convergence behavior or objective values, may reflect global quality, but they do not indicate whether particular instances are assigned confidently, especially for initialization-sensitive algorithms like k-means. This assignment-level instability can undermine both accuracy and robustness. Ensemble approaches improve global consistency by aggregating multiple runs, but they typically lack tools for quantifying pointwise confidence in a way that combines cross-run agreement with geometric support from the learned cluster structure. We introduce CAKE (Confidence in Assignments via K-partition Ensembles), a framework that evaluates each point using two complementary statistics computed over a clustering ensemble: assignment stability and consistency of local geometric fit. These are combined into a single, interpretable score in [0,1]. Our theoretical analysis shows that CAKE remains effective under noise and separates stable from unstable points. Experiments on synthetic and real-world datasets indicate that CAKE effectively highlights ambiguous points and stable core members, providing a confidence ranking that can guide filtering or prioritization to improve clustering quality.
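The abstract does not give CAKE's formulas, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a k-means ensemble over different seeds, measures stability as the share of runs that agree with a point's majority label (after a simple majority-overlap label alignment), measures geometric fit as a centroid-margin ratio, and combines the two by multiplication. All function names, the alignment rule, the margin formula, and the product combination are illustrative assumptions.

```python
import math
import random
from collections import Counter

def kmeans(points, k, seed, iters=50):
    """Plain Lloyd's k-means on a list of tuples; returns (labels, centers)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points]
        new_centers = []
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            # Keep the old center if a cluster becomes empty.
            new_centers.append(
                tuple(sum(x) / len(members) for x in zip(*members)) if members else centers[c]
            )
        if new_centers == centers:
            break
        centers = new_centers
    # Final labels are always the nearest final center.
    labels = [min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points]
    return labels, centers

def align(labels, reference, k):
    """Map each cluster id to the reference label most of its members carry (assumed rule)."""
    mapping = {}
    for c in range(k):
        overlap = Counter(r for l, r in zip(labels, reference) if l == c)
        mapping[c] = overlap.most_common(1)[0][0] if overlap else c
    return [mapping[l] for l in labels]

def confidence_scores(points, k, n_runs=20):
    """Per-point score in [0, 1]: stability times mean centroid margin (assumed combination)."""
    if k < 2:
        raise ValueError("k must be at least 2 to compute a margin")
    runs = [kmeans(points, k, seed) for seed in range(n_runs)]
    reference = runs[0][0]
    aligned = [align(labels, reference, k) for labels, _ in runs]
    scores = []
    for i, p in enumerate(points):
        # Stability: fraction of runs agreeing with the point's majority label.
        votes = Counter(a[i] for a in aligned)
        stability = votes.most_common(1)[0][1] / n_runs
        # Geometric fit: how much closer the nearest center is than the runner-up.
        fits = []
        for _, centers in runs:
            d1, d2 = sorted(math.dist(p, c) for c in centers)[:2]
            fits.append((d2 - d1) / d2 if d2 > 0 else 0.0)
        scores.append(stability * sum(fits) / len(fits))
    return scores
```

With two well-separated groups and one point halfway between them, this sketch should rank the in-between point well below the core members of either group. A low score then marks a candidate for filtering or manual review, in the spirit of the confidence ranking the abstract describes.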

Aggelos Semoglou, John Pavlopoulos • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
| Clustering | DG | ARI 79.3 | 7 |
| Clustering | NG | ARI 0.494 | 7 |
| Clustering | FM | ARI 47.4 | 7 |
| Clustering | PD | ARI 0.759 | 7 |
| Clustering | LT | ARI 0.222 | 7 |
| Clustering | DG various subsets | Avg Silhouette 0.237 | 7 |
| Clustering | SA | ARI 0.682 | 7 |
| Clustering | BC | Avg Silhouette Score 0.557 | 7 |
| Clustering | NG | Avg Silhouette Score 0.108 | 7 |
| Clustering | PD | Avg Silhouette 0.394 | 7 |
Showing 10 of 28 rows
