Towards Robust and Scalable Density-based Clustering via Graph Propagation
About
We present \textit{CluProp}, a novel framework that reimagines varied-density clustering in high-dimensional spaces as a label propagation process over neighborhood graphs. Our approach formally bridges the gap between density-based clustering and graph connectivity, leveraging efficient propagation mechanisms from network science to mitigate the parameter sensitivity inherent in traditional density-based methods. Specifically, we introduce a deterministic density-based propagation strategy to ensure scalable neighborhood identification. The framework is agnostic to the choice of distance metric and exhibits superior performance on large-scale data, processing millions of points in minutes while consistently outperforming existing baselines in accuracy.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Clustering | pendigits | ARI76 | 49 | |
| Clustering | Dermatology | AMI0.89 | 26 | |
| Clustering | SEMEION | ARI58 | 19 | |
| Clustering | MULTI-FEAT | AMI93 | 18 | |
| Clustering | Letters | AMI63 | 16 | |
| Clustering | MNIST | AMI89 | 10 | |
| Clustering | USPS | AMI87 | 9 | |
| Clustering | Soybean | AMI72 | 9 | |
| Clustering | MNIST 8M | NMI81 | 8 | |
| Clustering | MNIST8M | AMI80 | 4 |