SPORE: Skeleton Propagation Over Recalibrating Expansions
About
Clustering is a foundational task in data analysis, yet most algorithms impose rigid assumptions on cluster geometry: centroid-based methods favor convex structures, while density-based approaches break down under variable local density or moderate dimensionality. This paper introduces SPORE (Skeleton Propagation Over Recalibrating Expansions), a classical clustering algorithm built to handle arbitrary geometry without relying on global density parameters. SPORE grows clusters through a nearest-neighbor graph, admitting new points based on each cluster's own evolving distance statistics, with density-ordered seeding enabling recovery of nested and asymmetrically separated structures. A refinement stage exploits initial over-segmentation, propagating high-confidence cluster skeletons outward to resolve ambiguous boundaries in low-contrast regions. Across 28 diverse benchmark datasets, SPORE achieves a statistically significant improvement in ARI-based recovery capacity over all evaluated baselines, with strong performance accessible within ten evaluations of a fixed hyperparameter grid.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Clustering | Wine | ARI0.84 | 43 | |
| Clustering | pendigits | NMI (%)85 | 32 | |
| Clustering | E.coli | ARI0.72 | 24 | |
| Clustering | Digits | ARI0.85 | 23 | |
| Clustering | SEEDS | -- | 22 | |
| Clustering | SEEDS | ARI0.75 | 17 | |
| Clustering | WDBC | ARI0.65 | 17 | |
| Clustering | pendigits | Average Time0.36 | 14 | |
| Clustering | Fashion | Running Time7.31 | 10 | |
| Clustering | Pathbased | ARI93 | 9 |