Scalable unsupervised feature selection via weight stability
About
Unsupervised feature selection is critical for improving clustering performance in high-dimensional data, where irrelevant features can obscure meaningful structure. In this work, we introduce the Minkowski weighted k-means++, a novel initialisation strategy for the Minkowski Weighted k-means. Our initialisation selects centroids probabilistically using feature relevance estimates derived from the data itself. Building on this, we propose two new feature selection algorithms, FS-MWK++, which aggregates feature weights across a range of Minkowski exponents to identify stable and informative features, and SFS-MWK++, a scalable variant based on subsampling. We support our approach with a theoretical analysis, demonstrating that, under explicit assumptions on noise features and cluster structure, relevant features are assigned consistently higher weights than noise features across a range of Minkowski exponents. Our software can be found at https://github.com/xzhang4-ops1/FSMWK.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Feature Selection | Synthetic 2000x20-5 +10NF | Mean Selection Rate100 | 8 | |
| Feature Selection | Synthetic 2000x20-20 +10NF | Mean Correct Feature Proportion100 | 8 | |
| Feature Selection | Synthetic 2000x30-5 +15NF | Mean Correct Feature Proportion100 | 8 | |
| Feature Selection | Synthetic 2000x30-10 +15NF | Mean Correct Feature Proportion100 | 8 | |
| Feature Selection | 2000x30-20 +15NF synthetic | Mean Proportion Correct Features100 | 8 | |
| Feature Selection | 1000x4-3 +2NF synthetic | Mean Proportion Correct96 | 8 | |
| Feature Selection | 2NF synthetic 1000x4-5 | Mean Correct Features Proportion99 | 8 | |
| Feature Selection | Synthetic 1000x4-10 +2NF | Mean Selection Proportion98 | 8 | |
| Feature Selection | Synthetic 1000x10-3 +5NF | Mean Correct Feature Proportion100 | 8 | |
| Feature Selection | 1000x10-5 +5NF synthetic | Mean Proportion Correct Features100 | 8 |