Unsupervised Feature Selection via Robust Autoencoder and Adaptive Graph Learning
About
Effective feature selection is essential for high-dimensional data analysis and machine learning. Unsupervised feature selection (UFS) aims to simultaneously cluster data and identify the most discriminative features. Most existing UFS methods linearly project features into a pseudo-label space for clustering, but they suffer from two critical limitations: (1) an oversimplified linear mapping that fails to capture complex feature relationships, and (2) an assumption of uniform cluster distributions, ignoring outliers prevalent in real-world data. To address these issues, we propose the Robust Autoencoder-based Unsupervised Feature Selection (RAEUFS) model, which leverages a deep autoencoder to learn nonlinear feature representations while inherently improving robustness to outliers. We further develop an efficient optimization algorithm for RAEUFS. Extensive experiments demonstrate that our method outperforms state-of-the-art UFS approaches in both clean and outlier-contaminated data settings.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Clustering | COIL-20 | ACC61.02 | 47 | |
| Clustering | lung clean | ACC0.7108 | 4 | |
| Clustering | Isolet clean | ACC60.09 | 4 | |
| Clustering | WarpPIE10P clean | Accuracy29.82 | 4 | |
| Clustering | USPS clean | Accuracy65.78 | 4 | |
| Clustering | jaffe (30% outliers) | ACC89.76 | 4 | |
| Clustering | Isolet 30% outliers | Accuracy0.6195 | 4 | |
| Clustering | COIL20 (30% outliers) | Accuracy66.85 | 4 | |
| Clustering | USPS (30% outliers) | Accuracy64.56 | 4 | |
| Clustering | Jaffe clean | ACC82.25 | 4 |