Structured Graph Learning for Scalable Subspace Clustering: From Single-view to Multi-view
About
Graph-based subspace clustering methods have exhibited promising performance. However, they still suffer some of these drawbacks: encounter the expensive time overhead, fail in exploring the explicit clusters, and cannot generalize to unseen data points. In this work, we propose a scalable graph learning framework, seeking to address the above three challenges simultaneously. Specifically, it is based on the ideas of anchor points and bipartite graph. Rather than building a $n\times n$ graph, where $n$ is the number of samples, we construct a bipartite graph to depict the relationship between samples and anchor points. Meanwhile, a connectivity constraint is employed to ensure that the connected components indicate clusters directly. We further establish the connection between our method and the K-means clustering. Moreover, a model to process multi-view data is also proposed, which is linear scaled with respect to $n$. Extensive experiments demonstrate the efficiency and effectiveness of our approach with respect to many state-of-the-art clustering methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Clustering | STL-10 | -- | 229 | |
| Fair Graph Clustering | SBM uni. xi in [0.4, 0.6], n = 1000 | Cross-Entropy0.51 | 12 | |
| Clustering | MTFL | Balance38.9 | 12 | |
| Clustering | Reverse MNIST | Balance43.8 | 12 | |
| Fair Graph Clustering | SBM uni. xi in [0, 0.2], n = 1000 | Clustering Error (CE)0.64 | 12 | |
| Fair Graph Clustering | SBM uni. xi in [0, 0.2], n = 5000 | Clustering Error (CE)0.67 | 12 | |
| Fair Graph Clustering | SBM uni. xi in [0.4, 0.6], n = 5000 | Clustering Error (CE)0.69 | 12 | |
| Clustering | COIL | -- | 12 | |
| Clustering | Wine | AMI0.97 | 9 | |
| Clustering | Foresttype | AMI0.66 | 9 |