Graph Degree Linkage: Agglomerative Clustering on a Directed Graph
About
This paper proposes a simple but effective graph-based agglomerative algorithm, for clustering high-dimensional data. We explore the different roles of two fundamental concepts in graph theory, indegree and outdegree, in the context of clustering. The average indegree reflects the density near a sample, and the average outdegree characterizes the local geometry around a sample. Based on such insights, we define the affinity measure of clusters via the product of average indegree and average outdegree. The product-based affinity makes our algorithm robust to noise. The algorithm has three main advantages: good performance, easy implementation, and high computational efficiency. We test the algorithm on two fundamental computer vision problems: image clustering and object matching. Extensive experiments demonstrate that it outperforms the state-of-the-arts in both applications.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Clustering | MNIST (test) | NMI0.864 | 122 | |
| Clustering | MNIST (full) | Accuracy11.3 | 98 | |
| Clustering | Fashion MNIST | NMI66 | 95 | |
| Clustering | MNIST | NMI0.91 | 92 | |
| Clustering | USPS | NMI82.4 | 82 | |
| Image Clustering | USPS | NMI0.854 | 43 | |
| Clustering | USPS | Accuracy0.867 | 36 | |
| Image Clustering | COIL-20 | NMI0.945 | 29 | |
| Clustering | CMU-PIE | NMI93.4 | 29 | |
| Image Clustering | COIL-100 | NMI0.954 | 28 |