Deep Fusion Clustering Network
About
Deep clustering is a fundamental yet challenging task for data analysis. Recently we witness a strong tendency of combining autoencoder and graph neural networks to exploit structure information for clustering performance enhancement. However, we observe that existing literature 1) lacks a dynamic fusion mechanism to selectively integrate and refine the information of graph structure and node attributes for consensus representation learning; 2) fails to extract information from both sides for robust target distribution (i.e., "groundtruth" soft labels) generation. To tackle the above issues, we propose a Deep Fusion Clustering Network (DFCN). Specifically, in our network, an interdependency learning-based Structure and Attribute Information Fusion (SAIF) module is proposed to explicitly merge the representations learned by an autoencoder and a graph autoencoder for consensus representation learning. Also, a reliable target distribution generation measure and a triplet self-supervision strategy, which facilitate cross-modality information exploitation, are designed for network training. Extensive experiments on six benchmark datasets have demonstrated that the proposed DFCN consistently outperforms the state-of-the-art deep clustering methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Node Clustering | Cora | NMI19.36 | 168 | |
| Node Clustering | Citeseer | NMI43.9 | 140 | |
| Graph Clustering | AMAP | Accuracy76.88 | 35 | |
| Graph Clustering | Wiki | ARI17.17 | 27 | |
| Node Clustering | PubMed (test) | Accuracy68.89 | 25 | |
| Clustering | ACM (test) | Accuracy (ACC)90.9 | 15 | |
| Clustering | DBLP (test) | Accuracy76 | 15 | |
| Clustering | CITE (test) | Accuracy69.5 | 15 | |
| Clustering | AMAP (test) | Accuracy76.88 | 15 | |
| Attributed Graph Clustering | Cornell | Accuracy39.72 | 12 |