$\texttt{SynC}$: Synergistic Boosting of Structure and Representation for Deep Graph Clustering
About
Employing graph neural networks (GNNs) for graph clustering has shown promising results in deep graph clustering. However, existing methods disregard the reciprocal relationship between representation learning and structure augmentation: the more homogeneous the graph, the more cohesive the node representations; the more cohesive the node representations, the more reliable the structure augmentation becomes. Moreover, the generalization ability of existing GNN-based models on the low homophily graph is relatively poor. To this end, we propose a graph clustering framework named Synergistic Deep Graph Clustering Network (SynC). SynC employs a Transform Input Graph Auto-Encoder (TIGAE) to obtain high-quality embeddings via mitigating the representations collapse issue of GAE for guiding structure augmentation. Then, we re-capture neighborhood representations on the refined graph to obtain clustering-friendly embeddings and conduct self-supervised clustering. Notably, these two stages share weights, resulting in synergistic boosting while significantly reducing the number of model parameters. Additionally, we introduce a structure fine-tuning strategy to improve the model's generalization on the low homophily graph. Extensive experiments on benchmark datasets demonstrate the superiority of SynC. The code is released at GitHub.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Node Clustering | Cora (test) | Accuracy78.58 | 27 | |
| Node Clustering | PubMed (test) | Accuracy71.96 | 25 | |
| Clustering | ACM (test) | Accuracy (ACC)92.73 | 15 | |
| Clustering | DBLP (test) | Accuracy83.48 | 15 | |
| Clustering | AMAP (test) | Accuracy82.48 | 15 | |
| Clustering | CITE (test) | Accuracy71.77 | 15 | |
| Clustering | UAT (test) | ACC60.39 | 10 | |
| Graph Clustering | ACM | Pre-training Time (s)2.7 | 5 |