Revisiting Self-Supervised Heterogeneous Graph Learning from Spectral Clustering Perspective
About
Self-supervised heterogeneous graph learning (SHGL) has shown promising potential in diverse scenarios. However, while existing SHGL methods share a similar essential with clustering approaches, they encounter two significant limitations: (i) noise in graph structures is often introduced during the message-passing process to weaken node representations, and (ii) cluster-level information may be inadequately captured and leveraged, diminishing the performance in downstream tasks. In this paper, we address these limitations by theoretically revisiting SHGL from the spectral clustering perspective and introducing a novel framework enhanced by rank and dual consistency constraints. Specifically, our framework incorporates a rank-constrained spectral clustering method that refines the affinity matrix to exclude noise effectively. Additionally, we integrate node-level and cluster-level consistency constraints that concurrently capture invariant and clustering information to facilitate learning in downstream tasks. We theoretically demonstrate that the learned representations are divided into distinct partitions based on the number of classes and exhibit enhanced generalization ability across tasks. Experimental results affirm the superiority of our method, showcasing remarkable improvements in several downstream tasks compared to existing methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Node Classification | ACM | Macro F192.7 | 104 | |
| Node Classification | DBLP | Micro-F194.7 | 94 | |
| Node Clustering | ACM | ARI72.7 | 57 | |
| Node Classification | Aminer | Micro F186.8 | 46 | |
| Node Clustering | DBLP | NMI77.1 | 39 | |
| Node Classification | Yelp | Macro F1 Score93 | 15 | |
| Node Classification | Photo homogeneous (test) | Macro F1 Score91.9 | 12 | |
| Node Classification | Computers homogeneous (test) | Macro F185.9 | 12 | |
| Node Clustering | Yelp | NMI41.2 | 11 | |
| Node Clustering | Aminer | NMI42.4 | 11 |