Hypergraph Self-supervised Learning with Sampling-efficient Signals

About

Self-supervised learning (SSL) provides a promising alternative for representation learning on hypergraphs without costly labels. However, existing hypergraph SSL models are mostly based on contrastive methods with the instance-level discrimination strategy, suffering from two significant limitations: (1) They select negative samples arbitrarily, which is unreliable in deciding similar and dissimilar pairs, causing training bias. (2) They often require a large number of negative samples, resulting in expensive computational costs. To address the above issues, we propose SE-HSSL, a hypergraph SSL framework with three sampling-efficient self-supervised signals. Specifically, we introduce two sampling-free objectives leveraging the canonical correlation analysis as the node-level and group-level self-supervised signals. Additionally, we develop a novel hierarchical membership-level contrast objective motivated by the cascading overlap relationship in hypergraphs, which can further reduce membership sampling bias and improve the efficiency of sample utilization. Through comprehensive experiments on 7 real-world hypergraphs, we demonstrate the superiority of our approach over the state-of-the-art method in terms of both effectiveness and efficiency.

Fan Li, Xiaoyang Wang, Dawei Cheng, Wenjie Zhang, Ying Zhang, Xuemin Lin• 2024

Related benchmarks

Task	Dataset	Result
Node Classification	Citeseer	Accuracy65.83	1037
Node Classification	Photo	Accuracy51.8	254
Node Classification	Computers	Accuracy41.63	145
Node Clustering	Citeseer	NMI43.8	140
Node Classification	Cora	Accuracy64.3	134
Node Classification	History	Accuracy74.88	29
Hyperedge prediction	Citeseer	Accuracy73.09	18
Hyperedge prediction	Cora	Accuracy65.86	18
Hyperedge prediction	Photo	Accuracy71.27	15
Hyperedge prediction	Computers	Accuracy71.19	15

Showing 10 of 18 rows

Other info

Follow for update

@wizwand_team Discord