Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Revisiting Modularity Maximization for Graph Clustering: A Contrastive Learning Perspective

About

Graph clustering, a fundamental and challenging task in graph mining, aims to classify nodes in a graph into several disjoint clusters. In recent years, graph contrastive learning (GCL) has emerged as a dominant line of research in graph clustering and advances the new state-of-the-art. However, GCL-based methods heavily rely on graph augmentations and contrastive schemes, which may potentially introduce challenges such as semantic drift and scalability issues. Another promising line of research involves the adoption of modularity maximization, a popular and effective measure for community detection, as the guiding principle for clustering tasks. Despite the recent progress, the underlying mechanism of modularity maximization is still not well understood. In this work, we dig into the hidden success of modularity maximization for graph clustering. Our analysis reveals the strong connections between modularity maximization and graph contrastive learning, where positive and negative examples are naturally defined by modularity. In light of our results, we propose a community-aware graph clustering framework, coined MAGI, which leverages modularity maximization as a contrastive pretext task to effectively uncover the underlying information of communities in graphs, while avoiding the problem of semantic drift. Extensive experiments on multiple graph datasets verify the effectiveness of MAGI in terms of scalability and clustering performance compared to state-of-the-art graph clustering methods. Notably, MAGI easily scales a sufficiently large graph with 100M nodes while outperforming strong baselines.

Yunfei Liu, Jintang Li, Yuehe Chen, Ruofan Wu, Ericbk Wang, Jing Zhou, Sheng Tian, Shuheng Shen, Xing Fu, Changhua Meng, Weiqiang Wang, Liang Chen• 2024

Related benchmarks

TaskDatasetResultRank
Node ClusteringCora
Accuracy76
115
Node ClusteringCiteseer
NMI42.8
110
Node ClusteringACM
ARI70.9
57
ClusteringDBLP
Accuracy76.4
27
Graph ClusteringPubmed
Accuracy63.3
24
Graph ClusteringWiki
ARI29.3
15
Community DetectionCora
Topsis Score0.7878
13
Community DetectionwikiCS
Topsis Score0.852
13
Community DetectionCiteseer
Topsis Score0.7963
13
Community DetectionPubmed
Topsis Score0.6958
13
Showing 10 of 21 rows

Other info

Follow for update