Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Disentangling Homophily and Heterophily in Multimodal Graph Clustering

About

Multimodal graphs, which integrate unstructured heterogeneous data with structured interconnections, offer substantial real-world utility but remain insufficiently explored in unsupervised learning. In this work, we initiate the study of multimodal graph clustering, aiming to bridge this critical gap. Through empirical analysis, we observe that real-world multimodal graphs often exhibit hybrid neighborhood patterns, combining both homophilic and heterophilic relationships. To address this challenge, we propose a novel framework -- \textsc{Disentangled Multimodal Graph Clustering (DMGC)} -- which decomposes the original hybrid graph into two complementary views: (1) a homophily-enhanced graph that captures cross-modal class consistency, and (2) heterophily-aware graphs that preserve modality-specific inter-class distinctions. We introduce a \emph{Multimodal Dual-frequency Fusion} mechanism that jointly filters these disentangled graphs through a dual-pass strategy, enabling effective multimodal integration while mitigating category confusion. Our self-supervised alignment objectives further guide the learning process without requiring labels. Extensive experiments on both multimodal and multi-relational graph datasets demonstrate that DMGC achieves state-of-the-art performance, highlighting its effectiveness and generalizability across diverse settings. Our code is available at https://github.com/Uncnbb/DMGC.

Zhaochen Guo, Zhixiang Shen, Xuanting Xie, Liangjian Wen, Zhao Kang• 2025

Related benchmarks

TaskDatasetResultRank
Node ClassificationMovies
Accuracy57.41
82
Node ClassificationGrocery
Accuracy81.55
71
Node ClusteringRedditS
NMI89.62
40
Modal RetrievalEle-fashion
MRR91.34
31
Link PredictionBili Dance
MRR41.37
27
Link PredictionCloth
MRR53.21
26
Graph-to-ImageSemArt
CLIP-S Score62.15
26
Node ClassificationGoodreads
Accuracy65.18
26
Node ClassificationRedditS
Accuracy91.95
23
Modality MatchingBili_music
Score78.3
18
Showing 10 of 34 rows

Other info

Follow for update