Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Modality-Independent Graph Neural Networks with Global Transformers for Multimodal Recommendation

About

Multimodal recommendation systems can learn users' preferences from existing user-item interactions as well as the semantics of multimodal data associated with items. Many existing methods model this through a multimodal user-item graph, approaching multimodal recommendation as a graph learning task. Graph Neural Networks (GNNs) have shown promising performance in this domain. Prior research has capitalized on GNNs' capability to capture neighborhood information within certain receptive fields (typically denoted by the number of hops, $K$) to enrich user and item semantics. We observe that the optimal receptive fields for GNNs can vary across different modalities. In this paper, we propose GNNs with Modality-Independent Receptive Fields, which employ separate GNNs with independent receptive fields for different modalities to enhance performance. Our results indicate that the optimal $K$ for certain modalities on specific datasets can be as low as 1 or 2, which may restrict the GNNs' capacity to capture global information. To address this, we introduce a Sampling-based Global Transformer, which utilizes uniform global sampling to effectively integrate global information for GNNs. We conduct comprehensive experiments that demonstrate the superiority of our approach over existing methods. Our code is publicly available at https://github.com/CrawlScript/MIG-GT.

Jun Hu, Bryan Hooi, Bingsheng He, Yinwei Wei• 2024

Related benchmarks

TaskDatasetResultRank
RecommendationAmazon Sports (test)
Recall@107.53
57
RecommendationAmazon Baby (test)
Recall@200.1021
57
Multimodal RecommendationAmazon Baby (test)
Recall@106.65
54
Node ClassificationMovies
Accuracy54.93
47
Multimodal RecommendationSports Amazon (test)
Recall@107.53
39
Modal RetrievalEle-fashion
MRR92.54
31
Node ClusteringRedditS
NMI86.25
31
RecommendationAmazon Clothing (test)
Recall@106.36
27
Link PredictionBili Dance
MRR37.82
27
Multimodal RecommendationAmazon Clothing (test)
Recall@106.36
25
Showing 10 of 20 rows

Other info

Code

Follow for update