Improving Multi-modal Recommender Systems by Denoising and Aligning Multi-modal Content and User Feedback

About

Multi-modal recommender systems (MRSs) are pivotal in diverse online web platforms and have garnered considerable attention in recent years. However, previous studies overlook the challenges of (1) noisy multi-modal content, (2) noisy user feedback, and (3) aligning multi-modal content with user feedback. In order to tackle these challenges, we propose Denoising and Aligning Multi-modal Recommender System (DA-MRS). To mitigate multi-modal noise, DA-MRS first constructs item-item graphs determined by consistent content similarity across modalities. To denoise user feedback, DA-MRS associates the probability of observed feedback with multi-modal content and devises a denoised BPR loss. Furthermore, DA-MRS implements Alignment guided by User preference to enhance task-specific item representation and Alignment guided by graded Item relations to provide finer-grained alignment. Extensive experiments verify that DA-MRS is a plug-and-play framework and achieves significant and consistent improvements across various datasets, backbone models, and noisy scenarios.

Guipeng Xv, Xinyu Li, Ruobing Xie, Chen Lin, Chong Liu, Feng Xia, Zhanhui Kang, Leyu Lin• 2024

Related benchmarks

Task	Dataset	Result
Multimodal Recommendation	Amazon Baby (test)	Recall@106.5	66
Recommendation	Amazon Baby (test)	Recall@200.0947	57
Sequential Recommendation	Amazon Office (test)	--	56
Multimodal Recommendation	Sports Amazon (test)	Recall@107.51	51
Multimodal Recommendation	Amazon Clothing (test)	Recall@106.47	37
Recommendation	MedicalRec I	HitRate@10070.72	13
Medical Recommendation	MedicalRec II	DCG@50.0356	13
Recommendation	MedicalRec III	Hit Rate@10072	13
Recommendation	MedicalRec IV	HitRate@10074	13
Medical Recommendation	MedicalRec I	DCG@59.4	13

Showing 10 of 20 rows

Other info

Follow for update

@wizwand_team Discord