Structured Spectral Reasoning for Frequency-Adaptive Multimodal Recommendation

About

Multimodal recommendation aims to integrate collaborative signals with heterogeneous content such as visual and textual information, but remains challenged by modality-specific noise, semantic inconsistency, and unstable propagation over user-item graphs. These issues are often exacerbated by naive fusion or shallow modeling strategies, leading to degraded generalization and poor robustness. While recent work has explored the frequency domain as a lens to separate stable from noisy signals, most methods rely on static filtering or reweighting, lacking the ability to reason over spectral structure or adapt to modality-specific reliability. To address these challenges, we propose a Structured Spectral Reasoning (SSR) framework for frequency-aware multimodal recommendation. Our method follows a four-stage pipeline: (i) Decompose graph-based multimodal signals into spectral bands via graph-guided transformations to isolate semantic granularity; (ii) Modulate band-level reliability with spectral band masking, a training-time masking with a prediction-consistency objective that suppresses brittle frequency components; (iii) Fuse complementary frequency cues using hyperspectral reasoning with low-rank cross-band interaction; and (iv) Align modality-specific spectral features via contrastive regularization to promote semantic and structural consistency. Experiments on three real-world benchmarks show consistent gains over strong baselines, particularly under sparse and cold-start settings. Additional analyses indicate that structured spectral modeling improves robustness and provides clearer diagnostics of how different bands contribute to performance.

Wei Yang, Rui Zhong, Yiqun Chen, Chi Lu, Peng Jiang• 2025

Related benchmarks

Task	Dataset	Result
Recommendation	Amazon Baby (test)	Recall@200.1103	57
Recommendation	Amazon Sports (test)	Recall@108.25	57
Multimodal Recommendation	Pet	Recall@1012.21	35
Multimodal Recommendation	Amazon Baby	Recall@100.0728	28
Recommendation	Amazon Clothing (test)	Recall@107.08	27
Multimodal Recommendation	Amazon Sports	Recall@108.25	21
Multimodal Recommendation	Amazon Clothing	Recall@100.0708	21
Multimodal Recommendation	Amazon Beauty	Recall@1010.71	14

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord