Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Towards Transfer-Efficient Multi-modal Sequential Recommendation with State Space Duality

About

Sequential Recommendation (SR) models infer user preferences from interaction histories. While transferable Multi-modal SR models outperform traditional ID-based approaches, existing methods struggle with slow fine-tuning convergence due to complex optimization requirements and negative transfer effects. We propose MMM4Rec (Multi-Modal Mamba for Sequential Recommendation), a novel Multi-modal SR framework that incorporates a dedicated algebraic constraint mechanism for efficient transfer learning. By combining State Space Duality (SSD)'s temporal decay properties with a globally-aware temporal modeling design, our model dynamically prioritizes key modality information, overcoming limitations of Transformer-based approaches. The framework implements a constrained two-stage process: (1) sequence-level cross-modal alignment via shared projection matrices, followed by (2) temporal fusion using our newly designed Cross-SSD module and dual-channel Fourier adaptive filtering. This architecture maintains semantic consistency while suppressing noise propagation. MMM4Rec achieves rapid fine-tuning convergence with simple cross-entropy loss, significantly improving Multi-modal recommendation accuracy while maintaining strong transferability. Extensive experiments demonstrate MMM4Rec's state-of-the-art performance, achieving strong multi-modal retrieval capability and exhibiting 10x faster average convergence speed when transferring to large-scale downstream datasets. The implementation is available at https://github.com/AlwaysFHao/MMM4Rec .

Hao Fan, Qingyang Liu, Hongjiu Liu, Yanrong Hu, Kai Fang• 2025

Related benchmarks

TaskDatasetResultRank
Sequential RecommendationAmazon Instruments (test)
NDCG@108.22
35
Sequential RecommendationOFFICE--
22
Sequential RecommendationInstruments--
20
Sequential RecommendationArts--
18
Sequential RecommendationAmazon Pantry (test)
NDCG@100.0481
14
Sequential RecommendationAmazon Scientific (test)
R@1013.48
12
Sequential RecommendationAmazon Pantry
Recall@108.85
9
Sequential RecommendationAmazon Scientific
Recall@1012.78
9
Multi-modal RetrievalOffice full-modality
R@1014.67
5
Sequential RecommendationSCIENTIFIC
Epochs13
3
Showing 10 of 11 rows

Other info

Follow for update