Towards Transfer-Efficient Multi-modal Sequential Recommendation with State Space Duality
About
Sequential Recommendation (SR) models infer user preferences from interaction histories. While transferable Multi-modal SR models outperform traditional ID-based approaches, existing methods struggle with slow fine-tuning convergence due to complex optimization requirements and negative transfer effects. We propose MMM4Rec (Multi-Modal Mamba for Sequential Recommendation), a novel Multi-modal SR framework that incorporates a dedicated algebraic constraint mechanism for efficient transfer learning. By combining State Space Duality (SSD)'s temporal decay properties with a globally-aware temporal modeling design, our model dynamically prioritizes key modality information, overcoming limitations of Transformer-based approaches. The framework implements a constrained two-stage process: (1) sequence-level cross-modal alignment via shared projection matrices, followed by (2) temporal fusion using our newly designed Cross-SSD module and dual-channel Fourier adaptive filtering. This architecture maintains semantic consistency while suppressing noise propagation. MMM4Rec achieves rapid fine-tuning convergence with simple cross-entropy loss, significantly improving Multi-modal recommendation accuracy while maintaining strong transferability. Extensive experiments demonstrate MMM4Rec's state-of-the-art performance, achieving strong multi-modal retrieval capability and exhibiting 10x faster average convergence speed when transferring to large-scale downstream datasets. The implementation is available at https://github.com/AlwaysFHao/MMM4Rec .
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Sequential Recommendation | Amazon Instruments (test) | NDCG@108.22 | 35 | |
| Sequential Recommendation | OFFICE | -- | 22 | |
| Sequential Recommendation | Instruments | -- | 20 | |
| Sequential Recommendation | Arts | -- | 18 | |
| Sequential Recommendation | Amazon Pantry (test) | NDCG@100.0481 | 14 | |
| Sequential Recommendation | Amazon Scientific (test) | R@1013.48 | 12 | |
| Sequential Recommendation | Amazon Pantry | Recall@108.85 | 9 | |
| Sequential Recommendation | Amazon Scientific | Recall@1012.78 | 9 | |
| Multi-modal Retrieval | Office full-modality | R@1014.67 | 5 | |
| Sequential Recommendation | SCIENTIFIC | Epochs13 | 3 |