Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Personalized Parameter-Efficient Fine-Tuning of Foundation Models for Multimodal Recommendation

About

In recent years, substantial research has integrated multimodal item metadata into recommender systems, often by using pre-trained multimodal foundation models to encode such data. Since these models are not originally trained for recommendation tasks, recent works efficiently adapt them via parameter-efficient fine-tuning (PEFT). However, even with PEFT, item embeddings from multimodal foundation models remain user-blind: item embeddings are not conditioned on user interests, despite the fact that users with diverse interests attend to different item aspects. To address this limitation, we propose PerPEFT, a personalized PEFT strategy for multimodal recommendation. Specifically, PerPEFT groups users by interest and assigns a distinct PEFT module to each group, enabling each module to capture the fine-grained item aspects most predictive of that group`s purchase decisions. We further introduce a specialized training technique that strengthens this user-group conditioning. Notably, PerPEFT is PEFT-agnostic and can be paired with any PEFT method applicable to multimodal foundation models. Through extensive experiments, we show that (1) PerPEFT outperforms the strongest baseline by up to 15.3% (NDCG@20) and (2) delivers consistent gains across diverse PEFT variants. It is noteworthy that, even with personalization, PEFT remains lightweight, adding only 1.3% of the parameter count of the foundation model. We provide our code and datasets at https://github.com/kswoo97/PerPEFT.

Sunwoo Kim, Hyunjin Hwang, Kijung Shin• 2026

Related benchmarks

TaskDatasetResultRank
Multimodal RecommendationSports and Outdoors
H@204.82
20
Multimodal RecommendationToys & Games
H@204.81
20
Multimodal RecommendationBeauty & Personal Care
Hit Rate @204.33
20
Multimodal RecommendationArts Crafts and Sewing
Hit Rate @206.62
20
Multimodal RecommendationAmazon Sports & Outdoors latest (subsampled 20% users)
H@204.82
3
Multimodal RecommendationAmazon Toys & Games subsampled 20% users latest
Hit Rate@204.81
3
Multimodal RecommendationAmazon Arts, Crafts, & Sewing latest (subsampled 40% users)
H@206.56
3
Showing 7 of 7 rows

Other info

Follow for update