Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Superpose Task-specific Features for Model Merging

About

Model merging enables powerful capabilities in neural networks without requiring additional training. In this paper, we introduce a novel perspective on model merging by leveraging the fundamental mechanisms of neural network representation. Our approach is motivated by the linear representation hypothesis, which states that neural networks encode information through linear combinations of feature vectors. We propose a method that superposes task-specific features from individual models into a merged model. Our approach specifically targets linear transformation matrices, which are crucial for feature activation and extraction in deep networks. By formulating the merging process as a linear system, we can preserve task-specific features from individual models and create merged models that effectively maintain multi-task capabilities compared to existing methods. Extensive experiments across diverse benchmarks and models demonstrate that our method outperforms existing techniques. Code is available at https://github.com/LARS-research/STF.

Haiquan Qiu, You Wu, Dong Li, Jianmin Guo, Quanming Yao• 2025

Related benchmarks

TaskDatasetResultRank
Multimodal UnderstandingMMStar
Accuracy31.51
407
Multimodal UnderstandingSEEDBench2 Plus
Accuracy34.43
138
Multimodal UnderstandingMMMU
Accuracy30.67
34
Multilingual Multimodal Multiple-Choice Question AnsweringAfri-MCQA
Average Accuracy23.86
15
Visual Question AnsweringCVQA--
14
Multimodal UnderstandingXMMMU
Avg_mul32.05
11
Multilingual Visual Question AnsweringMaXM
Avg. Score (MaXM)11.09
11
Multicultural Visual ReasoningMaRVL
Avg_mul Score50.87
10
Visual Question AnsweringxGQA
Avg_mul Score10.87
10
Showing 9 of 9 rows

Other info

Follow for update