Superpose Task-specific Features for Model Merging

About

Model merging enables powerful capabilities in neural networks without requiring additional training. In this paper, we introduce a novel perspective on model merging by leveraging the fundamental mechanisms of neural network representation. Our approach is motivated by the linear representation hypothesis, which states that neural networks encode information through linear combinations of feature vectors. We propose a method that superposes task-specific features from individual models into a merged model. Our approach specifically targets linear transformation matrices, which are crucial for feature activation and extraction in deep networks. By formulating the merging process as a linear system, we can preserve task-specific features from individual models and create merged models that effectively maintain multi-task capabilities compared to existing methods. Extensive experiments across diverse benchmarks and models demonstrate that our method outperforms existing techniques. Code is available at https://github.com/LARS-research/STF.

Haiquan Qiu, You Wu, Dong Li, Jianmin Guo, Quanming Yao• 2025

Related benchmarks

Task	Dataset	Result
Multimodal Understanding	MMStar	Accuracy31.51	511
Multimodal Understanding	SEEDBench2 Plus	Accuracy34.43	138
Multimodal Understanding	MMMU	Accuracy30.67	34
Multilingual Multimodal Multiple-Choice Question Answering	Afri-MCQA	Average Accuracy23.86	15
Visual Question Answering	CVQA	--	14
Multimodal Understanding	XMMMU	Avg_mul32.05	11
Multilingual Visual Question Answering	MaXM	Avg. Score (MaXM)11.09	11
Multicultural Visual Reasoning	MaRVL	Avg_mul Score50.87	10
Visual Question Answering	xGQA	Avg_mul Score10.87	10

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord