
No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces

About

Model merging integrates the weights of multiple task-specific models into a single multi-task model. Despite recent interest in the problem, a significant performance gap between the combined and single-task models remains. In this paper, we investigate the key characteristics of task matrices -- weight update matrices applied to a pre-trained model -- that enable effective merging. We show that alignment between singular components of task-specific and merged matrices strongly correlates with performance improvement over the pre-trained model. Based on this, we propose an isotropic merging framework that flattens the singular value spectrum of task matrices, enhances alignment, and reduces the performance gap. Additionally, we incorporate both common and task-specific subspaces to further improve alignment and performance. Our proposed approach achieves state-of-the-art performance on vision and language tasks across various sets of tasks and model scales. This work advances the understanding of model merging dynamics, offering an effective methodology to merge models without requiring additional training. Code is available at https://github.com/danielm1405/iso-merging .

Daniel Marczak, Simone Magistri, Sebastian Cygert, Bartłomiej Twardowski, Andrew D. Bagdanov, Joost van de Weijer • 2025
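
The abstract describes flattening the singular value spectrum of task matrices before adding them back to the pre-trained weights. The snippet below is a minimal NumPy sketch of that idea for a single weight matrix, not the authors' implementation (see the linked repository for that). It makes several assumptions that the abstract does not state: the task matrices are summed first, every singular value of the sum is replaced by the mean singular value, and the merged update is applied with a scaling coefficient of 1.0. The function name `isotropic_merge` and the toy shapes are hypothetical, and the common and task-specific subspace step is not covered.

```python
import numpy as np


def isotropic_merge(task_matrices):
    """Sum the task matrices, then flatten the singular value spectrum of the sum.

    Assumption: "flattening" is done by replacing every singular value
    with the mean singular value; the abstract does not specify this.
    """
    summed = np.sum(task_matrices, axis=0)
    u, s, vt = np.linalg.svd(summed, full_matrices=False)
    s_iso = np.full_like(s, s.mean())  # isotropic (flat) spectrum
    return u @ np.diag(s_iso) @ vt


# Toy example: one 8x6 layer and three hypothetical fine-tuned models.
rng = np.random.default_rng(0)
pretrained = rng.normal(size=(8, 6))
finetuned = [pretrained + 0.1 * rng.normal(size=(8, 6)) for _ in range(3)]

# Task matrices are the weight updates relative to the pre-trained model.
task_matrices = [w - pretrained for w in finetuned]

delta = isotropic_merge(task_matrices)
merged_weights = pretrained + delta  # scaling coefficient assumed to be 1.0
```

After the merge, every singular value of `delta` is equal, so no single direction in the merged update dominates the others. In practice the update would typically be scaled by a tuned coefficient and the procedure repeated for each layer.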

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Visual Question Answering | VizWiz | Accuracy 43.99 | 1820 |
| Classification | Cars | Accuracy 74.2 | 492 |
| Image Classification | DTD | Accuracy 87.1 | 487 |
| Image Classification | RESISC45 | Accuracy 89.8 | 472 |
| Image Classification | SVHN (test) | Accuracy 76.8 | 470 |
| Image Classification | SUN397 | Accuracy 72.6 | 450 |
| Instruction Following | AlpacaEval | Win Rate 29.5 | 420 |
| Image Classification | MNIST | Accuracy 79.9 | 398 |
| Image Classification | SVHN | Accuracy 83.7 | 395 |
| Code Generation | HumanEval+ | Pass@1 50.7 | 393 |
Showing 10 of 184 rows
