Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

RegMean++: Enhancing Effectiveness and Generalization of Regression Mean for Model Merging

About

Regression Mean (RegMean), an approach that formulates model merging as a linear regression problem, aims to find the optimal weights for each linear layer in the merge model by minimizing the discrepancy in predictions between the merge and candidate models. RegMean provides a precise closed-form solution for the merging problem; therefore, it offers explainability and computational efficiency. However, RegMean merges each linear layer independently, overlooking how the features and information in the earlier layers propagate through the layers and influence the final prediction in the merge model. In this paper, we introduce RegMean++, a simple yet effective alternative to RegMean, that explicitly incorporates both intra- and cross-layer dependencies between merge models' layers into RegMean's objective. By accounting for these dependencies, RegMean++ better captures the behaviors of the merge model. Extensive experiments demonstrate that RegMean++ consistently outperforms RegMean across diverse settings, including in-domain (ID) and out-of-domain (OOD) generalization, sequential merging, large-scale tasks, and robustness under several types of distribution shifts. Furthermore, RegMean++ achieves competitive or state-of-the-art performance compared to various recent advanced model merging methods. Our code is available at https://github.com/nthehai01/RegMean-plusplus.

The-Hai Nguyen, Dang Huu-Tien, Takeshi Suzuki, Le-Minh Nguyen• 2025

Related benchmarks

TaskDatasetResultRank
Image ClassificationEuroSAT
Accuracy44.1
497
Image ClassificationStanford Cars
Accuracy70.8
477
Image ClassificationSVHN
Accuracy95.5
359
Image ClassificationGTSRB
Accuracy93.2
291
Image ClassificationRESISC45
Accuracy90.2
263
Image ClassificationMNIST
Accuracy81.3
263
Image ClassificationSUN397
Accuracy69.8
246
Image ClassificationTALL-14
Accuracy87.9
28
Image ClassificationTALL-20
Accuracy82.5
28
Image ClassificationTA-8
Accuracy88.3
28
Showing 10 of 10 rows

Other info

Follow for update