Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RegMean++: Enhancing Effectiveness and Generalization of Regression Mean for Model Merging

About

Regression Mean (RegMean), an approach that formulates model merging as a linear regression problem, aims to find the optimal weights for each linear layer in the merged model by minimizing the discrepancy in predictions between the merged and candidate models. RegMean provides a precise closed-form solution for the merging problem; therefore, it offers explainability and computational efficiency. However, RegMean merges each linear layer independently, overlooking how the features and information in earlier layers propagate through deeper layers and influence the final predictions of the merged model. Here, we introduce RegMean++, a simple yet effective alternative to RegMean, that explicitly incorporates both intra-layer and cross-layer dependencies between merged models' layers into RegMean's objective. By accounting for these dependencies, RegMean++ better captures the behaviors of the merged model. Extensive experiments demonstrate that RegMean++ consistently outperforms RegMean across diverse settings, including in-domain (ID) and out-of-domain (OOD) generalization, sequential merging, large-scale tasks, and robustness under several types of distribution shifts. Furthermore, RegMean++ achieves competitive performance across diverse settings compared to various advanced model merging methods.

The-Hai Nguyen, Dang Huu-Tien, Takeshi Suzuki, Le-Minh Nguyen• 2025

Related benchmarks

TaskDatasetResultRank
Image ClassificationStanford Cars
Accuracy70.8
660
Image ClassificationEuroSAT
Accuracy44.1
569
Image ClassificationRESISC45
Accuracy90.2
472
Image ClassificationSUN397
Accuracy69.8
450
Image ClassificationMNIST
Accuracy81.3
398
Image ClassificationSVHN
Accuracy95.5
395
Image ClassificationGTSRB
Accuracy93.2
291
Image Classification8-task vision benchmark
Average Accuracy91.9
193
Image Classification20 tasks collection
Average Absolute Accuracy77.8
96
Visual Classification8 Vision Tasks (SUN397, Stanford Cars, RESISC45, EuroSAT, SVHN, GTSRB, MNIST, DTD)
Average Accuracy91
86
Showing 10 of 31 rows

Other info

Follow for update