Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Model soups need only one ingredient

About

Fine-tuning large pre-trained models on a target distribution often improves in-distribution (ID) accuracy, but at the cost of out-of-distribution (OOD) robustness as representations specialize to the fine-tuning data. Weight-space ensembling methods, such as Model Soups, mitigate this effect by averaging multiple checkpoints, but they are computationally prohibitive, requiring the training and storage of dozens of fine-tuned models. In this paper, we introduce MonoSoup, a simple, data-free, hyperparameter-free, post-hoc method that achieves a strong ID-OOD balance using only a single checkpoint. Our method applies Singular Value Decomposition (SVD) to each layer's update and decomposes it into high-energy directions that capture task-specific adaptation and low-energy directions that introduce noise but may still encode residual signals useful for robustness. MonoSoup then uses entropy-based effective rank to automatically re-weigh these components with layer-wise coefficients that account for the spectral and geometric structure of the model. Experiments on CLIP models fine-tuned on ImageNet and evaluated under natural distribution shifts, as well as on Qwen language models tested on mathematical reasoning and multiple-choice benchmarks, show that this plug-and-play approach is a practical and effective alternative to multi-checkpoint methods, retaining much of their benefits without their computational overhead.

Alireza Abdollahpoorrostam, Nikolaos Dimitriadis, Adam Hazimeh, Pascal Frossard• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningGSM8K
Accuracy56.9
983
Image ClassificationImageNet-1k 1.0 (test)
Top-1 Accuracy80.34
197
Multiple-choice Question AnsweringMMLU-Pro
MMLU-Pro Overall Accuracy37.5
116
Multiple-choice Question AnsweringSciQ
Accuracy95.3
74
Mathematical ReasoningGSM8K Platinum
Accuracy59.4
37
Mathematical ReasoningGSMPlus
Accuracy31.9
23
Image ClassificationImageNet OOD (Avg of V2, R, Sketch, A, and ObjectNet) 1.0 (test)
Top-1 Acc51.6
17
Image ClassificationImageNet In-distribution 1k
Accuracy85.57
4
Image ClassificationAvg OOD (test)
Accuracy56.7
4
Showing 9 of 9 rows

Other info

Follow for update