Model Stock: All we need is just a few fine-tuned models

About

This paper introduces an efficient fine-tuning method for large pre-trained models, offering strong in-distribution (ID) and out-of-distribution (OOD) performance. Breaking away from traditional practices that need a multitude of fine-tuned models for averaging, our approach employs significantly fewer models to achieve final weights yet yield superior accuracy. Drawing from key insights in the weight space of fine-tuned weights, we uncover a strong link between the performance and proximity to the center of weight space. Based on this, we introduce a method that approximates a center-close weight using only two fine-tuned models, applicable during or after training. Our innovative layer-wise weight averaging technique surpasses state-of-the-art model methods such as Model Soup, utilizing only two fine-tuned models. This strategy can be aptly coined Model Stock, highlighting its reliance on selecting a minimal number of models to draw a more optimized-averaged model. We demonstrate the efficacy of Model Stock with fine-tuned models based upon pre-trained CLIP architectures, achieving remarkable performance on both ID and OOD tasks on the standard benchmarks, all while barely bringing extra computational demands. Our code and pre-trained models are available at https://github.com/naver-ai/model-stock.

Dong-Hwan Jang, Sangdoo Yun, Dongyoon Han• 2024

Related benchmarks

Task	Dataset	Result
Mathematical Reasoning	GSM8K	Accuracy59.67	1398
Mathematical Reasoning	MATH	Accuracy16.64	882
Multiple-choice Question Answering	MMLU-Pro	MMLU-Pro Overall Accuracy36.8	130
Image Classification	ImageNet Rendition	Top-1 Accuracy71.77	113
Multiple-choice Question Answering	SciQ	Accuracy95.2	91
Safety Alignment	HarmBench	ASR17.25	88
Code Generating	MBPP	Pass@147.8	88
Class-incremental learning	CIFAR100 10 Tasks	Accuracy70.7	66
Class-incremental learning	ImageNet-R 5-task	--	64
Class-incremental learning	CIFAR-100 20 tasks	Accuracy68.3	58

Showing 10 of 39 rows

Other info

Follow for update

@wizwand_team Discord