Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Model merging with SVD to tie the Knots

About

Recent model merging methods demonstrate that the parameters of fully-finetuned models specializing in distinct tasks can be combined into one model capable of solving all tasks without retraining. Yet, this success does not transfer well when merging LoRA finetuned models. We study this phenomenon and observe that the weights of LoRA finetuned models showcase a lower degree of alignment compared to their fully-finetuned counterparts. We hypothesize that improving this alignment is key to obtaining better LoRA model merges, and propose KnOTS to address this problem. KnOTS uses the SVD to jointly transform the weights of different LoRA models into an aligned space, where existing merging methods can be applied. In addition, we introduce a new benchmark that explicitly evaluates whether merged models are general models. Notably, KnOTS consistently improves LoRA merging by up to 4.3% across several vision and language benchmarks, including our new setting. We release our code at: https://github.com/gstoica27/KnOTS.

George Stoica, Pratik Ramesh, Boglarka Ecsedi, Leshem Choshen, Judy Hoffman• 2024

Related benchmarks

TaskDatasetResultRank
Visual Question AnsweringVizWiz
Accuracy77
1525
Visual Question AnsweringChartQA--
371
Visual Question AnsweringScienceQA--
370
Image ClassificationDomainNet--
206
Image CaptioningCOCO--
130
Image Classification20 Vision Classification Tasks
Average Accuracy78.8
94
Image Classification14 Vision Tasks
Average Accuracy81.1
84
Image Classification8 Vision Tasks (test)
Avg Accuracy55.93
82
Image ClassificationSUN397, Cars, RESISC45, EuroSAT, SVHN, GTSRB, MNIST, DTD (test)
SUN39797.9
80
Image Classification8-task vision benchmark
Average Accuracy87.8
64
Showing 10 of 57 rows

Other info

Follow for update