Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FREE-Merging: Fourier Transform for Efficient Model Merging

About

With the rapid growth of deep learning, there is an increasing availability of open-source models for various tasks. However, single fine-tuned models often fall short of meeting the diverse needs of users. Model merging has thus emerged as an efficient method to integrate the capabilities of existing models into a unified model. Nevertheless, existing model merging methods face challenging trade-offs between performance and deployment costs, primarily due to task interference. For the first time, we reveal that task interference is evident in the frequency domain of model parameters, yet current efforts only focus on spatial domain solutions, which are largely ineffective in addressing frequency domain interference. To mitigate the impact of frequency domain interference, we propose FR-Merging, an innovative method that effectively filters harmful frequency domain interference on the backbone with minimal computational overhead. Since performance loss is inevitable with cost-free methods, we propose a lightweight task-specific expert module that dynamically compensates for information loss during merging. This proposed framework, FREE-Merging (FR-Merging with experts), strikes a balanced trade-off between training cost, inference latency, storage requirements, and performance. We demonstrate the effectiveness of both FR-Merging and FREE-Merging on multiple tasks across CV, NLP, and Multi-Modal domains and show that they can be flexibly adapted to specific needs.

Shenghe Zheng, Hongzhi Wang• 2024

Related benchmarks

TaskDatasetResultRank
Visual Question AnsweringVizWiz
Accuracy52.86
1820
Visual Question AnsweringChartQA--
519
Image ClassificationSVHN (test)
Accuracy96.3
470
Visual Question AnsweringScienceQA
Accuracy71.06
446
Image ClassificationDTD (test)
Accuracy75.4
316
Instruction FollowingMT-Bench
MT-Bench Score8.3
287
Visual Question AnsweringOK-VQA
Accuracy31.31
272
Image ClassificationSUN397 (test)
Top-1 Accuracy76.4
231
Image ClassificationEuroSAT (test)
Accuracy99.5
177
Visual Question AnsweringGQA
Accuracy61.92
155
Showing 10 of 63 rows

Other info

Follow for update