Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging

About

Multi-task model merging offers a promising paradigm for integrating multiple expert models into a unified model without additional training. Existing state-of-the-art techniques, such as Task Arithmetic and its variants, merge models by accumulating task vectors -- the parameter differences between pretrained and finetuned models. However, task vector accumulation is often hindered by knowledge conflicts, leading to performance degradation. To address this challenge, we propose Conflict-Aware Task Merging (CAT Merging), a novel training-free framework that selectively trims conflict-prone components from the task vectors. CAT Merging introduces several parameter-specific strategies, including projection for linear weights and masking for scaling and shifting parameters in normalization layers. Extensive experiments on vision, language, and vision-language tasks demonstrate that CAT Merging effectively suppresses knowledge conflicts, achieving average accuracy improvements of up to 2.5% (ViT-B/32) and 2.0% (ViT-L/14) over state-of-the-art methods.

Wenju Sun, Qingyong Li, Yangli-ao Geng, Boyang Li• 2025

Related benchmarks

TaskDatasetResultRank
Image ClassificationSVHN (test)
Accuracy85.5
470
Image ClassificationDTD (test)
Accuracy60.7
316
Image ClassificationSUN397 (test)
Top-1 Accuracy68.1
231
Image ClassificationEuroSAT (test)
Accuracy89.5
177
Image ClassificationMNIST (test)
Accuracy98.6
138
Image ClassificationGTSRB (test)
Accuracy (Clean)78.5
94
Image ClassificationCars (test)
Accuracy65.4
73
Image ClassificationResisc45 (test)--
62
Image Classification8 vision tasks Average
Average Accuracy78.3
53
Showing 9 of 9 rows

Other info

Follow for update