Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Beyond Parameter Arithmetic: Sparse Complementary Fusion for Distribution-Aware Model Merging

About

Model merging has emerged as a promising paradigm for composing the capabilities of large language models by directly operating in weight space, enabling the integration of specialized models without costly retraining. However, existing merging methods largely rely on parameter-space heuristics, which often introduce severe interference, leading to degraded generalization and unstable generation behaviors such as repetition and incoherent outputs. In this work, we propose Sparse Complementary Fusion with reverse KL (SCF-RKL), a novel model merging framework that explicitly controls functional interference through sparse, distribution-aware updates. Instead of assuming linear additivity in parameter space, SCF-RKL measures the functional divergence between models using reverse Kullback-Leibler divergence and selectively incorporates complementary parameters. This mode-seeking, sparsity-inducing design effectively preserves stable representations while integrating new capabilities. We evaluate SCF-RKL across a wide range of model scales and architectures, covering both reasoning-focused and instruction-tuned models. Extensive experiments on 24 benchmarks spanning advanced reasoning, general reasoning and knowledge, instruction following, and safety demonstrate, vision classification that SCF-RKL consistently outperforms existing model merging methods while maintaining strong generalization and generation stability.

Weihong Lin, Lin Sun, Qilong Shi, Aomufei Yuan, Yuxuan Tian, Zhengyang Wang, Guangxiang Zhao, Xiangzheng Zhang, Tong Yang• 2026

Related benchmarks

TaskDatasetResultRank
Code GenerationHumanEval
Pass@183.23
850
Image ClassificationDTD
Accuracy40.9
419
Image ClassificationSVHN
Accuracy86.9
359
ClassificationCars
Accuracy56.1
314
Image ClassificationGTSRB
Accuracy53.9
291
Image ClassificationMNIST
Accuracy89.6
263
Image ClassificationRESISC45
Accuracy60.4
263
Image ClassificationSUN397
Accuracy59.8
246
Mathematical ReasoningGSM8K
pass@193.42
102
Instruction FollowingIFBench
Pass@1 (Strict)7.46
68
Showing 10 of 20 rows

Other info

Follow for update