Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ReasonAny: Incorporating Reasoning Capability to Any Model via Simple and Effective Model Merging

About

Large Reasoning Models (LRMs) with long chain-of-thought reasoning have recently achieved remarkable success. Yet, equipping domain-specialized models with such reasoning capabilities, referred to as "Reasoning + X", remains a significant challenge. While model merging offers a promising training-free solution, existing methods often suffer from a destructive performance collapse: existing methods tend to both weaken reasoning depth and compromise domain-specific utility. Interestingly, we identify a counter-intuitive phenomenon underlying this failure: reasoning ability predominantly resides in parameter regions with low gradient sensitivity, contrary to the common assumption that domain capabilities correspond to high-magnitude parameters. Motivated by this insight, we propose ReasonAny, a novel merging framework that resolves the reasoning-domain performance collapse through Contrastive Gradient Identification. Experiments across safety, biomedicine, and finance domains show that ReasonAny effectively synthesizes "Reasoning + X" capabilities, significantly outperforming state-of-the-art baselines while retaining robust reasoning performance.

Junyao Yang, Chen Qian, Dongrui Liu, Wen Shen, Yong Liu, Jing Shao• 2026

Related benchmarks

TaskDatasetResultRank
Science Question AnsweringARC Challenge
Accuracy59.83
354
Mathematical ReasoningAIME
AIME Accuracy33.33
288
Code GenerationHumanEval
Pass@161.71
171
Science Question AnsweringARC Easy
Accuracy66.75
162
KnowledgeMMLU
Accuracy82.09
161
Safety EvaluationHarmBench
Harmbench Score2
127
General Knowledge EvaluationMMLU
MMLU Accuracy73.01
127
ReasoningGSM8K--
111
Code GenerationLiveCodeBench
Pass@126.48
86
Code ReasoningHumanEval
HumanEval Score92.32
62
Showing 10 of 38 rows

Other info

Follow for update