Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ReasonAny: Incorporating Reasoning Capability to Any Model via Simple and Effective Model Merging

About

Large Reasoning Models (LRMs) with long chain-of-thought reasoning have recently achieved remarkable success. Yet, equipping domain-specialized models with such reasoning capabilities, referred to as "Reasoning + X", remains a significant challenge. While model merging offers a promising training-free solution, existing methods often suffer from a destructive performance collapse: existing methods tend to both weaken reasoning depth and compromise domain-specific utility. Interestingly, we identify a counter-intuitive phenomenon underlying this failure: reasoning ability predominantly resides in parameter regions with low gradient sensitivity, contrary to the common assumption that domain capabilities correspond to high-magnitude parameters. Motivated by this insight, we propose ReasonAny, a novel merging framework that resolves the reasoning-domain performance collapse through Contrastive Gradient Identification. Experiments across safety, biomedicine, and finance domains show that ReasonAny effectively synthesizes "Reasoning + X" capabilities, significantly outperforming state-of-the-art baselines while retaining robust reasoning performance.

Junyao Yang, Chen Qian, Dongrui Liu, Wen Shen, Yong Liu, Jing Shao• 2026

Related benchmarks

TaskDatasetResultRank
Science Question AnsweringARC Challenge
Accuracy59.83
342
Mathematical ReasoningAIME
AIME Accuracy33.33
288
Code GenerationHumanEval
Pass@161.71
171
Science Question AnsweringARC Easy
Accuracy66.75
155
KnowledgeMMLU
Accuracy82.09
136
Safety EvaluationHarmBench
Harmbench Score2
112
ReasoningGSM8K--
106
Code GenerationLiveCodeBench
Pass@126.48
86
KnowledgeGPQA
Accuracy56.25
51
General Knowledge EvaluationMMLU
MMLU Accuracy73.01
45
Showing 10 of 38 rows

Other info

Follow for update