
ReasonAny: Incorporating Reasoning Capability to Any Model via Simple and Effective Model Merging

About

Large Reasoning Models (LRMs) with long chain-of-thought reasoning have recently achieved remarkable success. Yet equipping domain-specialized models with such reasoning capabilities, referred to as "Reasoning + X", remains a significant challenge. While model merging offers a promising training-free solution, existing methods often suffer from a destructive performance collapse: they tend to both weaken reasoning depth and compromise domain-specific utility. Interestingly, we identify a counter-intuitive phenomenon underlying this failure: reasoning ability predominantly resides in parameter regions with low gradient sensitivity, contrary to the common assumption that domain capabilities correspond to high-magnitude parameters. Motivated by this insight, we propose ReasonAny, a novel merging framework that resolves the reasoning-domain performance collapse through Contrastive Gradient Identification. Experiments across safety, biomedicine, and finance domains show that ReasonAny effectively synthesizes "Reasoning + X" capabilities, significantly outperforming state-of-the-art baselines while retaining robust reasoning performance.
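
To make the merging idea concrete, here is a toy sketch of sensitivity-masked task-vector merging. It is not the ReasonAny algorithm: the abstract does not describe how Contrastive Gradient Identification works, so the function name `masked_merge`, the `low_fraction` parameter, and the rule "take the reasoning delta at the lowest-sensitivity positions and the domain delta elsewhere" are all assumptions made purely for illustration.

```python
from typing import List


def masked_merge(
    base: List[float],
    reasoning: List[float],
    domain: List[float],
    reasoning_sensitivity: List[float],
    low_fraction: float = 0.5,
) -> List[float]:
    """Toy merge over flat parameter lists (hypothetical, not the paper's method).

    Positions whose reasoning-gradient sensitivity is among the lowest
    `low_fraction` receive the reasoning model's task vector; every other
    position receives the domain model's task vector.
    """
    n = len(base)
    if not (len(reasoning) == len(domain) == len(reasoning_sensitivity) == n):
        raise ValueError("all inputs must have the same length")

    # Number of positions treated as "low sensitivity" (assumed threshold rule).
    k = int(n * low_fraction)

    # Indices sorted by ascending sensitivity; the first k form the mask.
    order = sorted(range(n), key=lambda i: reasoning_sensitivity[i])
    low_sensitivity = set(order[:k])

    merged = []
    for i in range(n):
        source = reasoning if i in low_sensitivity else domain
        # Task vector = fine-tuned weight minus base weight, added back to base.
        merged.append(base[i] + (source[i] - base[i]))
    return merged


if __name__ == "__main__":
    merged = masked_merge(
        base=[0.0, 0.0, 0.0, 0.0],
        reasoning=[1.0, 1.0, 1.0, 1.0],
        domain=[2.0, 2.0, 2.0, 2.0],
        reasoning_sensitivity=[0.9, 0.1, 0.5, 0.2],
    )
    print(merged)
```

In this toy setting each position takes its delta from exactly one model, so the two task vectors never overwrite each other; a real method would need a principled way to compute the sensitivity scores and to handle positions that matter to both models.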

Junyao Yang, Chen Qian, Dongrui Liu, Wen Shen, Yong Liu, Jing Shao • 2026

Related benchmarks

Task                        Dataset        Result                 Rank
Mathematical Reasoning      AIME           AIME Accuracy 33.33    283
Science Question Answering  ARC Challenge  Accuracy 59.83         234
Code Generation             HumanEval      Pass@1 61.71           108
Science Question Answering  ARC Easy       Accuracy 66.75         101
Code Generation             LiveCodeBench  Pass@1 26.48           86
Reasoning                   GSM8K          --                     83
Safety Evaluation           HarmBench      HarmBench Score 2      76
Knowledge                   MMLU           Accuracy 82.09         71
Code Reasoning              HumanEval      HumanEval Score 92.32  35
Knowledge                   GPQA           Accuracy 56.25         34
Showing 10 of 38 rows
