Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Guiding Reasoning in Small Language Models with LLM Assistance

About

The limited reasoning capabilities of small language models (SLMs) cast doubt on their suitability for tasks demanding deep, multi-step logical deduction. This paper introduces a framework called Small Reasons, Large Hints (SMART), which selectively augments SLM reasoning with targeted guidance from large language models (LLMs). Inspired by the concept of cognitive scaffolding, SMART employs a score-based evaluation to identify uncertain reasoning steps and injects corrective LLM-generated reasoning only when necessary. By framing structured reasoning as an optimal policy search, our approach steers the reasoning trajectory toward correct solutions without exhaustive sampling. Our experiments on mathematical reasoning datasets demonstrate that targeted external scaffolding significantly improves performance, paving the way for collaborative use of both SLM and LLM to tackle complex reasoning tasks that are currently unsolvable by SLMs alone.

Yujin Kim, Euiin Yi, Minu Kim, Se-Young Yun, Taehyeon Kim• 2025

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningMATH 500
Accuracy (Acc)83.2
543
Mathematical ReasoningAIME 24
Accuracy68.33
318
Mathematical ReasoningMATH 500
Accuracy94
79
Scientific ReasoningGPQA Diamond--
54
Mathematical ReasoningMATH-500 1.0 (test)
Accuracy85.4
26
General Reasoning PerformanceAggregate (MATH-500, AIME24, AIME25, GPQA-diamond)
Accuracy68.71
15
Mathematical ReasoningAIME 25
Accuracy (AIME 25)58.96
15
Showing 7 of 7 rows

Other info

Follow for update