Aware First, Think Less: Dynamic Boundary Self-Awareness Drives Extreme Reasoning Efficiency in Large Language Models

About

Recent advancements in large language models (LLMs) have greatly improved their capabilities on complex reasoning tasks through Long Chain-of-Thought (CoT). However, this approach often results in substantial redundancy, impairing computational efficiency and causing significant delays in real-time applications. To improve the efficiency, current methods often rely on human-defined difficulty priors, which do not align with the LLM's self-awared difficulty, leading to inefficiencies. In this paper, we introduce the Dynamic Reasoning-Boundary Self-Awareness Framework (DR. SAF), which enables models to dynamically assess and adjust their reasoning depth in response to problem complexity. DR. SAF integrates three key components: Boundary Self-Awareness Alignment, Adaptive Reward Management, and a Boundary Preservation Mechanism. These components allow models to optimize their reasoning processes, balancing efficiency and accuracy without compromising performance. Our experimental results demonstrate that DR. SAF achieves a 49.27% reduction in total response tokens with minimal loss in accuracy. The framework also delivers a 6.59x gain in token efficiency and a 5x reduction in training time, making it well-suited to resource-limited settings. During extreme training, DR. SAF can even surpass traditional instruction-based models in token efficiency with more than 16% accuracy improvement.

Qiguang Chen, Dengyun Peng, Jinhao Liu, HuiKang Su, Jiannan Guan, Libo Qin, Wanxiang Che• 2025

Related benchmarks

Task	Dataset	Result
Logical reasoning	LSAT-AR	Accuracy72.17	22
Out-of-Domain Reasoning Aggregation	OOD Average	Accuracy61.16	22
Multi-step Narrative Reasoning	MuSR	Accuracy61.31	22
Scientific Question Answering	GPQA Diamond	Accuracy (ACC)50	22
Mathematical Reasoning	MATH 500	Accuracy93.3	19
Mathematical Reasoning	AIME 2025	Accuracy57.9	15
Mathematical Reasoning	OlympiadBench	Accuracy71.3	14

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord