Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Aware First, Think Less: Dynamic Boundary Self-Awareness Drives Extreme Reasoning Efficiency in Large Language Models

About

Recent advancements in large language models (LLMs) have greatly improved their capabilities on complex reasoning tasks through Long Chain-of-Thought (CoT). However, this approach often results in substantial redundancy, impairing computational efficiency and causing significant delays in real-time applications. To improve the efficiency, current methods often rely on human-defined difficulty priors, which do not align with the LLM's self-awared difficulty, leading to inefficiencies. In this paper, we introduce the Dynamic Reasoning-Boundary Self-Awareness Framework (DR. SAF), which enables models to dynamically assess and adjust their reasoning depth in response to problem complexity. DR. SAF integrates three key components: Boundary Self-Awareness Alignment, Adaptive Reward Management, and a Boundary Preservation Mechanism. These components allow models to optimize their reasoning processes, balancing efficiency and accuracy without compromising performance. Our experimental results demonstrate that DR. SAF achieves a 49.27% reduction in total response tokens with minimal loss in accuracy. The framework also delivers a 6.59x gain in token efficiency and a 5x reduction in training time, making it well-suited to resource-limited settings. During extreme training, DR. SAF can even surpass traditional instruction-based models in token efficiency with more than 16% accuracy improvement.

Qiguang Chen, Dengyun Peng, Jinhao Liu, HuiKang Su, Jiannan Guan, Libo Qin, Wanxiang Che• 2025

Related benchmarks

TaskDatasetResultRank
Logical reasoningLSAT-AR
Accuracy72.17
22
Out-of-Domain Reasoning AggregationOOD Average
Accuracy61.16
22
Multi-step Narrative ReasoningMuSR
Accuracy61.31
22
Scientific Question AnsweringGPQA Diamond
Accuracy (ACC)50
22
Mathematical ReasoningMATH 500
Accuracy93.3
19
Mathematical ReasoningAIME 2025
Accuracy57.9
15
Mathematical ReasoningOlympiadBench
Accuracy71.3
14
Showing 7 of 7 rows

Other info

Follow for update