
Your Models Have Thought Enough: Training Large Reasoning Models to Stop Overthinking

About

Large Reasoning Models (LRMs) have achieved impressive performance on challenging tasks, yet their deep reasoning often incurs substantial computational costs. Existing reinforcement learning methods for efficient reasoning still struggle to construct short reasoning paths during the rollout stage, which limits effective learning. Inspired by Evidence Accumulation Models, we find that LRMs accumulate sufficient information early in reasoning, making further reasoning steps redundant. Based on this insight, we propose Just-Enough Thinking (JET), which trains models to proactively terminate unnecessary reasoning. JET performs trajectory truncation during rollout to expose the model to short, distributionally consistent reasoning paths. In addition, it uses a quality-controlled length reward to encourage concise reasoning while maintaining correctness. Extensive experiments demonstrate that JET significantly improves reasoning efficiency without sacrificing accuracy. Notably, DeepSeek-Distill-Qwen-1.5B achieves a 4.6% accuracy gain while reducing output length by 46.3% on the Olympiad benchmark. Our code is available on GitHub.
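The two mechanisms named in the abstract, trajectory truncation during rollout and a quality-controlled length reward, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `truncate_trajectory` and `length_reward`, the `keep_fraction` parameter, and the specific reward formula are all assumptions made for illustration, since the abstract does not give these details.

```python
from typing import List


def truncate_trajectory(steps: List[str], keep_fraction: float) -> List[str]:
    """Keep only the leading fraction of reasoning steps (at least one).

    Hypothetical stand-in for truncation during rollout: the prefix comes
    from the model's own trajectory, so it stays on-distribution.
    """
    if not 0.0 < keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in (0, 1]")
    keep = max(1, int(len(steps) * keep_fraction))
    return steps[:keep]


def length_reward(is_correct: bool, length: int, max_length: int) -> float:
    """Hypothetical quality-controlled length reward (assumed form).

    Incorrect answers receive 0 regardless of length, so brevity is only
    rewarded once correctness is established.
    """
    if max_length <= 0:
        raise ValueError("max_length must be positive")
    if not is_correct:
        return 0.0
    # Correct answers earn 1.0 plus a bonus in [0, 1] that grows as the
    # output gets shorter relative to max_length.
    clipped = min(max(length, 0), max_length)
    return 1.0 + (1.0 - clipped / max_length)
```

Under these assumptions, a correct answer that uses half of the length budget scores 1.5, a correct answer that uses the full budget scores 1.0, and any incorrect answer scores 0.0, so shortening an output never pays off at the expense of correctness.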

Jinyi Han, Ying Huang, Ying Liao, Zishang Jiang, Xikun Lu, Haiquan Zhao, Xinyi Wang, Guanghao Zhou, Sihang Jiang, Jiaqing Liang, Weikang Zhou, Zeye Sun, Fei Yu, Yanghua Xiao • 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Mathematical Reasoning | MATH500 (test) | Accuracy | 91.2 | 514 |
| Mathematical Reasoning | AMC | Accuracy (ACC) | 76.8 | 203 |
| Mathematical Reasoning | AIME 2024 (test) | Accuracy | 54 | 159 |
| Mathematical Reasoning | AIME 24 | Accuracy | 41 | 154 |
| Mathematical Reasoning | MATH 500 | Accuracy (Acc) | 89 | 149 |
| Mathematical Reasoning | AIME24 | Pass@1 Accuracy | 52.9 | 82 |
| Mathematical Reasoning | Olympiad | Accuracy | 0.591 | 68 |
| Mathematical Reasoning | AMC 23 | Pass@1 Accuracy | 79.5 | 48 |
| Multi-discipline Understanding | MMLU | Accuracy | 63.9 | 33 |
| Mathematical Reasoning | AMC (test) | Accuracy (Pass@1) | 81 | 31 |
Showing 10 of 20 rows
