O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

About

Recently, long-thought reasoning LLMs, such as OpenAI's O1, have adopted extended reasoning processes similar to how humans ponder complex problems. This reasoning paradigm significantly enhances the model's problem-solving abilities and has achieved promising results. However, the long-thought reasoning process substantially increases inference time. A pressing challenge is to reduce the inference overhead of long-thought LLMs while preserving accuracy. In this paper, we experimentally demonstrate that long-thought reasoning models struggle to allocate token budgets effectively based on problem difficulty and reasoning redundancies. To address this, we propose Length-Harmonizing Fine-Tuning (O1-Pruner), which aims to minimize reasoning overhead while maintaining accuracy. This fine-tuning method first estimates the LLM's baseline performance through pre-sampling and then uses RL-style fine-tuning to encourage the model to generate shorter reasoning processes under accuracy constraints. This allows the model to reason efficiently with lower redundancy while maintaining accuracy. Experiments on various mathematical reasoning benchmarks show that O1-Pruner not only significantly reduces inference overhead but also achieves higher accuracy, providing a novel and promising solution to this challenge. Our code is coming soon at https://github.com/StarDewXXX/O1-Pruner

Haotian Luo, Li Shen, Haiying He, Yibo Wang, Shiwei Liu, Wei Li, Naiqiang Tan, Xiaochun Cao, Dacheng Tao • 2025
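
The abstract describes two steps: pre-sampling to estimate a baseline, then RL-style fine-tuning that favors shorter reasoning under an accuracy constraint. The sketch below shows one way such a training signal could be scored. It is an illustration under stated assumptions, not the authors' released code: the function name, the additive form of the reward, and the weight `lam` are all hypothetical choices made here for clarity.

```python
from statistics import mean


def length_harmonizing_reward(sample_len, sample_correct, ref_lens, ref_correct, lam=2.0):
    """Score one sampled solution against pre-sampled reference solutions.

    Hypothetical reward (an assumption, not taken from the abstract):
      length term   = mean reference length / sample length - 1
      accuracy term = sample correctness - mean reference accuracy
      reward        = length term + lam * accuracy term
    """
    if sample_len <= 0:
        raise ValueError("sample_len must be positive")
    if not ref_lens or len(ref_lens) != len(ref_correct):
        raise ValueError("reference lengths and correctness flags must be non-empty and aligned")

    # Baseline statistics estimated from the pre-sampled reference solutions.
    mean_ref_len = mean(ref_lens)
    mean_ref_acc = mean(1.0 if c else 0.0 for c in ref_correct)

    # Positive when the sample is shorter than the baseline, negative when longer.
    length_term = mean_ref_len / sample_len - 1.0
    # Positive when the sample is correct more often than the baseline, negative otherwise.
    accuracy_term = (1.0 if sample_correct else 0.0) - mean_ref_acc

    return length_term + lam * accuracy_term


# Two pre-sampled references: mean length 500 tokens, mean accuracy 0.5.
ref_lens = [400, 600]
ref_correct = [True, False]

print(length_harmonizing_reward(250, True, ref_lens, ref_correct))    # 2.0
print(length_harmonizing_reward(1000, False, ref_lens, ref_correct))  # -1.5
```

With these assumed terms, a short correct answer scores above zero and a long incorrect one scores below zero, so a policy-gradient style update would push the model toward shorter reasoning only when accuracy does not fall below the pre-sampled baseline. The weight `lam` controls how strongly an accuracy drop is penalized relative to a length saving.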

Related benchmarks

Task                     Dataset         Result                    Rank
Mathematical Reasoning   GSM8K           Accuracy 86.8             351
Mathematical Reasoning   MATH 500        Accuracy 91.2             73
Mathematical Reasoning   AMC 2023        Accuracy 87.66            65
Mathematical Reasoning   MATH500         Accuracy 85.75            57
Mathematical Reasoning   SAT Math        SAT Math Accuracy 91.21   44
Mathematical Reasoning   AIME 2025       Accuracy 38.33            38
Mathematical Reasoning   GSM8K (test)    Accuracy 96.5             33
Mathematical Reasoning   AIME 2024       Accuracy 51.46            33
Mathematical Reasoning   MATH 500        Accuracy 92.4             24
Mathematical Reasoning   AMC 23          Accuracy 90.5             24
Showing 10 of 20 rows
