Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CODA: Difficulty-Aware Compute Allocation for Adaptive Reasoning

About

The emergence of large reasoning models demonstrates that scaling inference-time compute significantly enhances performance on complex tasks. However, it often falls into another trap: overthinking simple problems, where repetitive rationales yield minimal accuracy gains at a disproportionately high cost. This motivates adaptive reasoning: dynamically aligning reasoning depth with instance difficulty. In this paper, we study adaptive reasoning from an optimality perspective, formalizing it as a utility maximization problem where tokens are allocated until the marginal accuracy gain falls below the incremental cost. Based on this, we propose CODA (Compute Allocation by Difficulty Awareness), a method that operationalizes this principle by allocating tokens via a policy-internal difficulty signal. Specifically, CODA estimates difficulty via group-based rollouts and maps it to two non-negative gates that modulate a length-dependent shaping term on top of the binary base reward. The easy-side gate penalizes verbosity on simple instances, whereas the hard-side gate encourages more deliberative rollouts on challenging ones. Across model scales and benchmarks, CODA achieves adaptive reasoning without external annotations or user-provided budgets: on easy tasks, CODA reduces token costs by over 60% while maintaining strong accuracy, whereas on hard tasks it incentivizes more deliberative rollouts to maximize performance.

Siye Wu, Jian Xie, Yikai Zhang, Yanghua Xiao• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningGSM8K
Accuracy94
499
Mathematical ReasoningMATH
Accuracy91.1
338
Mathematical ReasoningAIME24
Accuracy40.8
160
Mathematical ReasoningAMC 23
Accuracy82.2
15
General ReasoningCSQA
Accuracy81.3
15
General ReasoningGPQA
Accuracy50.6
15
Mathematical ReasoningSVAMP
Accuracy94.1
15
Mathematical ReasoningAIME 25
Accuracy30.4
15
Showing 8 of 8 rows

Other info

Follow for update