Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Not All Queries Need Deep Thought: CoFiCot for Adaptive Coarse-to-fine Stateful Refinement

About

Scaling test-time computation enhances LLM reasoning ability but faces a uniform computation paradox. Allocating identical resources leads to over-correction on simple tasks and insufficient refinement on complex ones. To address this, we propose CoFiCot, a coarse-to-fine adaptive framework that dynamically tailors inference strategies to problem difficulty. Specifically, we implement a multi-metric classifier that triages queries by synthesizing semantic entropy, consensus reliability, and predicted reasoning depth . This enables a differentiated refinement stage that applies efficient aggregation for simple queries while routing complex ones to a context-aware correction loop . We formalize correction as a stateful sequential propagation process , where each repair is strictly conditioned on the verified history of prior rectifications. By integrating Process Reward Models (PRMs) within this state-dependent trajectory, CoFiCot effectively bridges the gap between granular error localization and global logical coherence, preventing the context fragmentation typical of stateless refinement methods.

Dongxu Zhang, Hongqiang Lin, Yiding Sun, Pengyu Wang, Qirui Wang, Ning Yang, Jihua Zhu• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningMATH--
882
Multitask Language UnderstandingMMLU
Accuracy84.9
413
Mathematical ReasoningSVAMP
Accuracy95.8
403
Mathematical Problem SolvingMATH
Accuracy57.7
229
Grade School Math ReasoningGSM8K
Accuracy (GSM8K)91.8
77
Commonsense ReasoningARC
Accuracy88.2
28
ReasoningSAT
Accuracy (SAT)97.6
17
Logical reasoningDate Understanding
Accuracy80.8
4
Showing 8 of 8 rows

Other info

Follow for update