Not All Queries Need Deep Thought: CoFiCot for Adaptive Coarse-to-fine Stateful Refinement

About

Scaling test-time computation enhances LLM reasoning ability but faces a uniform computation paradox. Allocating identical resources leads to over-correction on simple tasks and insufficient refinement on complex ones. To address this, we propose CoFiCot, a coarse-to-fine adaptive framework that dynamically tailors inference strategies to problem difficulty. Specifically, we implement a multi-metric classifier that triages queries by synthesizing semantic entropy, consensus reliability, and predicted reasoning depth. This enables a differentiated refinement stage that applies efficient aggregation for simple queries while routing complex ones to a context-aware correction loop. We formalize correction as a stateful sequential propagation process, where each repair is strictly conditioned on the verified history of prior rectifications. By integrating Process Reward Models (PRMs) within this state-dependent trajectory, CoFiCot effectively bridges the gap between granular error localization and global logical coherence, preventing the context fragmentation typical of stateless refinement methods.
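The triage-then-refine flow described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how the three signals are combined, so the function names, the thresholds, and the simple conjunction rule below are all hypothetical. Entropy over exact-match final answers stands in for semantic entropy, and `score_step` and `repair_step` are placeholder callables standing in for a PRM and an LLM-based repair call.

```python
import math
from collections import Counter


def answer_entropy(answers):
    """Shannon entropy (in nats) over distinct final answers.

    A crude stand-in for semantic entropy: answers are grouped by exact match.
    """
    counts = Counter(answers)
    n = len(answers)
    return -sum((c / n) * math.log(c / n) for c in counts.values())


def consensus(answers):
    """Fraction of sampled answers that agree with the most common one."""
    return Counter(answers).most_common(1)[0][1] / len(answers)


def triage(answers, predicted_depth,
           max_entropy=0.5, min_consensus=0.7, max_depth=4):
    """Label a query 'simple' or 'complex'. All thresholds are hypothetical."""
    easy = (answer_entropy(answers) <= max_entropy
            and consensus(answers) >= min_consensus
            and predicted_depth <= max_depth)
    return "simple" if easy else "complex"


def aggregate(answers):
    """Cheap path for simple queries: majority vote over sampled answers."""
    return Counter(answers).most_common(1)[0][0]


def stateful_refine(steps, score_step, repair_step, threshold=0.5):
    """Costly path for complex queries.

    Walks the reasoning steps in order. A step scoring below `threshold`
    is repaired, and both scoring and repair see only the history of steps
    already accepted or repaired, so each fix is conditioned on prior fixes.
    """
    history = []
    for step in steps:
        if score_step(history, step) < threshold:
            step = repair_step(history, step)
        history.append(step)
    return history
```

Under these assumptions, a query whose sampled answers all agree and whose predicted depth is small is resolved with `aggregate`, while a query with scattered answers goes through `stateful_refine`. Passing the accumulated `history` into every repair is what distinguishes this loop from a stateless one that would patch each step in isolation.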

Dongxu Zhang, Hongqiang Lin, Yiding Sun, Pengyu Wang, Qirui Wang, Ning Yang, Jihua Zhu • 2026

Related benchmarks

Task	Dataset	Result
Mathematical Reasoning	MATH	--	882
Multitask Language Understanding	MMLU	Accuracy 84.9	568
Mathematical Reasoning	SVAMP	Accuracy 95.8	403
Mathematical Problem Solving	MATH	Accuracy 57.7	229
Grade School Math Reasoning	GSM8K	Accuracy (GSM8K) 91.8	186
Commonsense Reasoning	ARC	Accuracy 88.2	61
Reasoning	SAT	Accuracy (SAT) 97.6	17
Logical reasoning	Date Understanding	Accuracy 80.8	4
