CP-Router: An Uncertainty-Aware Router Between LLM and LRM

About

Recent advances in Large Reasoning Models (LRMs) have significantly improved long-chain reasoning capabilities over Large Language Models (LLMs). However, LRMs often produce unnecessarily lengthy outputs even for simple queries, leading to inefficiencies or even accuracy degradation compared to LLMs. To overcome this, we propose CP-Router, a training-free and model-agnostic routing framework that dynamically selects between an LLM and an LRM, demonstrated with multiple-choice question answering (MCQA) prompts. The routing decision is guided by the prediction uncertainty estimates derived via Conformal Prediction (CP), which provides rigorous coverage guarantees. To further refine the uncertainty differentiation across inputs, we introduce Full and Binary Entropy (FBE), a novel entropy-based criterion that adaptively selects the appropriate CP threshold. Experiments across diverse MCQA benchmarks, including mathematics, logical reasoning, and Chinese chemistry, demonstrate that CP-Router efficiently reduces token usage while maintaining or even improving accuracy compared to using LRM alone. We also extend CP-Router to diverse model pairings and open-ended QA, where it continues to demonstrate strong performance, validating its generality and robustness.
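The routing idea in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes split conformal prediction with the nonconformity score 1 - p(true option), a fixed error level `alpha`, and a hypothetical rule that sends a query to the LLM only when the prediction set contains exactly one option. The FBE criterion, which the paper uses to choose the CP threshold adaptively, is not described here in enough detail to reproduce and is omitted. All function names are illustrative.

```python
import math

def conformal_threshold(cal_probs, cal_labels, alpha):
    """Split-conformal quantile of the scores 1 - p(true option)."""
    scores = sorted(1.0 - p[y] for p, y in zip(cal_probs, cal_labels))
    n = len(scores)
    k = math.ceil((n + 1) * (1 - alpha))  # 1-indexed rank of the quantile
    if k > n:
        return 1.0  # too few calibration points: keep every option
    return scores[k - 1]

def prediction_set(probs, q_hat):
    """Indices of options whose score 1 - p does not exceed the threshold."""
    return [i for i, p in enumerate(probs) if 1.0 - p <= q_hat]

def route(probs, q_hat):
    """Assumed rule: a singleton set means the LLM is confident enough."""
    return "LLM" if len(prediction_set(probs, q_hat)) == 1 else "LRM"

# Toy calibration data: LLM option probabilities and the correct option index.
cal_probs = [
    [0.90, 0.05, 0.03, 0.02],
    [0.60, 0.20, 0.10, 0.10],
    [0.30, 0.40, 0.20, 0.10],  # the LLM is wrong here, so the score is high
]
cal_labels = [0, 0, 0]
q_hat = conformal_threshold(cal_probs, cal_labels, alpha=0.25)

print(route([0.85, 0.05, 0.05, 0.05], q_hat))  # confident query
print(route([0.40, 0.35, 0.15, 0.10], q_hat))  # ambiguous query
```

Under these assumptions, a smaller `alpha` raises the threshold, enlarges the prediction sets, and sends more queries to the LRM; this is the trade-off between token cost and accuracy that the threshold controls.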

Jiayuan Su, Fulin Lin, Zhaopeng Feng, Han Zheng, Teng Wang, Zhenyu Xiao, Xinlong Zhao, Zuozhu Liu, Lu Cheng, Hongwei Wang • 2025

Related benchmarks

Task | Dataset | Metric | Result | Rank
LLM Routing | MATH Out-of-Domain | AUROC | 55.3 | 7
LLM Routing | MATH Out-of-Domain | LPM Score | 63.81 | 7
LLM Routing | MMLU Pro Humanities Out-of-Domain | LPM | 50.77 | 7
LLM Routing | MMLU Pro Others (Out-of-Domain) | LPM | 0.5918 | 7
LLM Routing | Big Math In-Domain | AUROC | 0.5141 | 7
LLM Routing | MMLU In-Domain | AUROC | 0.4926 | 7
LLM Routing | Magpie Out-of-Domain | AUROC | 52.62 | 7
LLM Routing | Magpie Out-of-Domain | LPM Score | 63.08 | 7
LLM Routing | MMLU Pro Social Sciences (Out-of-Domain) | LPM | 57.1 | 7
LLM Routing | MMLU In-Domain | LPM | 75.29 | 7
Showing 10 of 13 rows
