Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Router: Rethinking Routing with Prefill Activations

About

LLMs often achieve similar average benchmark accuracies while exhibiting complementary strengths on different subsets of queries, suggesting that a router with query-specific model selection can outperform any single model. While existing routers rely on semantic query features, they often fail to capture model-specific failures or intrinsic task difficulty. We instead study routing via internal prefill activations. Our key idea, Encoder-Target Decoupling, separates the model that produces the predictive signal (the Encoder) from the model whose correctness is being estimated (the Target), allowing open-weight encoders to predict the performance of closed-source target models. We evaluate layerwise geometric probes, finding that Fisher Separability (J) effectively identifies informative layers, supported by Effective Dimensionality (d_eff) diagnostics. We then utilize a SharedTrunkNet, a joint multi-output MLP that predicts simultaneous correctness probabilities across candidate models using concatenated prefill features. In our experiments, SharedTrunkNet consistently outperforms semantic baselines. At its best, SharedTrunkNet closes 45.58% of the gap between the strongest standalone model and the oracle while achieving 74.31% cost savings relative to the most expensive model. These results demonstrate that prefill activations provide a robust routing signal, establishing mechanistic routing as a high-performance alternative to purely semantic selection.

Tanay Varshney, Annie Surla, Michelle Xu, Gomathy Venkata Krishnan, Maximilian Jeblick, David Austin, Neal Vaidya, Davide Onofrio• 2026

Related benchmarks

TaskDatasetResultRank
Model RoutingGlobal Routing Dataset Frontier Pool
P-AUCCC0.4377
7
Model RoutingGlobal Routing Dataset Small Pool
P-AUCCC0.5472
7
Model RoutingGlobal Routing Dataset Mixed Pool
P-AUCCC0.2323
7
Model RoutingFrontier pool
Mean per-model AUC0.856
6
Model RoutingSmall pool
Mean per-model AUC82.6
6
Model RoutingMixed pool
Mean per-model AUC0.8817
6
Model RoutingFrontier pool
Oracle Accuracy89.3
6
Model RoutingSmall pool
Oracle Accuracy92
6
Showing 8 of 8 rows

Other info

Follow for update