CARROT: A Cost Aware Rate Optimal Router

About

With the rapid growth in the number of Large Language Models (LLMs), there has been recent interest in LLM routing, that is, directing each query to the cheapest LLM that can deliver a suitable response. We conduct a minimax analysis of the routing problem, providing a lower bound and finding that a simple router that predicts both cost and accuracy for each question can be minimax optimal. Inspired by this, we introduce CARROT, a Cost AwaRe Rate Optimal rouTer that selects a model based on estimates of the models' cost and performance. Alongside CARROT, we also introduce the Smart Price-aware ROUTing (SPROUT) dataset to facilitate routing on a wide spectrum of queries with the latest state-of-the-art LLMs. Using SPROUT and prior benchmarks such as RouterBench and open-LLM-leaderboard-v2, we empirically validate CARROT's performance against several alternative routers.
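
The abstract above describes a router that picks a model from per-query estimates of cost and accuracy. The following is a minimal sketch of that idea, not the authors' implementation: it assumes the two estimates are already available as dictionaries, that costs have been scaled to be comparable with accuracies, and that the trade-off is a simple weighted difference. The function name `route`, the parameter `cost_weight`, and the model names and numbers are all hypothetical.

```python
from typing import Dict


def route(pred_accuracy: Dict[str, float],
          pred_cost: Dict[str, float],
          cost_weight: float) -> str:
    """Return the model with the best estimated accuracy/cost trade-off.

    Assumption (not from the abstract): the score is
    (1 - cost_weight) * accuracy - cost_weight * cost,
    with cost_weight in [0, 1].
    """
    if not 0.0 <= cost_weight <= 1.0:
        raise ValueError("cost_weight must be in [0, 1]")
    if pred_accuracy.keys() != pred_cost.keys():
        raise ValueError("both estimates must cover the same models")

    def score(model: str) -> float:
        return ((1.0 - cost_weight) * pred_accuracy[model]
                - cost_weight * pred_cost[model])

    # Pick the highest-scoring model for this single query.
    return max(pred_accuracy, key=score)


# Hypothetical per-query estimates for two made-up models.
accuracy = {"small": 0.70, "large": 0.90}
cost = {"small": 0.10, "large": 1.00}

print(route(accuracy, cost, cost_weight=0.0))  # accuracy only
print(route(accuracy, cost, cost_weight=0.5))  # balanced trade-off
```

With `cost_weight=0.0` the sketch ignores cost and returns the most accurate model; raising the weight shifts the choice toward cheaper models. Sweeping the weight over [0, 1] is one way to trace out a cost-accuracy curve for a given set of estimates.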

Seamus Somerstep, Felipe Maia Polo, Allysson Flavio Melo de Oliveira, Prattyush Mangal, Mírian Silva, Onkar Bhardwaj, Mikhail Yurochkin, Subha Maity • 2025

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Coding | Coding Tasks (test) | Pass@1: 95 | 42 |
| Deep search | Deep Search Tasks (test) | Pass@1: 86.3 | 42 |
| Routing | SPROUT (test) | Accuracy: 89.9 | 11 |
| Routing | RouterBench (test) | Accuracy: 74.9 | 11 |
| LLM Routing | RouterBench held-out (test) | Accuracy: 74.9 | 6 |
| LLM Routing | MMLU-Pro non-STEM | AUDC: 67 | 3 |
