Our new X account is live! Follow @wizwand_team for updates

LLM Routing on Big Math In-Domain

0.6618AUROC

ProbeDirichlet

Updated 4d ago

Evaluation Results

Method	Links
ProbeDirichlet 2026.02		0.6618
ConfidenceMargin 2026.02		0.5618
EmbeddingMLP 2026.02		0.5618
SemanticEntropy 2026.02		0.5581
Entropy 2026.02		0.5141
MaxLogits 2026.02		0.4739
SelfAsk 2026.02		0.472