LLM Routing on Magpie Out-of-Domain
[Chart: AUROC by date. Top result: ProbeDirichlet, 74.08 AUROC, Feb 12, 2026]
Evaluation Results
Method            Details                     Date      AUROC
ProbeDirichlet    signal_modality=Hidden...   2026.02   74.08
EmbeddingMLP      signal_modality=Embedd...   2026.02   68.97
MaxLogits         signal_modality=Logit-...   2026.02   60.86
SemanticEntropy   signal_modality=Verbos...   2026.02   58.82
Entropy           signal_modality=Logit-...   2026.02   52.62
ConfidenceMargin  signal_modality=Logit-...   2026.02   43.08
SelfAsk           signal_modality=Verbos...   2026.02   37.09