LLM Routing on Alpaca In-Domain
Top result: ProbeDirichlet, AUROC 0.7202
[Chart: AUROC by date; y-axis: AUROC, x-axis: date (Feb 12, 2026)]
Updated 4d ago
Evaluation Results
Method            Method details              Date     AUROC
ProbeDirichlet    signal_modality=Hidden...   2026.02  0.7202
EmbeddingMLP      signal_modality=Embedd...   2026.02  0.6731
SemanticEntropy   signal_modality=Verbos...   2026.02  0.6202
MaxLogits         signal_modality=Logit-...   2026.02  0.5796
ConfidenceMargin  signal_modality=Logit-...   2026.02  0.5338
SelfAsk           signal_modality=Verbos...   2026.02  0.4903
Entropy           signal_modality=Logit-...   2026.02  0.4624