General Knowledge Utility on MMLU (Drop in Utility)
[Chart: Drop in Utility over time. Plotted point: Latent Similarity, -17.6, Apr 1, 2026.]
Updated 5d ago
Evaluation Results
Method             Setting                    Date     Drop in Utility
Latent Similarity  Selection Criterion=La...  2026.04  -17.6
Perplexity         Selection Criterion=Pe...  2026.04  -1.7
Random             Selection Criterion=Ra...  2026.04  1
Self Certainty     Selection Criterion=Se...  2026.04  5.8
Latent Similarity  Selection Criterion=La...  2026.04  11.4
KL                 Selection Criterion=KL     2026.04  36.4