Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
LLM-as-judge evaluation on BookCorpus
Loading...
2.4
Diversity Rank
OverRIDE
1.5992
1.8071
2.015
2.2229
Apr 27, 2026
Diversity Rank
Quality Rank
Updated 1mo ago
Evaluation Results
Method
Method
Links
Diversity Rank
Quality Rank
OverRIDE
2026.04
2.4
2.2
Vanilla
2026.04
1.97
1.83
ESamp
2026.04
1.63
1.97
Feedback
Search any
task
Search any
task