Diversity on news
[Chart: top entry RkH, 47 Wins, Mar 5, 2026. Legend: Wins, Losses, Ties]
Updated 1mo ago
Evaluation Results
Method  Method                     Links    Wins  Losses  Ties
RkH     LLM Evaluator=GPT-5-mi...  2026.03  47    38      15
MRC     LLM Evaluator=GPT-5-mi...  2026.03  45    29      24
M2hC    LLM Evaluator=GPT-5-mi...  2026.03  42    38      20
RkH     LLM Evaluator=GPT-5-mi...  2026.03  37    34      29
MRC     LLM Evaluator=GPT-5-mi...  2026.03  34    30      36
M2hC    LLM Evaluator=GPT-5-mi...  2026.03  32    33      35