Diversity on microsoft
[Chart: Win Count. Top result: MRC, 54 (Mar 5, 2026). Metrics: Win Count, Loss Count, Tie Count.]
Updated 1mo ago
Evaluation Results
Method                                     Win Count  Loss Count  Tie Count
MRC (LLM Evaluator=GPT-5-mi..., 2026.03)          54          38          8
MRC (LLM Evaluator=GPT-5-mi..., 2026.03)          53          35         12
M2hC (LLM Evaluator=GPT-5-mi..., 2026.03)         52          40          8
RkH (LLM Evaluator=GPT-5-mi..., 2026.03)          48          44          8
M2hC (LLM Evaluator=GPT-5-mi..., 2026.03)         45          48          7
RkH (LLM Evaluator=GPT-5-mi..., 2026.03)          33          53         14