Critique Quality Evaluation on Human Evaluation Overall
[Chart: Win Rate by date. Marked point: RCO, 66 (Jun 27, 2025). Metrics shown: Win Rate, Tie Rate, Loss Rate, Gap Score.]
Evaluation Results
Method  Setting                    Date     Win Rate  Tie Rate  Loss Rate  Gap Score
RCO     Comparison Target=LLaM...  2025.06  66        18.5      15.5       50.5
RCO     Comparison Target=LLaM...  2025.06  49        24.5      26.5       22.5
RCO     Comparison Target=LLaM...  2025.06  48        35        17         31
RCO     Comparison Target=LLaM...  2025.06  47.5      25.5      27         20.5
RCO     Comparison Target=LLaM...  2025.06  43        33.5      23.5       19.5
RCO     Comparison Target=LLaM...  2025.06  38        32.5      29.5       8.5
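The page does not define Gap Score. The listed values are consistent with Gap Score being Win Rate minus Loss Rate, and the three rates in each row sum to 100. The sketch below checks both relationships against the six listed rows under that assumption. The names `gap_score` and `check_rows` are illustrative and do not come from the source.

```python
# Rows transcribed from the results above:
# (win_rate, tie_rate, loss_rate, gap_score)
ROWS = [
    (66.0, 18.5, 15.5, 50.5),
    (49.0, 24.5, 26.5, 22.5),
    (48.0, 35.0, 17.0, 31.0),
    (47.5, 25.5, 27.0, 20.5),
    (43.0, 33.5, 23.5, 19.5),
    (38.0, 32.5, 29.5, 8.5),
]


def gap_score(win_rate, loss_rate):
    """Assumed definition (not stated on the page): win rate minus loss rate."""
    return win_rate - loss_rate


def check_rows(rows, tol=1e-9):
    """Return True if every row matches both assumed relationships."""
    for win, tie, loss, gap in rows:
        # Reported gap should equal win rate minus loss rate.
        if abs(gap_score(win, loss) - gap) > tol:
            return False
        # Win, tie, and loss rates should account for all comparisons.
        if abs(win + tie + loss - 100.0) > tol:
            return False
    return True


if __name__ == "__main__":
    print(check_rows(ROWS))
```

If the benchmark defines Gap Score differently, only the body of `gap_score` needs to change; the row check stays the same.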