Late-Evidence Analysis on DCI Evaluation Suite Late-Evid.
[Figure: Quality Score chart. Top entry: Single Agent, Quality Score 9.6. Date shown: Mar 12, 2026.]
Evaluation Results

Method            Date      Quality Score
Single Agent      2026.03   9.6
Voting            2026.03   9.26
DCI               2026.03   9.24
Self-Consistency  2026.03   8.8
Unstr. Debate     2026.03   8.45