Reasoning on ARC Easy (Min/Max/Avg/Voting/ETTC)
[Chart: Minimum Score over time. Highlighted point: Qwen-3-Thinking Ensemble, 97.14, May 29, 2026. Selectable metrics: Minimum Score, Maximum Score, Average Score, Voting Consensus Score, ETTC (Estimated True Test Confidence).]
Updated 2d ago
Evaluation Results
Method                    Models          Date     Minimum Score  Maximum Score  Average Score  Voting Consensus Score  ETTC
Qwen-3-Thinking Ensemble  30B, 235B       2026.05  97.14          97.72          97.43          98.74                   98.95
Qwen-3-Thinking Ensemble  4B, 30B         2026.05  95.99          97.14          96.56          97.69                   98.78
Qwen-3-Thinking Ensemble  4B, 235B        2026.05  95.99          97.72          96.86          97.69                   98.78
Qwen-3-Thinking Ensemble  4B, 30B, 235B   2026.05  95.99          97.72          96.95          98.91                   98.99
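
The Minimum, Maximum, and Average columns appear to be simple aggregates over the individual models in each ensemble. The sketch below checks that reading. It assumes single-model accuracies of 95.99 (4B), 97.14 (30B), and 97.72 (235B), which are inferred from the Minimum and Maximum columns and are not stated on the page. The Voting Consensus and ETTC columns depend on per-question predictions and are not reproduced here.

```python
from statistics import mean

# Assumed single-model accuracies (%), inferred from the Minimum/Maximum
# columns of the results; the page does not list them directly.
single_model_scores = {"4B": 95.99, "30B": 97.14, "235B": 97.72}


def summarize(models):
    """Return min, max, and mean accuracy over the models in one ensemble."""
    scores = [single_model_scores[m] for m in models]
    return {"min": min(scores), "max": max(scores), "avg": mean(scores)}


# The four ensembles listed in the results.
ensembles = [
    ("30B", "235B"),
    ("4B", "30B"),
    ("4B", "235B"),
    ("4B", "30B", "235B"),
]

summaries = {models: summarize(models) for models in ensembles}

for models, s in summaries.items():
    label = ", ".join(models)
    print(f"{label:<15} min={s['min']:.2f} max={s['max']:.2f} avg={s['avg']:.3f}")
```

Under these assumed inputs the computed averages agree with the Average Score column to within 0.01, which suggests the reported values were rounded from slightly more precise per-model scores.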