Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Advanced Reasoning on ruAIME 2025
Loading...
80
Accuracy
DeepSeek-R1
1.584
21.942
42.3
62.658
Dec 11, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
DeepSeek-R1
Model Category=Open So...
2025.12
80
o4-mini (medium)
Model Category=Open So...
2025.12
77.1
T-pro 2.0
Model Category=Open So...
2025.12
64.6
Qwen3-32B
Model Category=Open So...
2025.12
62.5
RuadaptQwen3-32B-Instruct
Model Category=Open So...
2025.12
45
DeepSeek-R1-Distill-Qwen-32B
Model Category=Open So...
2025.12
40.2
DeepSeek-V3
Model Category=Open So...
2025.12
28.5
Gemma 3 27B
Model Category=Open So...
2025.12
23.1
GPT-4o
Model Category=Open So...
2025.12
6.9
GigaChat 2 Max
Model Category=Open So...
2025.12
6.2
YandexGPT5-Pro
Model Category=Open So...
2025.12
4.6
Feedback
Search any
task
Search any
task