Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Advanced Reasoning on ruAIME 2024
Loading...
80
Accuracy
DeepSeek-R1
3.248
23.174
43.1
63.026
Dec 11, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
DeepSeek-R1
Model Category=Open So...
2025.12
80
o4-mini (medium)
Model Category=Open So...
2025.12
78.1
Qwen3-32B
Model Category=Open So...
2025.12
70.6
T-pro 2.0
Model Category=Open So...
2025.12
70.4
RuadaptQwen3-32B-Instruct
Model Category=Open So...
2025.12
57.5
DeepSeek-R1-Distill-Qwen-32B
Model Category=Open So...
2025.12
51
DeepSeek-V3
Model Category=Open So...
2025.12
31.9
Gemma 3 27B
Model Category=Open So...
2025.12
24.8
GigaChat 2 Max
Model Category=Open So...
2025.12
10.2
GPT-4o
Model Category=Open So...
2025.12
9
YandexGPT5-Pro
Model Category=Open So...
2025.12
6.2
Feedback
Search any
task
Search any
task