Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Advanced Reasoning on ruMATH-500
Loading...
97.2
Accuracy
DeepSeek-R1
42.912
57.006
71.1
85.194
Dec 11, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
DeepSeek-R1
Model Category=Open So...
2025.12
97.2
o4-mini (medium)
Model Category=Open So...
2025.12
95.8
T-pro 2.0
Model Category=Open So...
2025.12
94
Qwen3-32B
Model Category=Open So...
2025.12
93.8
DeepSeek-R1-Distill-Qwen-32B
Model Category=Open So...
2025.12
89.8
DeepSeek-V3
Model Category=Open So...
2025.12
88.2
Gemma 3 27B
Model Category=Open So...
2025.12
86
GPT-4o
Model Category=Open So...
2025.12
76.6
GigaChat 2 Max
Model Category=Open So...
2025.12
70.2
YandexGPT5-Pro
Model Category=Open So...
2025.12
68.2
RuadaptQwen3-32B-Instruct
Model Category=Open So...
2025.12
45
Feedback
Search any
task
Search any
task