Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Advanced Reasoning on ruGPQA Diamond
Loading...
0.773
Accuracy
o4-mini (medium)
0.33724
0.45037
0.5635
0.67663
Dec 11, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
o4-mini (medium)
Model Category=Open So...
2025.12
0.773
DeepSeek-R1
Model Category=Open So...
2025.12
0.763
DeepSeek-V3
Model Category=Open So...
2025.12
0.657
DeepSeek-R1-Distill-Qwen-32B
Model Category=Open So...
2025.12
0.631
Qwen3-32B
Model Category=Open So...
2025.12
0.606
T-pro 2.0
Model Category=Open So...
2025.12
0.591
RuadaptQwen3-32B-Instruct
Model Category=Open So...
2025.12
0.591
GPT-4o
Model Category=Open So...
2025.12
0.51
GigaChat 2 Max
Model Category=Open So...
2025.12
0.475
Gemma 3 27B
Model Category=Open So...
2025.12
0.439
YandexGPT5-Pro
Model Category=Open So...
2025.12
0.354
Feedback
Search any
task
Search any
task