Advanced Reasoning on T-Math
Best result: o4-mini (medium), 63.4 accuracy.
[Chart: Accuracy by date; data point shown at Dec 11, 2025]
Evaluation Results
Method                          Model Category   Date      Accuracy
o4-mini (medium)                Open So...       2025.12   63.4
DeepSeek-R1                     Open So...       2025.12   61.9
T-pro 2.0                       Open So...       2025.12   54.1
Qwen3-32B                       Open So...       2025.12   52.9
RuadaptQwen3-32B-Instruct       Open So...       2025.12   44.4
DeepSeek-V3                     Open So...       2025.12   27.8
DeepSeek-R1-Distill-Qwen-32B    Open So...       2025.12   25.4
Gemma 3 27B                     Open So...       2025.12   20.8
GigaChat 2 Max                  Open So...       2025.12   14.2
YandexGPT5-Pro                  Open So...       2025.12   13
GPT-4o                          Open So...       2025.12   10.6