Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on Olympiad (test)
Loading...
52.1
Accuracy
OpenAI-o1-preview
7.9312
19.3981
30.865
42.3319
Feb 2, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
OpenAI-o1-preview
Code Integration=No
2025.02
52.1
GPT-4o
Code Integration=No
2025.02
43.3
NuminaMath-72B
Code Integration=Yes
2025.02
36.7
AutoCode4Math-Qwen2.5
Code Integration=Auton...
2025.02
32.6
Qwen-2.5-Base-7B
Code Integration=No
2025.02
30.37
AutoCode4Math-DeepSeek
Code Integration=Auton...
2025.02
26.95
AutoCode4Math-Qwen2
Code Integration=Auton...
2025.02
26.37
Dart-Math-Llama3-8B
Code Integration=No
2025.02
23
NuminaMath-7B-CoT
Code Integration=No
2025.02
22.22
Qwen2Math-Base-7B
Code Integration=No
2025.02
21.62
Mathstral-7B
Code Integration=No
2025.02
21.5
DeepseekMath-Instruct-7B
Code Integration=Yes
2025.02
20.44
Dart-Math-DeepSeek-7B
Code Integration=No
2025.02
18.52
Mammoth-Mistral-7B
Code Integration=Yes
2025.02
9.63
Feedback
Search any
task
Search any
task