Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematics on AIME 2024 (Pass@1)
Loading...
0.798
Pass@1
DeepSeek-R1
0.0648
0.25515
0.4455
0.63585
Jan 22, 2025
Pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
DeepSeek-R1
Architecture=MoE, Acti...
2025.01
0.798
OpenAI-o1-1217
2025.01
0.792
OpenAI-o1-mini
2025.01
0.636
DeepSeek-V3
Architecture=MoE, Acti...
2025.01
0.392
Claude-3.5-Sonnet-1022
2025.01
0.16
GPT-4o-0513
2025.01
0.093
Feedback
Search any
task
Search any
task