Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General Knowledge on MMLU (EM)
Loading...
91.8
EM
OpenAI-o1-1217
84.936
86.718
88.5
90.282
Jan 22, 2025
EM
Updated 4d ago
Evaluation Results
Method
Method
Links
EM
OpenAI-o1-1217
2025.01
91.8
DeepSeek-R1
Architecture=MoE, Acti...
2025.01
90.8
DeepSeek-V3
Architecture=MoE, Acti...
2025.01
88.5
Claude-3.5-Sonnet-1022
2025.01
88.3
GPT-4o-0513
2025.01
87.2
OpenAI-o1-mini
2025.01
85.2
Feedback
Search any
task
Search any
task