Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Language Understanding on MMLU-Pro (EM)
Loading...
90.1
Exact Match
Gemini-3.0 Pro
81.676
83.863
86.05
88.237
Dec 2, 2025
Exact Match
Updated 4d ago
Evaluation Results
Method
Method
Links
Exact Match
Gemini-3.0 Pro
temperature=1, context...
2025.12
90.1
Claude-4.5-Sonnet
temperature=1, context...
2025.12
88.2
GPT-5 High
temperature=1, context...
2025.12
87.5
DeepSeek-V3.2
thinking mode=true, te...
2025.12
85
Kimi-K2
thinking mode=true, te...
2025.12
84.6
MiniMax M2
temperature=1, context...
2025.12
82
Feedback
Search any
task
Search any
task