Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General Knowledge on MMLU-redux (Accuracy)
Loading...
90.6
Accuracy
MiMo-V2-Flash Base
53.68
63.265
72.85
82.435
May 30, 2025
Jul 5, 2025
Aug 11, 2025
Sep 17, 2025
Oct 24, 2025
Nov 30, 2025
Jan 6, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
MiMo-V2-Flash Base
# Shots=5-shot, # Acti...
2026.01
90.6
DeepSeek-V3.2 Exp Base
# Shots=5-shot, # Acti...
2026.01
90.4
Kimi-K2 Base
# Shots=5-shot, # Acti...
2026.01
90.2
DeepSeek-V3.1 Base
# Shots=5-shot, # Acti...
2026.01
90
RMoA
Model=GPT-4o
2025.05
86.67
MoA
Model=GPT-4o
2025.05
85.8
SMoA
Model=GPT-4o
2025.05
84.94
GPT-4o
Model=GPT-4o
2025.05
83.73
SMoA
Model=Qwen2.5-7B-Instruct
2025.05
72
RMoA
Model=Qwen2.5-7B-Instruct
2025.05
71.8
Qwen2.5-7B-Instruct
Model=Qwen2.5-7B-Instruct
2025.05
69.9
RMoA
Model=Gemma2-9B-Instruct
2025.05
66.1
SMoA
Model=Gemma2-9B-Instruct
2025.05
65.8
MoA
Model=Gemma2-9B-Instruct
2025.05
65.73
Gemma2-9B-Instruct
Model=Gemma2-9B-Instruct
2025.05
63.9
MoA
Model=Qwen2.5-7B-Instruct
2025.05
62.7
RMoA
Model=Llama3.1-8B-Inst...
2025.05
61.63
SMoA
Model=Llama3.1-8B-Inst...
2025.05
60.86
Llama3.1-8B-Instruct
Model=Llama3.1-8B-Inst...
2025.05
58.6
MoA
Model=Llama3.1-8B-Inst...
2025.05
55.1
Feedback
Search any
task
Search any
task