Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General Knowledge on MMLU
Loading...
11,691.83
Avg Input Context Length (tokens)
PRD
1,269.626
3,975.3905
6,681.155
9,386.9195
Jul 5, 2025
Avg Input Context Length (tokens)
Updated 4d ago
Evaluation Results
Method
Method
Links
Avg Input Context Length (tokens)
PRD
Type=Full Debate
2025.07
11,691.83
RECONCILE
Type=Full Debate
2025.07
8,409.61
MLD
Type=Full Debate
2025.07
7,905.84
ChatEval
Type=Full Debate
2025.07
7,325.2
GD
Type=Part Debate
2025.07
6,156.13
ND
Type=Part Debate
2025.07
4,740.46
Ours
Type=Part Debate
2025.07
3,727.65
MaV
Type=No Debate
2025.07
1,670.48
Feedback
Search any
task
Search any
task