Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Knowledge on MMLU
Loading...
11,691.83
Avg Input Context Length (tokens)
PRD
1,269.626
3,975.3905
6,681.155
9,386.9195
Jul 5, 2025
Avg Input Context Length (tokens)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Avg Input Context Length (tokens)
PRD
Type=Full Debate
2025.07
11,691.83
RECONCILE
Type=Full Debate
2025.07
8,409.61
MLD
Type=Full Debate
2025.07
7,905.84
ChatEval
Type=Full Debate
2025.07
7,325.2
GD
Type=Part Debate
2025.07
6,156.13
ND
Type=Part Debate
2025.07
4,740.46
Ours
Type=Part Debate
2025.07
3,727.65
MaV
Type=No Debate
2025.07
1,670.48
Feedback
Search any
task
Search any
task