Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Robust Reasoning on MMLU-Pro (Accuracy)
Loading...
21
Accuracy
NITP
6.5856
10.3278
14.07
17.8122
May 24, 2026
Accuracy
Updated 8d ago
Evaluation Results
Method
Method
Links
Accuracy
NITP
Model scale=9bA1b, Eva...
2026.05
21
NTP
Model scale=9bA1b, Eva...
2026.05
15.29
NITP
Model scale=3bA0.5b, E...
2026.05
12.29
NTP
Model scale=3bA0.5b, E...
2026.05
11
NITP
Model scale=1.9bA0.3b,...
2026.05
7.47
NTP
Model scale=1.9bA0.3b,...
2026.05
7.14
Feedback
Search any
task
Search any
task