
Knowledge Evaluation on MMLU-Redux 2.0 (Original)

42.03 Accuracy

STOC

[Chart: y-axis ticks 21.126, 26.553, 31.98, 37.407; date label May 11, 2026]
Updated 22d ago

Evaluation Results

Date      Accuracy  Method  Links
2026.05   42.03
2026.05   40.53
2026.05   40.26
2026.05   39.04
2026.05   23.48
2026.05   21.93