Knowledge Evaluation on MMLU-Redux 2.0 (Continual)

33.49 Accuracy

STOC

Updated 2mo ago

Evaluation Results

Method	Links	Accuracy
STOC 2026.05		33.49
STOC 2026.05		32.93
LAMOL 2026.05		31.71
LAMOL 2026.05		28.05
Naive 2026.05		24.39
Naive 2026.05		24.17