Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General Language Understanding on MMLU (Accuracy, Utility Preservation)

76.89Accuracy

Baseline

37.484447.714757.94568.1753Mar 16, 2026Mar 19, 2026Mar 23, 2026Mar 26, 2026Mar 30, 2026Apr 2, 2026Apr 6, 2026
Updated 11d ago

Evaluation Results

MethodLinks
2026.03
76.89-
2026.03
69.8490.8
2026.04
48-
2026.04
47-
2026.04
39-