Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multiple Choice Question Answering on MMLU (Performance Change Tracking)

61.41MMLU Baseline Accuracy (Before)

LLaMA-3.1 8B

58.339559.8747561.4162.94525Feb 11, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
61.41--
2026.02
-62.330.92