Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multiple Choice Question Answering on MMLU (Performance Change Tracking)
Loading...
61.41
MMLU Baseline Accuracy (Before)
LLaMA-3.1 8B
58.3395
59.87475
61.41
62.94525
Feb 11, 2026
MMLU Baseline Accuracy (Before)
MMLU Post-Intervention Accuracy (After)
MMLU Accuracy Gain (Absolute)
Updated 4d ago
Evaluation Results
Method
Method
Links
MMLU Baseline Accuracy (Before)
MMLU Post-Intervention Accuracy (After)
MMLU Accuracy Gain (Absolute)
LLaMA-3.1 8B
Evaluation Protocol=0-...
2026.02
61.41
-
-
CRL-Token
Backbone=LLaMA-3.1 8B,...
2026.02
-
62.33
0.92
Feedback
Search any
task
Search any
task