Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Language Understanding and Reasoning on MMLU, TruthfulQA, HellaSwag, and ARC-Easy (test)

0.082MMLU Score

Unlearn-Smooth

-0.002240.019630.04150.06337May 14, 2026
Updated 19d ago

Evaluation Results

MethodLinks
2026.05
0.0820.0340.480.4570.263
2026.05
0.0710.0080.0720.1260.065
2026.05
0.0290.0170.0470.0580.029
2026.05
0.0110.0480.0240.0180.016
2026.05
0.0110.0450.0010.0190.009
2026.05
0.010.0110.0240.0370.015
2026.05
0.010.0010.020.0140.006
2026.05
0.0080.0060.0130.0350.009
2026.05
0.0070.1010.0040.0380.018
2026.05
0.0070.050.0090.1380.026
2026.05
0.0070.0920.0040.0340.017
2026.05
0.0060.0070.0230.0540.016
2026.05
0.0040.0210.0120.020.002
2026.05
0.0030.0690.0030.0540.002
2026.05
0.0010.0040.0360.0610.023
2026.05
0.0010.10.0250.0540.018