Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Language Understanding on DCLM evaluation suite (test)

62.9HellaSwag Accuracy

LLaMA-3.2

29.72438.33746.9555.563May 18, 2026
Updated 13d ago

Evaluation Results

MethodLinks
2026.05
62.964.867.837.6763775.762.779.960.965.73647.934.464.345.840.523.925.353.1
2026.05
48.347.371.544.56954.169.839.860.855.857.230.331.86.940.640.531.924.328.244.9
2026.05
4848.259.531.16758.27049.469.655.761.432.449.213.959.444.230.525.231.847.6
2026.05
43.542.965.132.56619.369.438.266.351.656.621.12910.153.840.516.723.924.840.6
2026.05
4342.764.432.56723.568.439.764.150.95722.13012.153.338.917.628.324.841.1
2026.05
40.139.758.328.96720.767.642.965.253.651.722.3251354.843.6213025.440.6
2026.05
3130.841.722.4632362.432.257.550.754.713.65.80.328.342.3920.925.832.4