Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Large Language Model Evaluation on LLaMA-3 8B

6.13PPL

baseline

-18,833.6248108,334.7201235,503.065362,671.4099Mar 9, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
6.1367.1662.13
2026.03
7.7562.4153.8
2026.03
8.2661.7549.93
2026.03
8.4159.1247.29
2026.03
17.2648.9729.3
2026.03
471,00036.3425.17