Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Large Language Model Evaluation on LLaMA 3B 3.2

7.81PPL

baseline

-1,483.87768,585.013718,653.90528,722.7963Mar 9, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
7.8162.6654.06
2026.03
9.1558.545.7
2026.03
9.7355.7644.75
2026.03
10.1556.8842.42
2026.03
53.3344.1727.31
2026.03
37,30035.7223.41