
Large Language Model Evaluation on LLaMA 1B 3.2

9.75 Perplexity (PPL)

baseline

[Chart: Perplexity (PPL) by date; Mar 9, 2026]
Updated 1mo ago

Evaluation Results

Date      Perplexity (PPL)   (unlabeled)   (unlabeled)
2026.03   9.75               54.82         36.76
2026.03   12.52              50.44         26.34
2026.03   13.17              50.03         26.64
2026.03   13.47              48.95         26.38
2026.03   69.22              40.04         24.43
2026.03   153,000            35.59         24.37