Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Large Language Model Evaluation on LLaMA-2 13B

4.88Perplexity

baseline

-522.92483,039.75766,602.4410,165.1224Mar 9, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
4.8866.5352.04
2026.03
5.264.847.8
2026.03
5.264.8247.17
2026.03
5.4162.5547.25
2026.03
6.3161.2839.83
2026.03
13,20034.5323.85