Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Language Modeling on WikiText-2 context length 8192 (test)

6.5Perplexity

Llama3.1-8B-Instruct Baseline

-39,993.24230,005.005500,003.25770,001.495Feb 6, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
6.5
2026.02
7.32
2026.02
11.13
2026.02
12.75
2026.02
1,000,000