
Language Modeling on WikiText-2 context length 8192 (test)

6.5 Perplexity

Llama3.1-8B-Instruct Baseline

Feb 6, 2026
Updated 1mo ago

Evaluation Results

Date       Perplexity
2026.02    6.5
2026.02    7.32
2026.02    11.13
2026.02    12.75
2026.02    1,000,000