Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Autoregressive Language Modeling on WikiText-103 (first 10M tokens)

90.5Perplexity (PPL)

TF-GPT

90.03293.19196.3599.509Apr 9, 2026
Updated 9d ago

Evaluation Results

MethodLinks
2026.04
90.5-
2026.04
92.11.8
2026.04
96.36.4
2026.04
98.18.4
2026.04
102.212.9