Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Character-level Language Modeling on text8 100M regime (Forward split)

2.19Forward BPC

Transformer (ctx=1024)

2.15082.41542.682.9446May 30, 2026
Updated 1d ago

Evaluation Results

MethodLinks
2026.05
2.19
2026.05
2.22
2026.05
2.36
2026.05
2.57
2026.05
2.7
2026.05
2.88
2026.05
3.17