Language Modeling on Public Pretraining Dataset (train)

Loss: 1.33

Adam

[Chart: Loss, y-axis ticks 1.32968, 1.33184, 1.334, 1.33616; Apr 10, 2026]
Updated 6d ago

Evaluation Results

Date     Loss   Method  Links
2026.04  1.33
2026.04  1.331
2026.04  1.338
2026.04  1.338