Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Language Modeling on C4 Llama-160M scratch (val)

3.0908Validation Loss

GPA-AdamW

3.0888523.1020013.115153.128299Dec 18, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
3.0908
2025.12
3.0951
2025.12
3.0974
2025.12
3.1051
2025.12
3.1051
2025.12
3.1066
2025.12
3.1089
2025.12
3.1089
2025.12
3.1089
2025.12
3.1089
2025.12
3.1089
2025.12
3.1133
2025.12
3.1141
2025.12
3.1141
2025.12
3.1141
2025.12
3.1141
2025.12
3.1141
2025.12
3.128
2025.12
3.1353
2025.12
3.1395