Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Fineweb-edu

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingFineWeb-Edu (test)
Perplexity (Test)20.7
49
Language ModelingFineWeb-Edu 500M-token (val)
Valid Loss2.221
18
Soft SearchFineWeb-Edu English, 1.4T tokens (test)
Similarity Score100
12
Language ModelingFineWeb-Edu (val)
Final Validation Loss4.2838
8
Language ModelingFineweb-edu distillation 8B to 300M
LM Loss2.74
7
Language ModelingFineWeb-Edu 1.4B tokens (val)
Loss3.271
3
Showing 6 of 6 rows