Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Language Modeling on Fineweb-edu 1.0 (test)

2.32LM Loss

Random Sampling KD (Ours 12+)

2.31282.36142.412.4586Mar 21, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.03
2.32
2025.03
2.34
2025.03
2.35
2025.03
2.37
2025.03
2.4
2025.03
2.5