Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Language Modeling on Pretraining Dataset

3.133Train Loss (PT)

BHyT

3.103243.304123.5053.70588Dec 26, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
3.1333.10722.346
2025.12
3.163.13923.091
2025.12
3.1653.14223.156
2025.12
3.2033.1824.04
2025.12
3.2683.25425.908
2025.12
3.283.27126.342
2025.12
3.2813.27226.353
2025.12
3.2883.27926.545
2025.12
3.7093.69640.294
2025.12
3.8773.85547.244