Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Language Modeling on OOD
Loading...
1.285
Loss
Muon
1.27908
1.31904
1.359
1.39896
Apr 10, 2026
Loss
Updated 6d ago
Evaluation Results
Method
Method
Links
Loss
Muon
Optimizer=Muon, Model...
2026.04
1.285
Adam+Nexus
Optimizer=Adam+Nexus,...
2026.04
1.29
Nexus
Model Size=3B, Optimiz...
2026.04
1.29
AdamW
Optimizer=AdamW, Model...
2026.04
1.302
AdamW
Model Size=3B, Optimiz...
2026.04
1.302
Nexus
Model Size=1B, Optimiz...
2026.04
1.428
AdamW
Model Size=1B, Optimiz...
2026.04
1.433
Feedback
Search any
task
Search any
task