Language Modeling on C4 LLaMA-1.3B (val)

13.13 Perplexity

FOAM-2

Updated 4d ago

Evaluation Results

Method	Links	Perplexity
FOAM-2 2025.12		13.13	4.45
FOAM-3 2025.12		13.19	3.97
FOAM-Mini 2025.12		13.43	3.2
APOLLO-Mini 2025.12		14.18	3.2
APOLLO-1/4 2025.12		14.2	4.76
MUON 2025.12		14.28	5.61
APOLLO-1/8 2025.12		14.32	4.15
Full-Adam 2025.12		14.51	8.03
GWT-Mini 2025.12		14.99	3.2
Adam-Mini 2025.12		15.1	5.35
GaLore-1/4 2025.12		15.66	4.76
GaLore-1/8 2025.12		17.52	4.15