Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Language Modeling on C4 LLaMA-1.3B (val)
Loading...
13.13
Perplexity
FOAM-2
12.9544
14.1397
15.325
16.5103
Dec 8, 2025
Perplexity
Memory (G)
Updated 4d ago
Evaluation Results
Method
Method
Links
Perplexity
Memory (G)
FOAM-2
Training Tokens=13.1B,...
2025.12
13.13
4.45
FOAM-3
Training Tokens=13.1B,...
2025.12
13.19
3.97
FOAM-Mini
Training Tokens=13.1B,...
2025.12
13.43
3.2
APOLLO-Mini
Training Tokens=13.1B,...
2025.12
14.18
3.2
APOLLO-1/4
Training Tokens=13.1B,...
2025.12
14.2
4.76
MUON
Training Tokens=13.1B,...
2025.12
14.28
5.61
APOLLO-1/8
Training Tokens=13.1B,...
2025.12
14.32
4.15
Full-Adam
Training Tokens=13.1B,...
2025.12
14.51
8.03
GWT-Mini
Training Tokens=13.1B,...
2025.12
14.99
3.2
Adam-Mini
Training Tokens=13.1B,...
2025.12
15.1
5.35
GaLore-1/4
Training Tokens=13.1B,...
2025.12
15.66
4.76
GaLore-1/8
Training Tokens=13.1B,...
2025.12
17.52
4.15
Feedback
Search any
task
Search any
task