Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SlimPajama

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingSlimPajama
Perplexity (PPL)2.97
77
Language ModelingSlimPajama latest (val)
Validation Loss3.078
26
Language ModelingSlimPajama (test)
PPL (CommonCrawl)0.5402
23
Language ModelingSlimPajama (val)
Perplexity3.09
13
Language ModelingSlimPajama large-scale (train)
L(ψ)2.129
8
Language ModelingSlimPajama-672B (val)
Validation Perplexity8.09
6
Factuality EvaluationSlimPajama
Pointwise Score63.5
3
Generation Quality and Coherence EvaluationSlimPajama Quality Evaluation (test)
Gen Quality (Std. Prefix)86.3
3
Language ModelingSlimPajama 10M (dev)
Perplexity9.219
3
Self-attention inverse temperature scaling analysisSlimPajama
Tie Percentage6
2
Language ModelingSlimPajama 6B (val)
Validation Perplexity (1B tokens)38.47
2
Language ModelingSlimPajama-6B (train)
Train Loss (1B tokens)3.778
2
Showing 12 of 12 rows