Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SlimPajama

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingSlimPajama (test)
PPL (CommonCrawl)0.5402
23
Language ModelingSlimPajama
Perplexity (PPL)7.4
18
Language ModelingSlimPajama (val)
Perplexity3.09
13
Language ModelingSlimPajama large-scale (train)
L(ψ)2.129
8
Factuality EvaluationSlimPajama
Pointwise Score63.5
3
Generation Quality and Coherence EvaluationSlimPajama Quality Evaluation (test)
Gen Quality (Std. Prefix)86.3
3
Language ModelingSlimPajama 10M (dev)
Perplexity9.219
3
Showing 7 of 7 rows