Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OpenWebText

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingOpenWebText2 (test)
Perplexity16.2
104
Language ModelingOpenWebText (val)
Validation Loss2.6091
70
Text GenerationOpenWebText
Perplexity132.55
66
Unconditional Text GenerationOpenWebText
Gen. PPL11.1
56
Language ModelingOpenWebText
Perplexity11
50
Unconditional generationOpenWebText (OWT) L=1024 (held-out)
MAUVE1
45
Sentiment SteeringOpenWebText Neutral to Negative (test)
Perplexity (PPL)12.48
27
Sentiment SteeringOpenWebText Neutral to Positive (test)
Perplexity (PPL)12.48
27
Unconditional Text GenerationOpenWebText (test)
LLAMA2 Score692.3
21
Embedding Space AnalysisOpenWebText
Iso0.98
18
Language ModelingOpenWebText (test)
Loss2.65
18
Language ModelingOpenWebText standard (test)
Perplexity20.08
17
Language ModelingOpenWebText (held-out set)
PPL11.5
16
Language ModelingOpenWebText GPT-2 (test)
Perplexity17.94
13
Language ModelingOpenWebText (OWT) (val)
Perplexity17.5
12
Unconditional generationOpenWebText L=2048 (test)
Gen. PPL13.2
12
Unconditional generationOpenWebText L=1024 (test)
Generation Perplexity14.1
12
Language ModelingOpenWebText2 (val)
Perplexity17.12
12
Text GenerationOpenWebText (OWT) GPT-2 tokenizer (val)
PPL15.36
12
Language ModelingOpenWebText (train)
Train Loss2.5243
11
Language ModelingOpenWebText GPT-2 124M (val)
LCE3.167
8
Text GenerationOpenWebText (test)
Average Perplexity3.77
8
Language generationOpenWebText (val)
OLMo Perplexity14.2
8
Sentiment SteeringOpenWebText Positive prompts (test)
Negativity Score0.6
8
Sentiment SteeringOpenWebText Negative prompts (test)
Positivity Score0.59
8
Showing 25 of 41 rows