WikiText

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Language Modeling | WikiText-2 (test) | PPL: 1.2 | 2,333 |
| Language Modeling | WikiText | PPL: 0.2838 | 740 |
| Language Modeling | WikiText | Word Perplexity: 3.12 | 234 |
| Language Modeling | WikiText-2 | Perplexity: 4.88 | 105 |
| Language Modeling | Wikitext | Wikitext PPL: 12.85 | 87 |
| Language Modeling | WikiText (test) | Perplexity: 4.88 | 66 |
| Language Modeling | WikiText (val) | Perplexity: 12.51 | 62 |
| Language Modeling | WikiText v1 (test) | Perplexity: 13.33 | 30 |
| Language Modeling | WikiText (held-out) | Perplexity (PPL): 9.8 | 25 |
| Language Modeling | WikiText-103 | Throughput (tokens/s): 159,000 | 21 |
| Language Modeling | WikiText (WT) | Relative PPL Change (%): 31 | 16 |
| Language Modeling | WikiText | PPL Change (%): 1.7 | 16 |
| Language Modeling | WikiText-103 | Bits Per Character (BPC): 2 | 13 |
| Language Modeling | WikiText-2 vLLM harness (test) | Perplexity (PPL): 8.87 | 12 |
| Language Modeling | Wikitext zero-shot | Perplexity: 25.75 | 12 |
| Privacy Measurement | WikiText | Epsilon: 0 | 12 |
| Open-ended Text Generation | Wikitext (test) | Diversity (DIV): 95 | 12 |
| Language Modeling | WikiText-103 20w x 2048 | Perplexity (PPL): 9.603 | 10 |
| Prefilling Profiling | WikiText (test) | Time (s): 38 | 10 |
| Language Modeling | WikiText 1,000-example evaluation slice (test) | Perplexity: 12.723 | 9 |
| Language Modeling | WikiText zero-shot transfer (test) | Perplexity: 33.22 | 8 |
| Language Modeling | WikiText (test) | ROUGE Score: 64.14 | 8 |
| Language Modeling | WikiText 1K | Perplexity: 13.8 | 7 |
| Language Modeling | wikitext | Perplexity (word): 8.0668 | 6 |
| Model Compression Time | WikiText | Compression Time (s): 196.34 | 6 |
Showing 25 of 48 rows