
WikiText-103

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Language Modeling | WikiText-103 (test) | Perplexity: 2.22 | 703 |
| Language Modeling | WikiText-103 (val) | PPL: 1.01 | 261 |
| Language Modeling | WikiText-103 | PPL: 4.59 | 216 |
| Word-level Language Modeling | WikiText-103 word-level (test) | Perplexity: 15.79 | 65 |
| Word-level language modeling | WikiText-103 (dev) | Perplexity: 15.72 | 64 |
| Language Modeling | WikiText-103 v1 (test) | Perplexity: 10.48 | 56 |
| Text Continuation | WikiText-103 512-token continuation (test) | Perplexity (PPL): 1 | 47 |
| Language Modeling | WikiText-103 | Perplexity (PPL): 5.47 | 43 |
| Language Modeling | Wikitext-103 | PPL: 3.14 | 42 |
| Language Generation | WikiText-103 | Perplexity (PPL): 1 | 41 |
| Language Modeling | WikiText-103 zero-shot (test) | PPL: 12.76 | 34 |
| Language Modeling | Wikitext-103 | Perplexity (PPL): 14.8 | 28 |
| Open-ended generation | Wikitext-103 (test) | MAUVE: 0.96 | 26 |
| Tokenization | WikiText-103 | Latency (ms): 1.92 | 25 |
| Language Modeling | WikiText-103 | Delta PPL: 0 | 25 |
| Text generation | Wikitext-103 | Perplexity: 32.88 | 23 |
| Steganographic secret extraction | WikiText-103 W (test) | Accuracy: 75 | 20 |
| Language Modeling | WikiText-103 small setting (test) | Perplexity: 32.8 | 20 |
| Language Modeling | WikiText-103 small setting (val) | Perplexity: 31.8 | 20 |
| Language Modeling | WikiText-103 (train) | PPL: 15.18 | 19 |
| Language Modeling | WikiText-103 | Mauve: 87 | 18 |
| Language Modeling | WikiText-103 | Base Score: 173.704 | 18 |
| Language Modeling | WikiText-103 | Perplexity: 15.33 | 17 |
| Language Modeling | WikiText-103 | Perplexity (PPL): 72 | 15 |
| Text Generation | WikiText-103 | ROUGE-1: 0.391 | 15 |
Showing 25 of 84 rows