Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WikiText-103

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingWikiText-103 (test)
Perplexity2.22
524
Language ModelingWikiText-103 (val)
PPL1.01
180
Language ModelingWikiText-103
PPL4.59
146
Word-level Language ModelingWikiText-103 word-level (test)
Perplexity15.79
65
Word-level language modelingWikiText-103 (dev)
Perplexity15.72
64
Language ModelingWikitext-103
PPL3.14
42
Language ModelingWikiText-103 zero-shot (test)
PPL12.76
34
Open-ended generationWikitext-103 (test)
MAUVE0.96
26
Steganographic secret extractionWikiText-103 W (test)
Accuracy75
20
Language ModelingWikiText-103
Delta PPL0.05
16
Text GenerationWikiText-103
Quality Better Count24
14
Membership Inference AttackWikiText-103
AUC0.784
14
Membership Inference AttackWikiText-103 (test)
AUC0.782
13
Open-ended Text GenerationWikitext-103
PPL2.55
10
Language ModelingWikiText-103 small setting (test)
Perplexity32.8
10
Language ModelingWikiText-103 small setting (val)
Perplexity31.8
10
Autoregressive Language ModelingWikitext-103
PPL18.5
9
Data AttributionWikiText-103 (test)
Tail-patch Score7.88
9
Data AttributionWikiText-103
LDS18.33
8
Language ModelingWikiText-103 (dev)
Perplexity17.9
8
Language ModelingWikiText-103
PPL6.88
8
Document ContinuationWikiText-103 (test)
MAUVE13.96
8
Repetition reductionWikiText-103 (test)
PPL23.82
8
Open-ended text generationWikitext-103 (test)
Win Rate84
8
Autoregressive language modelingWikiText-103 1024 (test)
PPL19
7
Showing 25 of 37 rows