Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WikiText

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingWikiText-2 (test)
PPL2.56
1,541
Language ModelingWikiText
PPL2.92
479
Language ModelingWikiText (test)
Perplexity5.49
52
Language ModelingWikiText (val)
Perplexity21.14
34
Language ModelingWikiText (held-out)
Perplexity (PPL)9.8
25
Language ModelingWikiText v1 (test)
Perplexity13.33
18
Privacy MeasurementWikiText
Epsilon0
12
Open-ended Text GenerationWikitext (test)
Diversity (DIV)95
12
Language ModelingWikiText
Perplexity (Baseline)9.91
11
Prefilling ProfilingWikiText (test)
Time (s)38
10
Language ModelingWikitext zero-shot
Perplexity25.75
10
Language ModelingWikiText (test)
ROUGE Score64.14
8
Language ModelingWikiText 1K
Perplexity13.8
7
Membership Inference AttackWikiText
TPR @ 0.1% FPR14
6
Knowledge EvaluationWikiText (eval)
BPB0.777
6
Masked ReconstructionWikiText-103
PPL4.94
5
Text GenerationWikitext
Coherence: CD Better Rate88.7
4
Language ModelingWikitext
Accuracy28.75
3
Language ModelingWikiText 50 (test)
Normalized Energy0.96
3
Membership Inference AttackWikiText
AUC (LOSS)0.725
3
Open generationWikiText-103
Diversity75
3
Language ModelingWikiText
Accuracy17.2826
2
Handwriting Text RecognitionWikitext 2 column synthetic (test)
CER0.012
2
Handwriting Text RecognitionWikitext 1 column synthetic (test)
CER0.008
2
Text GenerationWikiText (val)
Perplexity (PPL)25.75
1
Showing 25 of 25 rows