Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LAMBADA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingLAMBADA
Accuracy86.9
412
Language ModelingLAMBADA
Perplexity3.1
198
Word PredictionLAMBADA
Accuracy86.9
192
Language ModelingLambada OpenAI
Accuracy70.83
127
Language ModelingLAMBADA (test)
Perplexity4
109
Language ModelingLAMBADA
Accuracy79.4
103
Language ModelingLambada
Perplexity (Lambada)14.36
70
Word PredictionLAMBADA (test)
Accuracy87.15
53
Language ModelingLAMBADA zero-shot (test)
Accuracy (zero-shot)69.12
44
Language ModelingLAMBADA
PPL Change (%)0.2
41
Language ModelingLambada (val)
Perplexity10.14
39
Language ModelingLambada Standard
Accuracy60.8
36
Language ModelingLAMBADA standard (LS)
Accuracy (LAMBADA)65.57
30
Word PredictionLAMBADA OpenAI
Accuracy71.4
29
Language ModelingLAMBADA
Delta (%)43.3
25
Reading ComprehensionLambada
Accuracy80.5
24
Language ModelingLambada (OpenAI split)
PPL3.11
22
Language ModelingLAMBADA multilingual (test)
LAMBADA Score (DE)140.97
20
Word PredictionLAMBADA standard
Accuracy65.57
20
Cloze-style completionLambada OpenAI
Accuracy75.65
20
Word PredictionLAMBADA CONTROL (all)
Accuracy36
20
Language ModelingLAMBADA (dev)
Perplexity12.34
20
Language ModelingLambada
EM Accuracy89.7
18
Language ModelingLAMBADA
PPL11.39
14
Question AnsweringLAMBADA
Accuracy73.2
14
Showing 25 of 66 rows