Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LAMBADA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingLAMBADA
Accuracy86.9
268
Language ModelingLAMBADA
Perplexity3.1
150
Word PredictionLAMBADA
Accuracy86.9
148
Language ModelingLambada OpenAI
Accuracy70.83
127
Language ModelingLAMBADA
Accuracy79.4
76
Language ModelingLAMBADA (test)
Accuracy88.61
71
Word PredictionLAMBADA (test)
Accuracy87.15
53
Language ModelingLAMBADA zero-shot (test)
Accuracy (zero-shot)69.12
44
Language ModelingLambada Standard
Accuracy60.8
36
Language ModelingLAMBADA standard (LS)
Accuracy (LAMBADA)65.57
30
Word PredictionLAMBADA OpenAI
Accuracy71.4
26
Language ModelingLAMBADA
Delta (%)43.3
25
Language ModelingLambada (val)
Perplexity12.37
24
Reading ComprehensionLambada
Accuracy80.5
24
Language ModelingLAMBADA multilingual (test)
LAMBADA Score (DE)140.97
20
Word PredictionLAMBADA standard
Accuracy65.57
20
Cloze-style completionLambada OpenAI
Accuracy75.65
20
Word PredictionLAMBADA CONTROL (all)
Accuracy36
20
Language ModelingLAMBADA (dev)
Perplexity12.34
20
Language ModelingLambada
EM Accuracy89.7
18
Language ModelingLambada (OpenAI split)
PPL3.11
13
Reading ComprehensionLAMBADA (test)
Accuracy66.51
13
Word PredictionLAMBADA CONTROL (context)
Accuracy65.6
13
Language ModelingLAMBADA (control)
Perplexity94
12
Word PredictionLambada
Accuracy (Original)82.2
11
Showing 25 of 46 rows