Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

XStoryCloze

Benchmarks

Task NameDataset NameSOTA ResultTrend
Story ReasoningXStoryCloze
Accuracy58.5
35
Commonsense ReasoningXStoryCloze
Average Score80.93
32
Story CompletionXStoryCloze
Accuracy67.9
20
Commonsense ReasoningXStoryCloze
Accuracy (en)70.4
12
Commonsense ReasoningXStoryCloze Māori
Accuracy-
0
Showing 5 of 5 rows