Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

XStoryCloze

Benchmarks

Task NameDataset NameSOTA ResultTrend
Story ReasoningXStoryCloze
Accuracy71
51
Commonsense ReasoningXStoryCloze
Average Score80.93
39
Story CompletionXStoryCloze
Accuracy67.9
20
Story CompletionXStoryCloze 1.0 (test)
XStoryCloze Accuracy (en)71.5
18
Commonsense ReasoningXStoryCloze
Accuracy (en)70.4
12
Reasoning and Knowledge AssessmentXstorycloze bo
Accuracy72.96
11
Story CompletionXStoryCloze Arabic
Accuracy (Normalized)59.3
10
Multilingual Story CompletionXStoryCloze
Extract Match63.5
4
Commonsense ReasoningXStoryCloze Māori
Accuracy-
0
Showing 9 of 9 rows