WinoGrande

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Commonsense Reasoning | WinoGrande | Accuracy 7,364 | 1,442 |
| Commonsense Reasoning | Winogrande | Accuracy 85.3 | 453 |
| Common sense reasoning | Winogrande | Accuracy 91.3 | 189 |
| Reasoning | WinoGrande (WG) | Accuracy 85.2 | 168 |
| Question Answering | Winogrande (WG) | Accuracy 74.1 | 138 |
| Commonsense Reasoning | Winogrande | Accuracy 76.09 | 103 |
| Commonsense Reasoning | WinoGrande (val) | Accuracy 73.88 | 87 |
| Commonsense Reasoning | WinoGrande 5-shot | Accuracy 92.66 | 85 |
| Commonsense Question Answering | WinoGrande | Accuracy 77.82 | 73 |
| Commonsense Reasoning | Winogrande | Accuracy 69.77 | 68 |
| Coreference Resolution | Winogrande | Accuracy 73.6 | 61 |
| Pronoun Resolution | WinoGrande | Accuracy 89.4 | 58 |
| Zero-shot Reasoning | WinoGrande | Accuracy 70 | 54 |
| Commonsense Reasoning | Winogrande | Accuracy (0-shot) 73.7 | 42 |
| Winograd Schema Challenge | WinoGrande | Accuracy 76.56 | 39 |
| Commonsense Reasoning | WinoGrande standard (test) | Accuracy 80.2 | 39 |
| Language Understanding | WinoGrande | Accuracy 80.82 | 38 |
| Commonsense reasoning | WinoGrande 1.0 (test) | Accuracy 74.1 | 31 |
| Identification of inactive attention heads | WinoGrande | Percentage of Zeroed Heads 20.91 | 30 |
| Natural Language Understanding | Winogrande | Accuracy 59 | 30 |
| Zero-shot Accuracy | WinoGrande | Zero-shot Accuracy 77.3 | 30 |
| Commonsense Reasoning | Winogrande | Accuracy 80.71 | 24 |
| Commonsense Reasoning | Winogrande | Accuracy 82.35 | 23 |
| Commonsense Reasoning | Winogrande | Winogrande Score 73.88 | 22 |
| Commonsense Question Answering | WinoGrande (WG) (val) | Accuracy 78.3 | 21 |
Showing 25 of 82 rows