
WinoGrande

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Commonsense Reasoning | WinoGrande | Accuracy 94.1 | 776 |
| Commonsense Reasoning | Winogrande | Accuracy 85.3 | 231 |
| Common sense reasoning | Winogrande | Accuracy 91.3 | 156 |
| Question Answering | Winogrande (WG) | Accuracy 72.77 | 98 |
| Commonsense Reasoning | WinoGrande (val) | Accuracy 73.88 | 87 |
| Reasoning | WinoGrande (WG) | Accuracy 85.2 | 87 |
| Commonsense Reasoning | Winogrande | Accuracy 69.77 | 45 |
| Commonsense Reasoning | Winogrande | Accuracy (0-shot) 73.7 | 42 |
| Commonsense Reasoning | Winogrande | Accuracy 76.09 | 38 |
| Coreference Resolution | Winogrande | Accuracy 73.2 | 36 |
| Commonsense Reasoning | WinoGrande standard (test) | Accuracy 80.2 | 35 |
| Pronoun Resolution | WinoGrande | Accuracy 89.4 | 35 |
| Zero-shot Accuracy | WinoGrande | Zero-shot Accuracy 77.3 | 30 |
| Zero-shot Reasoning | WinoGrande | Accuracy 69 | 23 |
| Commonsense Question Answering | WinoGrande (WG) (val) | Accuracy 78.3 | 21 |
| LLM Performance Estimation | Winogrande (test) | MAE 1.027 | 20 |
| Commonsense Reasoning | Winogrande | Accuracy 0.8624 | 19 |
| Commonsense Reasoning | Winogrande | Accuracy (Pre-Attack) 73.2 | 19 |
| Commonsense Reasoning | WinoGrande 5-shot | Accuracy 92.66 | 18 |
| Zero-shot Prediction | Winogrande | Accuracy 69.06 | 17 |
| Natural Language Understanding | Winogrande | Accuracy 53.75 | 15 |
| Commonsense reasoning | WinoGrande 1.0 (test) | Accuracy 0.8137 | 15 |
| Coreference Resolution | Winogrande XL | Accuracy 60.5 | 13 |
| Reasoning | Winogrande | Accuracy Improvement 2.14 | 12 |
| Commonsense Reasoning | Winogrande | LIS 3.4756 | 10 |

Showing 25 of 54 rows