Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Commonsense Reasoning Suite

Benchmarks

Task NameDataset NameSOTA ResultTrend
Commonsense ReasoningCommonsense Reasoning Suite BoolQ, PIQA, HellaS, WinoG, ARC-e, ARC-c, OBQA
Average Accuracy70.39
37
Commonsense ReasoningCommonsense Reasoning Suite BoolQ, PIQA, HellaSwag, WinoGrande, ARC-e, ARC-c
BoolQ Accuracy83.03
28
Commonsense ReasoningCommonsense Reasoning Suite (test)
Avg Accuracy0.7418
22
Commonsense ReasoningCommonsense Reasoning Suite (PiQA, Arc-C, WinoGrande, HellaSwag, SciQ, OBQA, BoolQ, Arc-E) (test)
PiQA Accuracy80.79
15
Commonsense ReasoningCommonsense Reasoning Suite (PiQA, Arc-C, WinoGrande, HellaSwag, SciQ, OBQA, BoolQ, Arc-E)
PiQA Accuracy82.21
15
Question AnsweringCommonsense Reasoning Suite (ARC-e, ARC-c, BoolQ, OBQA, PIQA) (test)
ARC-e77.7
8
Showing 6 of 6 rows