Zero-shot Commonsense Reasoning

Benchmarks

Dataset Name	SOTA Method	Metric
ARC-Easy, ARC-Challenge, HellaSwag, PIQA, WinoGrande lm-evaluation-harness (test)		ARC-e Accuracy82.87	43	2mo ago
Commonsense Reasoning Suite		BoolQ Accuracy73.18	32	4mo ago
PIQA zero-shot		Accuracy76.93	28	29d ago
Commonsense Reasoning PIQA HellaSwag WinoGrande ARC-Easy OpenBookQA MathQA (test)		Zero-shot Accuracy59	21	2mo ago
Zero-shot tasks (test)		Average Accuracy61	12	4mo ago
Reasoning Suite Zero-shot (ARC-E, BoolQ, HSwag, LAMBADA, OBQA, PIQA, SocIQA, WinoGr.)	PathMoE	ARC-E Accuracy45.5	9	4mo ago
Standard Commonsense Reasoning Suite (HellaSwag, PIQA, ARC-e, ARC-c, Winogrande, BoolQ, LAMBADA)		HellaSwag Accuracy44.7	7	4mo ago
CSQA	SLEB-pruned LLaMA2-7B	PIQA83.19	6	4mo ago
Winogrande zero-shot		Accuracy (zero-shot)67.17	4	3mo ago

Showing 9 of 9 rows