Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Commonsense Reasoning and Knowledge Suite

Benchmarks

Task NameDataset NameSOTA ResultTrend
Commonsense Reasoning and Knowledge UnderstandingCommonsense Reasoning and Knowledge Suite (ARC, HellaSwag, LAMBADA, PIQA, WinoGrande, MMLU)
ARC-e Accuracy83.42
13
Showing 1 of 1 rows