Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Common Sense Reasoning

Benchmarks

Task NameDataset NameSOTA ResultTrend
Zero-shot Common Sense ReasoningCommon Sense Reasoning
Zero-shot Accuracy70.54
95
Common Sense Reasoning7 common sense reasoning datasets
Average Performance71.72
61
Common Sense ReasoningSix common sense reasoning tasks
Accuracy69.46
15
Common-sense ReasoningCommon-sense Reasoning Average
Average Accuracy58.84
11
Showing 4 of 4 rows