Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SocialIQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Commonsense ReasoningSocialIQA
Accuracy88.1
97
Social Commonsense ReasoningSocialIQA
Accuracy87.11
68
Commonsense Question AnsweringSocialIQA (SIQA) (val)
Accuracy70.7
24
Question AnsweringSocialIQA
Accuracy83.9
16
Ranking correlation with full dataset evaluationSocialIQA
Kendall Correlation0.81
10
Scaling Law PredictionSocialIQA
MAE0.0088
7
Preference alignmentSocialIQA
Preference Alignment87.3
5
AdaptivitySocialIQA
Adaptivity75
4
Commonsense ReasoningSOCIALIQA (dev)
Accuracy73.8
3
Showing 9 of 9 rows