Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HSWAG

Benchmarks

Task NameDataset NameSOTA ResultTrend
Common Sense ReasoningHSWAG
Accuracy0.9751
52
Commonsense ReasoningHSWAG Out-of-Domain (test)
Accuracy42.88
8
Commonsense ReasoningHSwag
Normalized PLL Score27.8
4
Commonsense ReasoningHSWAG French (test)
Accuracy33.5
4
Commonsense ReasoningHSWAG German (test)
Accuracy28.78
4
Showing 5 of 5 rows