Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Winogender

Benchmarks

Task NameDataset NameSOTA ResultTrend
Coreference ResolutionWinogender (test)
Accuracy80.7
11
Coreference ResolutionWinogender (WG) (test)
Accuracy80.8
11
Bias EvaluationWinoGender
EBS0.068
8
Commonsense ReasoningWinoGender
Accuracy0.9681
8
Multiple-choice scoringWinogender
Accuracy0.847
7
Co-reference resolutionWinoGender
Accuracy (All)77.5
4
Pronoun Coreference ResolutionWinogender (test)
Accuracy64.5
3
Coreference ResolutionWinogender
Accuracy62.9
3
Showing 8 of 8 rows