Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Winograd Schema Challenge

Benchmarks

Task NameDataset NameSOTA ResultTrend
Pronoun DisambiguationWinograd Schema Challenge
Accuracy90.1
27
Commonsense ReasoningWinograd Schema Challenge (WSC) (test)
Accuracy75.1
17
Commonsense ReasoningHebrew Winograd Schema Challenge
Accuracy83.45
11
Common Sense ReasoningWinograd Schema Challenge 273 sentences (original)
Accuracy61.5
8
Showing 4 of 4 rows