| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| WSC | UPA | Accuracy98.5 | 116 | 2d ago | |
| CoNLL English 2012 (test) | MUC F1 Score88 | 114 | 3mo ago | ||
| Winogrande | Accuracy73.6 | 61 | 22h ago | ||
| GAP (test) | longdoc | Overall F189.9 | 53 | 3mo ago | |
| OntoNotes | GPT-4 | MUC93.7 | 46 | 26d ago | |
| Winograd WSC273 (test) | Fine-tuned SOTA | Accuracy90.1 | 34 | 3mo ago | |
| WSC | DisambiguSLM | Accuracy@185.2 | 33 | 1mo ago | |
| CoreRes | Accuracy94.79 | 33 | 3mo ago | ||
| LitBank (test) | ImCoref-CeS | Avg. F181.8 | 30 | 21d ago | |
| LitBank 1.0 (test) | longdoc | CoNLL F181 | 27 | 3mo ago | |
| XWinograd | LANG | Accuracy79.9 | 26 | 12d ago | |
| SIMMC 2.1 | Qwen3 | Precision59.98 | 22 | 1mo ago | |
| WSC | HIZOO | Loss0.02 | 20 | 2d ago | |
| CLUEWSC | C-DPO | EM90.98 | 20 | 1mo ago | |
| WSC (test) | PromptAgent | Accuracy82.7 | 19 | 2d ago | |
| XWinograd French | EuroLM | Score69.9 | 18 | 1mo ago | |
| Winograd | PaLM 2-M | Accuracy90.5 | 18 | 2mo ago | |
| English OntoNotes 5.0 (test) | MUC Precision88.6 | 18 | 3mo ago | ||
| CoNLL 2012 | Average F183.1 | 17 | 3mo ago | ||
| LitBank 1.0 (dev) | U-MEM | CoNLL F180.5 | 15 | 3mo ago | |
| Winogrande XL | T0-11B | Accuracy60.5 | 13 | 3mo ago | |
| WSC | KiC-Large | Accuracy65.4 | 13 | 3mo ago | |
| OntoNotes 5.0 (dev) | Wu et al. | CoNLL F183.4 | 13 | 3mo ago | |
| CRAC mini Shared Task 2026 (test) | CorPipeEnsemble | Head-match77.11 | 12 | 5d ago | |
| GENIA coreference | Fine-tuned Constrained Macaw-3B | Macro F191.6 | 12 | 2mo ago |