| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Named Entity Recognition | CoNLL 2003 (test) | F1 Score 98.31 | 539 |
| Grammatical Error Correction | CoNLL 2014 (test) | F0.5 Score 72.2 | 207 |
| Named Entity Recognition | CoNLL English 2003 (test) | F1 Score 94.6 | 135 |
| Coreference Resolution | CoNLL English 2012 (test) | MUC F1 Score 88 | 114 |
| Named Entity Recognition | CoNLL 03 | F1 (Entity) 94.6 | 102 |
| Named Entity Recognition | CoNLL Spanish NER 2002 (test) | F1 Score 95.9 | 98 |
| Chunking | CoNLL 2000 (test) | F1 Score 97.3 | 88 |
| Named Entity Recognition | CoNLL Dutch 2002 (test) | F1 Score 95.7 | 87 |
| Named Entity Recognition | Conll 2003 | F1 Score 96.77 | 86 |
| Named Entity Recognition | CoNLL German 2003 (test) | F1 Score 88.34 | 78 |
| Semantic Role Labeling | CoNLL 2012 (test) | F1 Score 88.59 | 49 |
| Relation Extraction | CONLL04 | Relation Strict F1 78.84 | 43 |
| Span-based Semantic Role Labeling | CoNLL 2005 (Out-of-domain (Brown)) | F1 Score 85.1 | 41 |
| Semantic Role Labeling | CoNLL 2005 (WSJ) | F1 Score 95.5 | 41 |
| Named Entity Recognition | CoNLL 2003 (dev) | F1 Score 97.21 | 40 |
| Semantic Role Labeling | CoNLL Brown 2005 (test) | F1 93.7 | 40 |
| Grammatical Error Correction | CoNLL 2014 | F0.5 65.2 | 39 |
| Joint Entity and Relation Extraction | CONLL04 | Entity F1 93.26 | 33 |
| Dependency Semantic Role Labeling | CoNLL 2009 (test) | F1 Score 93.03 | 32 |
| Semantic Role Labeling | CoNLL WSJ English benchmark 2009 (test) | F1 Score 92.83 | 31 |
| Semantic Role Labeling | CoNLL 2005 (Brown) | F1 Score 92.1 | 31 |
| Semantic Role Labeling | CoNLL WSJ 2005 (test) | Precision 87.13 | 29 |
| Relation Extraction | CoNLL04 (test) | F1 Score 75.8 | 28 |
| Semantic Role Labeling | CoNLL English Brown 2009 (test) | F1 Score 86.05 | 28 |
| Named Entity Recognition | CoNLL (test) | F1 Score (de) 81.12 | 28 |