| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| AMR-to-text generation | LDC2017T10 (test) | BLEU49.72 | 55 | |
| Long Document Classification | LDC benchmark | Overall Performance (HYP)93.8 | 7 | |
| Authorship Verification | LDC Harder | AUC0.935 | 6 | |
| Authorship Verification | LDC Hard | AUC87.2 | 6 | |
| Authorship Verification | LDC Base | AUC86.1 | 6 | |
| AMR Parsing | LDC2017T10 (test) | Smatch (ordinary)74.4 | 6 | |
| Data-to-Text Generation | LDC2017T10 | Fluency Score5.05 | 5 | |
| Machine Translation | LDC Chinese-English (test) | BLEU40.02 | 3 |