| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Summarization | CNN/DM | ROUGE-1 56.22 | 56 |
| Membership Inference Attack | CNN-DM | AUC 0.974 | 36 |
| Summarization | CNN/DM | M Score 10.64 | 35 |
| Text Summarization | CNN/DM | MAT 4.23 | 34 |
| Summarization | CNN/DM | Speedup 3.58 | 32 |
| Summarization | CNN/DM | MAT Score 6.51 | 30 |
| Readability Style Transfer | CNN/DM (test) | FRE Delta 27.42 | 25 |
| Summarization | CNN/DM | ROUGE-1 51.85 | 21 |
| Text summarization | CNN/DM | TPS Score 217.02 | 20 |
| Summarization | CNN/DM | Speedup (vs AR) 2.08 | 19 |
| Summarization | CNN/DM | ROUGE-L 37.52 | 18 |
| Summarization | CNN/DM | Spd Score 1.9 | 18 |
| Extractive Summarization | CNN/DM (test) | ROUGE-1 52.59 | 18 |
| Summarization | CNN/DM 55w 1,000 samples (test) | ROUGE-1 F1 35.8 | 16 |
| Summarization | CNN/DM | ROUGE-1 F1 35.8 | 16 |
| Text Summarization | CNN/DM | ROUGE-2 16.95 | 16 |
| Summarization | CNN-DM | Context Influence 149.33 | 15 |
| Hallucination Detection | CNN/DM | AUROC 76.14 | 14 |
| Input OOD Detection | CNN/DM | AUROC 99.89 | 14 |
| Abstractive Summarization | CNN/DM | ROUGE-1 42.05 | 14 |
| News Summarization | CNN/DM | ROUGE-1 44.59 | 13 |
| Text Summarization | CNN DM | TPS 50.17 | 13 |
| Context Attribution | CNN/DM random subset of 10,000 samples | Log-Probability Drop 1.129 | 12 |
| Summarization Faithfulness | CNN/DM | SummaC 52.85 | 12 |
| Abstractive Summarization | CNN/DM sampled (test) | ROUGE Score 22.86 | 12 |