| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| XSum (test) | ROUGE-260.61 | 246 | 8d ago | ||
| arXiv (test) | Top Down Transformer | ROUGE-164.16 | 161 | 1mo ago | |
| Xsum | ST-MoE | ROUGE-227.1 | 108 | 1mo ago | |
| PubMed (test) | ORACLE | ROUGE-161.99 | 107 | 1mo ago | |
| Arxiv | ROUGE-223.05 | 76 | 1mo ago | ||
| PubMed | LongT5 | ROUGE-150.23 | 70 | 1mo ago | |
| CNN Daily Mail | PEGASUS-2B (calibrated) | ROUGE-147.97 | 67 | 1mo ago | |
| CNNDM | Diversed | ROUGE-212.64 | 62 | 8d ago | |
| bigPatent | OracleFrag | ROUGE-191.85 | 61 | 1mo ago | |
| CNN/DM | ROUGE-156.22 | 56 | 1mo ago | ||
| CNN/Daily Mail original, non-anonymized (test) | Best Previous Abstractive | ROUGE-141.69 | 54 | 1mo ago | |
| LongBench | GovRep Score33.39 | 51 | 1mo ago | ||
| TL;DR (test) | GRPO | Win Rate82.5 | 49 | 1mo ago | |
| XSum | ROUGE-29.16 | 46 | 8d ago | ||
| TL;DR | SignCert-PO | Winrate91.8 | 42 | 12d ago | |
| Newsroom (test) | TLM+E (G,G) | ROUGE-274 | 40 | 1mo ago | |
| Gigaword (test) | Aghajanyan et al. | ROUGE-220.7 | 38 | 1mo ago | |
| Gigaword | UNIMO | ROUGE-L36.88 | 38 | 1mo ago | |
| Newsroom (test) | MARS (default) | Pearson Correlation0.372 | 36 | 1mo ago | |
| CNN/DM | DOUBLE | M Score10.64 | 35 | 1mo ago | |
| CNN/DailyMail (test) | FreeTxt-Vi (fine-tuned Qwen2.5) | ROUGE-L48 | 33 | 1mo ago | |
| CNN/DM | TALON | Speedup3.58 | 32 | 1mo ago | |
| CNNDM (test) | ROUGE 237.82 | 31 | 11d ago | ||
| SAMSum Full 2019 | CIPHER | F1 Score37 | 30 | 1mo ago | |
| SAMSum | CriSPO | BERTScore F191.3 | 30 | 1mo ago |