| SAMSum (test) | InstructDS | ROUGE-2 33 | | 80 | 3mo ago |
| DialogSum | | R-L 48.36 | | 24 | 11d ago |
| ToFuEval | NWCAD | ToFuEval Score 83.12 | | 18 | 1mo ago |
| SAMSum 1.0 (test) | | R1 51 | | 11 | 3mo ago |
| SAMSum | PEGASUS-2B (calibrated) | ROUGE-2 29.88 | | 10 | 3mo ago |
| AMI (test) | SUMM^N | Conciseness 4.13 | | 9 | 3mo ago |
| TODSum (test) | InstructDS | ROUGE-1 89.2 | | 7 | 3mo ago |
| TODSum | InstructDS | ROUGE-1 89.3 | | 7 | 3mo ago |
| SAMSum Multiple Client (test) | Conf | ROUGE-1 (Client 1) 49.99 | | 6 | 5d ago |
| DialogSum Single Client | | ROUGE-1 50.92 | | 6 | 5d ago |
| SAMSum Single Client | | ROUGE-1 50.59 | | 6 | 5d ago |
| SAMSum 200 samples (test) | ChatGPT | Faithfulness 4.94 | | 6 | 3mo ago |
| SAMSum 30 samples (test) | ChatGPT | Faithfulness 4.52 | | 6 | 3mo ago |
| TweetSumm (test) | DIONYSUS | ROUGE-1 30.7 | | 6 | 3mo ago |
| Email (test) | DIONYSUS | ROUGE-1 Score 28.9 | | 6 | 3mo ago |
| Reddit (test) | DIONYSUS | ROUGE-1 24.8 | | 6 | 3mo ago |
| TVMegaSite | BART-LS | ROUGE-1 51.8 | | 6 | 3mo ago |
| SAMSum All-possible Names (test) | Ins | R2 28.44 | | 4 | 3mo ago |
| SAMSum In-distribution Names (test) | Ins | R2 28.79 | | 4 | 3mo ago |
| DialogSum 50 samples (test) | | Informativeness 4.03 | | 3 | 3mo ago |
| SAMSum 50 samples (test) | | Informativeness 4 | | 3 | 3mo ago |
| SAMSum (val) | Pioneer Agent (Qwen3-8B) | ROUGE-2 25.4 | | 2 | 1mo ago |
| ICSI (test) | SUMM^N | Readability 4.12 | | 2 | 3mo ago |