| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Next Token Prediction | QMSum | Next Token Accuracy47 | 32 | |
| Query-based meeting summarization | QMSum (test) | ROUGE-143.8 | 26 | |
| Summarization | QMSum (val) | ROUGE-L0.2378 | 17 | |
| Abstractive Summarization | QMSum | BLEU6.75 | 11 | |
| Synthetic Text Generation | QMSum | Mean Embedding Similarity52 | 10 | |
| Document Summarization | QMSum (test) | ROUGE-138.9 | 10 | |
| Summarization | QMSum | Std Dev ROUGE-10.3 | 8 | |
| Query-focused Summarization | QMSum (test) | ROUGE-138.06 | 7 | |
| Query-based Meeting Summarization | QMSum | ROUGE-L10 | 6 | |
| Query-focused Meeting Summarization | QMSum 50 samples | Fluency4.88 | 6 | |
| Summarization | QMSum (test) | Fluency4.93 | 5 | |
| Next Token Prediction | QMSum | Acc (BERT-Small, Epsilon=Inf)32.82 | 4 | |
| Abstractive Meeting Summarization | QMSum | Coreference1.67 | 4 | |
| Meeting Summarization | QMSum (all turns) | ROUGE-134.03 | 4 | |
| Meeting Summarization | QMSum Gold turns only | ROUGE-140.2 | 3 | |
| Query-based Summarization | QMSum SCROLLS (val) | ROUGE-130.9 | 2 | |
| Transcript Challenge Assessment | QMSum (test) | Spoken Language Score3 | 1 | |
| Meeting Transcript Evaluation | QMSum | Coherence4.5 | 1 |