Disordered-DABS: A Benchmark for Dynamic Aspect-Based Summarization in Disordered Texts
About
Aspect-based summarization has seen significant advancements, especially in structured text. Yet, summarizing disordered, large-scale texts, like those found in social media and customer feedback, remains a significant challenge. Current research largely targets predefined aspects within structured texts, neglecting the complexities of dynamic and disordered environments. Addressing this gap, we introduce Disordered-DABS, a novel benchmark for dynamic aspect-based summarization tailored to unstructured text. Developed by adapting existing datasets for cost-efficiency and scalability, our comprehensive experiments and detailed human evaluations reveal that Disordered-DABS poses unique challenges to contemporary summarization models, including state-of-the-art language models such as GPT-3.5.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Dynamic Aspect-Based Summarization | D-WikiHow (test) | #AbsAspDiff5.5 | 5 | |
| Dynamic Aspect-Based Summarization | D-CnnDM (test) | #AbsAspDiff1.3 | 5 | |
| Dynamic Aspect-Based Summarization | OASUM (test) | AbsAspDiff1 | 5 | |
| Aspect-based Summarization | D-CnnDM (test) | Coherence3.5 | 4 | |
| Aspect Number Prediction | D-CnnDM (test) | Absolute Aspect Number Difference1.28 | 3 | |
| Aspect Number Prediction | D-WikiHow (test) | Abs Aspect Number Difference2.69 | 3 | |
| Aspect Number Prediction | OASUM (test) | Absolute Aspect Number Difference1.48 | 3 |