Ranking Sentences for Extractive Summarization with Reinforcement Learning
About
Single document summarization is the task of producing a shorter version of a document while preserving its principal information content. In this paper we conceptualize extractive summarization as a sentence ranking task and propose a novel training algorithm which globally optimizes the ROUGE evaluation metric through a reinforcement learning objective. We use our algorithm to train a neural summarization model on the CNN and DailyMail datasets and demonstrate experimentally that it outperforms state-of-the-art extractive and abstractive systems when evaluated automatically and by humans.
Shashi Narayan, Shay B. Cohen, Mirella Lapata• 2018
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Text Summarization | CNN/Daily Mail (test) | ROUGE-218.2 | 65 | |
| Summarization | CNN/Daily Mail original, non-anonymized (test) | ROUGE-140 | 54 | |
| Abstractive Summarization | CNN/DailyMail full length F-1 (test) | ROUGE-140 | 48 | |
| Extractive Summarization | CNN/Daily Mail (test) | ROUGE-141 | 36 | |
| Summarization | CNN/DailyMail (test) | 1st Metric35 | 22 | |
| Summarization | CNNDM full-length F1 (test) | ROUGE-140 | 19 | |
| Summarization | CNN/Daily Mail full length (test) | ROUGE-140 | 18 | |
| Summarization | CNN+DailyMail mixed (test) | ROUGE-140 | 9 | |
| Extractive Summarization | DailyMail 75 bytes (test) | ROUGE-124.1 | 7 | |
| Summarization | CNN non-anonymized (test) | ROUGE-130.4 | 5 |
Showing 10 of 14 rows