Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Neural Summarization by Extracting Sentences and Words

About

Traditional approaches to extractive summarization rely heavily on human-engineered features. In this work we propose a data-driven approach based on neural networks and continuous sentence features. We develop a general framework for single-document summarization composed of a hierarchical document encoder and an attention-based extractor. This architecture allows us to develop different classes of summarization models which can extract sentences or words. We train our models on large scale corpora containing hundreds of thousands of document-summary pairs. Experimental results on two summarization datasets demonstrate that our models obtain results comparable to the state of the art without any access to linguistic annotation.

Jianpeng Cheng, Mirella Lapata• 2016

Related benchmarks

TaskDatasetResultRank
SummarizationarXiv (test)
ROUGE-142.24
161
SummarizationCNN/Daily Mail original, non-anonymized (test)
ROUGE-141.13
54
Abstractive SummarizationCNN/DailyMail full length F-1 (test)
ROUGE-140.11
48
Extractive SummarizationCNN/Daily Mail (test)
ROUGE-142.2
36
SummarizationCNNDM full-length F1 (test)
ROUGE-140.11
19
SummarizationDUC 2002 (test)
ROUGE-147.4
18
SummarizationPubMed 2018 (test)
ROUGE-143.89
15
Multimodal SummarizationDaily Mail
ROUGE-141.22
10
SummarizationCNN+DailyMail mixed (test)
ROUGE-135.5
9
Extractive SummarizationDailyMail 75 bytes (test)
ROUGE-122.7
7
Showing 10 of 15 rows

Other info

Follow for update