On Extractive and Abstractive Neural Document Summarization with Transformer Language Models

About

We present a method to produce abstractive summaries of long documents that exceed several thousand words via neural abstractive summarization. We perform a simple extractive step before generating a summary, which is then used to condition the transformer language model on relevant information before being tasked with generating a summary. We show that this extractive step significantly improves summarization results. We also show that this approach produces more abstractive summaries compared to prior work that employs a copy mechanism while still achieving higher rouge scores. Note: The abstract above was not written by the authors, it was generated by one of the models presented in this paper.

Sandeep Subramanian, Raymond Li, Jonathan Pilault, Christopher Pal• 2019

Related benchmarks

Task	Dataset	Result
Summarization	arXiv (test)	ROUGE-146.4	161
Summarization	PubMed (test)	ROUGE-145.01	114
Summarization	arXiv	ROUGE-215.63	76
Summarization	Pubmed	ROUGE-146.32	70
Summarization	bigPatent	ROUGE-139.99	61
Summarization	Newsroom (test)	ROUGE-274	40
Summarization	arXiv original (test)	R-142.32	18
Single-Document Summarization	BIGPATENT (test)	ROUGE-136.41	16
Summarization	PubMed 2018 (test)	ROUGE-145.01	15
Summarization	Newsroom Mixed	ROUGE-138.6	11

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord