Iterative Document Representation Learning Towards Summarization with Polishing

About

In this paper, we introduce Iterative Text Summarization (ITS), an iteration-based model for supervised extractive text summarization, inspired by the observation that it is often necessary for a human to read an article multiple times in order to fully understand and summarize its contents. Current summarization approaches read through a document only once to generate a document representation, resulting in a sub-optimal representation. To address this issue we introduce a model which iteratively polishes the document representation on many passes through the document. As part of our model, we also introduce a selective reading mechanism that decides more accurately the extent to which each sentence in the model should be updated. Experimental results on the CNN/DailyMail and DUC2002 datasets demonstrate that our model significantly outperforms state-of-the-art extractive systems when evaluated by machines and by humans.

Xiuying Chen, Shen Gao, Chongyang Tao, Yan Song, Dongyan Zhao, Rui Yan• 2018

Related benchmarks

Task	Dataset	Result
Extractive Summarization	CNN/Daily Mail (test)	ROUGE-142.4	36
Summarization	CNN/DailyMail (test)	--	33
Summarization	DUC 2002 (test)	ROUGE-147.6	18
Extractive Summarization	DailyMail 75 bytes (test)	ROUGE-127.4	7
Extractive Summarization	CNN (test)	ROUGE-130.8	5

Showing 5 of 5 rows

Other info

Code

Follow for update

@wizwand_team Discord