SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization

About

In this paper, we present a conceptually simple while empirically powerful framework for abstractive summarization, SimCLS, which can bridge the gap between the learning objective and evaluation metrics resulting from the currently dominated sequence-to-sequence learning framework by formulating text generation as a reference-free evaluation problem (i.e., quality estimation) assisted by contrastive learning. Experimental results show that, with minor modification over existing top-scoring systems, SimCLS can improve the performance of existing top-performing models by a large margin. Particularly, 2.51 absolute improvement against BART and 2.50 over PEGASUS w.r.t ROUGE-1 on the CNN/DailyMail dataset, driving the state-of-the-art performance to a new level. We have open-sourced our codes and results: https://github.com/yixinL7/SimCLS. Results of our proposed models have been deployed into ExplainaBoard platform, which allows researchers to understand our systems in a more fine-grained way.

Yixin Liu, Pengfei Liu• 2021

Related benchmarks

Task	Dataset	Result
Summarization	XSum (test)	ROUGE-224.57	276
Abstractive Text Summarization	CNN/Daily Mail (test)	ROUGE-L43.54	169
Summarization	Xsum	ROUGE-224.6	108
Summarization	CNN/DM	ROUGE-146.67	56
Abstractive Summarization	XSum (test)	ROUGE-L39.44	44
Reranking	CNN/DailyMail	R-144.46	15
Text Summarization	CNNDM	ROUGE-222.15	11
Abstractive Summarization	CNNDM (test)	ROUGE-146.67	11
Summarization	CNN/DM human evaluation	--	4
Summarization	XSum 100 sample subset (test)	Informativeness (Pairwise Lose)7	2

Showing 10 of 10 rows

Other info

Code

Follow for update

@wizwand_team Discord