SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization
About
In this paper, we present a conceptually simple while empirically powerful framework for abstractive summarization, SimCLS, which can bridge the gap between the learning objective and evaluation metrics resulting from the currently dominated sequence-to-sequence learning framework by formulating text generation as a reference-free evaluation problem (i.e., quality estimation) assisted by contrastive learning. Experimental results show that, with minor modification over existing top-scoring systems, SimCLS can improve the performance of existing top-performing models by a large margin. Particularly, 2.51 absolute improvement against BART and 2.50 over PEGASUS w.r.t ROUGE-1 on the CNN/DailyMail dataset, driving the state-of-the-art performance to a new level. We have open-sourced our codes and results: https://github.com/yixinL7/SimCLS. Results of our proposed models have been deployed into ExplainaBoard platform, which allows researchers to understand our systems in a more fine-grained way.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Summarization | XSum (test) | ROUGE-224.57 | 231 | |
| Abstractive Text Summarization | CNN/Daily Mail (test) | ROUGE-L43.54 | 169 | |
| Summarization | Xsum | ROUGE-224.6 | 108 | |
| Summarization | CNN/DM | ROUGE-146.67 | 56 | |
| Abstractive Summarization | XSum (test) | ROUGE-L39.44 | 44 | |
| Text Summarization | CNNDM | ROUGE-222.15 | 11 | |
| Abstractive Summarization | CNNDM (test) | ROUGE-146.67 | 11 | |
| Summarization | CNN/DM human evaluation | -- | 4 | |
| Summarization | XSum 100 sample subset (test) | Informativeness (Pairwise Lose)7 | 2 |