Source-side Prediction for Neural Headline Generation
About
The encoder-decoder model is widely used in natural language generation tasks. However, the model sometimes suffers from repeated redundant generation, misses important phrases, and includes irrelevant entities. Toward solving these problems we propose a novel source-side token prediction module. Our method jointly estimates the probability distributions over source and target vocabularies to capture a correspondence between source and target tokens. The experiments show that the proposed model outperforms the current state-of-the-art method in the headline generation task. Additionally, we show that our method has an ability to learn a reasonable token-wise correspondence without knowing any true alignments.
Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, Masaaki Nagata• 2017
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Text Summarization | Gigaword (test) | ROUGE-146.77 | 75 | |
| Machine Translation | IWSLT En-Fr 2016 (test) | BLEU34.37 | 9 | |
| Machine Translation | IWSLT En-De 2016 (test) | BLEU23.05 | 3 | |
| Machine Translation | IWSLT De-En 2016 (test) | BLEU28.18 | 3 | |
| Machine Translation | IWSLT Fr-En 2016 (test) | BLEU Score34.07 | 3 |
Showing 5 of 5 rows