SARG: A Novel Semi Autoregressive Generator for Multi-turn Incomplete Utterance Restoration
About
Dialogue systems in open domain have achieved great success due to the easily obtained single-turn corpus and the development of deep learning, but the multi-turn scenario is still a challenge because of the frequent coreference and information omission. In this paper, we investigate the incomplete utterance restoration which has brought general improvement over multi-turn dialogue systems in recent studies. Meanwhile, jointly inspired by the autoregression for text generation and the sequence labeling for text editing, we propose a novel semi autoregressive generator (SARG) with the high efficiency and flexibility. Moreover, experiments on two benchmarks show that our proposed model significantly outperforms the state-of-the-art models in terms of quality and inference speed.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Incomplete Utterance Restoration | Restoration-200k (test) | Inference Time50 | 6 | |
| Incomplete Utterance Restoration | CANARD (test) | BLEU54.8 | 5 | |
| Utterance Restoration | Restoration-200k (test) | F1 Score62.4 | 5 | |
| Incomplete Utterance Restoration | Restoration-200k Human Evaluation (200 samples) | Quality Score2.7 | 4 | |
| Incomplete Utterance Restoration | CANARD (dev) | BLEU56.93 | 4 |