SARG: A Novel Semi Autoregressive Generator for Multi-turn Incomplete Utterance Restoration

About

Dialogue systems in open domain have achieved great success due to the easily obtained single-turn corpus and the development of deep learning, but the multi-turn scenario is still a challenge because of the frequent coreference and information omission. In this paper, we investigate the incomplete utterance restoration which has brought general improvement over multi-turn dialogue systems in recent studies. Meanwhile, jointly inspired by the autoregression for text generation and the sequence labeling for text editing, we propose a novel semi autoregressive generator (SARG) with the high efficiency and flexibility. Moreover, experiments on two benchmarks show that our proposed model significantly outperforms the state-of-the-art models in terms of quality and inference speed.

Mengzuo Huang, Feng Li, Wuhe Zou, Weidong Zhang• 2020

Related benchmarks

Task	Dataset	Result
Incomplete Utterance Restoration	Restoration-200k (test)	Inference Time50	6
Incomplete Utterance Restoration	CANARD (test)	BLEU54.8	5
Utterance Restoration	Restoration-200k (test)	F1 Score62.4	5
Incomplete Utterance Restoration	Restoration-200k Human Evaluation (200 samples)	Quality Score2.7	4
Incomplete Utterance Restoration	CANARD (dev)	BLEU56.93	4

Showing 5 of 5 rows

Other info

Code

Follow for update

@wizwand_team Discord