Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Felix: Flexible Text Editing Through Tagging and Insertion

About

We present Felix --- a flexible text-editing approach for generation, designed to derive the maximum benefit from the ideas of decoding with bi-directional contexts and self-supervised pre-training. In contrast to conventional sequence-to-sequence (seq2seq) models, Felix is efficient in low-resource settings and fast at inference time, while being capable of modeling flexible input-output transformations. We achieve this by decomposing the text-editing task into two sub-tasks: tagging to decide on the subset of input tokens and their order in the output text and insertion to in-fill the missing tokens in the output not present in the input. The tagging model employs a novel Pointer mechanism, while the insertion model is based on a Masked Language Model. Both of these models are chosen to be non-autoregressive to guarantee faster inference. Felix performs favourably when compared to recent text-editing methods and strong seq2seq baselines when evaluated on four NLG tasks: Sentence Fusion, Machine Translation Automatic Post-Editing, Summarization, and Text Simplification.

Jonathan Mallinson, Aliaksei Severyn, Eric Malmi, Guillermo Garrido• 2020

Related benchmarks

TaskDatasetResultRank
Sentence SimplificationTurkCorpus English (test)
SARI38.13
41
SummarizationSummarization dataset
ROUGE-L F167.8
16
ASR Error CorrectionAISHELL-1 (test)
WER4.63
6
ASR Error CorrectionAISHELL-1 (dev)
WER4.26
6
ASR Error Correctioninternal dataset (test)
WER11.14
6
ASR Error CorrectionInternal Dataset (dev)
WER11.21
6
Sentence FusionDiscoFuse balanced Wikipedia (test)
Exact Match61.3
6
Showing 7 of 7 rows

Other info

Follow for update