A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction

About

We improve automatic correction of grammatical, orthographic, and collocation errors in text using a multilayer convolutional encoder-decoder neural network. The network is initialized with embeddings that make use of character N-gram information to better suit this task. When evaluated on common benchmark test data sets (CoNLL-2014 and JFLEG), our model substantially outperforms all prior neural approaches on this task as well as strong statistical machine translation-based systems with neural and task-specific features trained on the same data. Our analysis shows the superiority of convolutional neural networks over recurrent neural networks such as long short-term memory (LSTM) networks in capturing the local context via attention, and thereby improving the coverage in correcting grammatical errors. By ensembling multiple models, and incorporating an N-gram language model and edit features via rescoring, our novel method becomes the first neural approach to outperform the current state-of-the-art statistical machine translation-based approach, both in terms of grammaticality and fluency.

Shamil Chollampatt, Hwee Tou Ng• 2018

Related benchmarks

Task	Dataset	Result
Grammatical Error Correction	CoNLL 2014 (test)	F0.5 Score54.8	207
Grammatical Error Correction	JFLEG (test)	GLEU57.5	60
Grammatical Error Correction	JFLEG	GLEU57.5	47
Grammatical Error Correction	CoNLL 2014	F0.554.79	39
Grammatical Error Correction	CoNLL-10	F0.5 Score70.14	16
Grammatical Error Correction	JFLEG (dev)	F0.5 Score63.61	7
Grammatical Error Correction	CoNLL-10 SvH	F0.5 Score69.3	7

Showing 7 of 7 rows

Other info

Code

Follow for update

@wizwand_team Discord