Document-Level Neural Machine Translation with Hierarchical Attention Networks
About
Neural Machine Translation (NMT) can be improved by including document-level contextual information. For this purpose, we propose a hierarchical attention model to capture the context in a structured and dynamic manner. The model is integrated in the original NMT architecture as another level of abstraction, conditioning on the NMT model's own previous hidden states. Experiments show that hierarchical attention significantly improves the BLEU score over both a strong NMT baseline and the state of the art in context-aware methods, and that both the encoder and decoder benefit from context in complementary ways.
Lesly Miculicich, Dhananjay Ram, Nikolaos Pappas, James Henderson • 2018
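In concrete terms, the model first summarizes each previous sentence with word-level attention queried by the current hidden state, then attends over those sentence summaries with a second, sentence-level attention, and finally gates the resulting context vector into the NMT state. The sketch below is a minimal single-head PyTorch rendition of that idea, not the authors' implementation: the module name `HierarchicalContext`, the scaled dot-product scoring, and the fixed tensor shapes are simplifying assumptions (the paper uses multi-head attention).

```python
# Minimal sketch of hierarchical attention over previous-sentence hidden
# states, gated into the current NMT state. Assumptions are noted inline.
import torch
import torch.nn as nn
import torch.nn.functional as F

class HierarchicalContext(nn.Module):
    """Two-level attention: word-level within each previous sentence,
    then sentence-level over the resulting sentence summaries."""

    def __init__(self, d: int):
        super().__init__()
        self.word_query = nn.Linear(d, d)  # current state -> word-level query
        self.sent_query = nn.Linear(d, d)  # current state -> sentence-level query
        self.gate = nn.Linear(2 * d, d)    # mixes context with the current state

    def forward(self, h_t, prev_states):
        # h_t: (batch, d) current encoder/decoder hidden state
        # prev_states: (batch, k, n, d) hidden states of k previous
        # sentences with n words each (padding handling omitted here)
        b, k, n, d = prev_states.shape

        # Word-level attention: one summary vector per previous sentence.
        q_w = self.word_query(h_t).view(b, 1, 1, d)
        word_scores = (prev_states * q_w).sum(-1) / d ** 0.5        # (b, k, n)
        word_attn = F.softmax(word_scores, dim=-1)
        sent_summaries = (word_attn.unsqueeze(-1) * prev_states).sum(2)  # (b, k, d)

        # Sentence-level attention over the k sentence summaries.
        q_s = self.sent_query(h_t).unsqueeze(1)
        sent_scores = (sent_summaries * q_s).sum(-1) / d ** 0.5     # (b, k)
        sent_attn = F.softmax(sent_scores, dim=-1)
        context = (sent_attn.unsqueeze(-1) * sent_summaries).sum(1)  # (b, d)

        # Gated combination, so the model can down-weight context
        # when the document-level signal is unhelpful.
        g = torch.sigmoid(self.gate(torch.cat([h_t, context], dim=-1)))
        return g * h_t + (1 - g) * context
```

A dummy call such as `HierarchicalContext(512)(torch.randn(2, 512), torch.randn(2, 3, 20, 512))` returns a `(2, 512)` context-enriched state; the gate lets the model fall back to the plain NMT state when the surrounding sentences add nothing.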
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Machine Translation | IWSLT De-En 2014 (test) | BLEU | 33.97 | 146 |
| Machine Translation | IWSLT En-De 2014 (test) | BLEU | 27.94 | 92 |
| Machine Translation | En -> De (test) | BLEU | 33.16 | 23 |
| English-German document-level translation | News English-German (test) | s-BLEU | 25.03 | 20 |
| English-German document-level translation | TED English-German (test) | s-BLEU | 24.58 | 20 |
| English-German document-level translation | Europarl English-German (test) | s-BLEU | 28.60 | 20 |
| Machine Translation | en-fr (test) | BLEU | 41.95 | 17 |
| Document-Level Machine Translation | TED15 Zh-En 2010-2013 (test) | d-BLEU | 24.00 | 16 |
| Machine Translation | En-Ru (test) | BLEU | 31.23 | 14 |
| Contrastive Translation Evaluation | ContraPro En-Fr | Accuracy | 84.32 | 9 |
Showing 10 of 12 rows.
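The Metric column mixes several scores: plain corpus BLEU, sentence-level s-BLEU (each sentence scored independently against its aligned reference), document-level d-BLEU (a document's sentences are concatenated before scoring, so matched n-grams can cross sentence boundaries), and contrastive accuracy. The following is a minimal sketch of the s-BLEU/d-BLEU distinction using the `sacrebleu` library; the toy sentence pairs and the single-document grouping are illustrative assumptions, not data from the benchmarks above.

```python
# Sketch of s-BLEU vs. d-BLEU scoring with sacrebleu.
import sacrebleu

hyps = ["He saw the bank .", "It was closed ."]  # system output, one line per sentence
refs = ["He saw the bank .", "It was shut ."]    # references, aligned 1:1

# s-BLEU: corpus BLEU over individual, independently aligned sentences.
s_bleu = sacrebleu.corpus_bleu(hyps, [refs]).score

# d-BLEU: concatenate each document's sentences before scoring,
# so n-gram matches may span sentence boundaries. Here all
# sentences are treated as one document for illustration.
d_bleu = sacrebleu.corpus_bleu([" ".join(hyps)], [[" ".join(refs)]]).score

print(f"s-BLEU {s_bleu:.2f}  d-BLEU {d_bleu:.2f}")
```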