Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

About

In this paper, we propose a novel neural network model called RNN Encoder-Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixed-length vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder-Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.

Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio• 2014

Related benchmarks

Task	Dataset	Result
Language Modeling	WikiText-2 (test)	PPL71.8	2333
Time Series Forecasting	ETTm2	MSE2.041	536
Time Series Forecasting	Weather	MSE0.369	497
Language Modeling	Penn Treebank (test)	Perplexity66.3	420
Subjectivity Classification	Subj	Accuracy90.5	343
Text Classification	TREC	Accuracy89.6	281
Question Classification	TREC	Accuracy82.8	262
Language Modeling	WikiText-103 (val)	PPL689.5	261
Time Series Forecasting	Exchange	MSE1.453	227
Time Series Forecasting	Traffic	MSE0.843	211

Showing 10 of 193 rows

...

Other info

Follow for update

@wizwand_team Discord