Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation

About

We introduce a model for constructing vector representations of words by composing characters using bidirectional LSTMs. Relative to traditional word representation models that have independent vectors for each word type, our model requires only a single vector per character type and a fixed set of parameters for the compositional model. Despite the compactness of this model and, more importantly, the arbitrary nature of the form-function relationship in language, our "composed" word representations yield state-of-the-art results in language modeling and part-of-speech tagging. Benefits over traditional baselines are particularly pronounced in morphologically rich languages (e.g., Turkish).

Wang Ling, Tiago Lu\'is, Lu\'is Marujo, Ram\'on Fernandez Astudillo, Silvio Amir, Chris Dyer, Alan W. Black, Isabel Trancoso• 2015

Related benchmarks

Task	Dataset	Result
Natural Language Inference	SNLI (test)	Accuracy82.89	694
Natural Language Inference	SNLI (dev)	Accuracy83.78	71
Natural Language Inference	MultiNLI matched (test)	Accuracy68.89	65
Part-of-Speech Tagging	Penn Treebank (test)	Accuracy97.78	64
Natural Language Inference	MultiNLI mismatched (test)	Accuracy69.76	56
Part-of-Speech Tagging	WSJ (test)	Accuracy97.78	51
POS Tagging	PTB (test)	Accuracy97.78	24
Natural Language Inference	MultiNLI matched (dev)	Accuracy69.76	23
POS Tagging	Turkish (TR) (test)	Accuracy91.59	14
Natural Language Inference	MultiNLI mismatched (dev)	Accuracy70.48	11

Showing 10 of 21 rows

Other info

Code

Follow for update

@wizwand_team Discord