Word Sense Disambiguation using a Bidirectional LSTM

About

In this paper we present a clean, yet effective, model for word sense disambiguation. Our approach leverage a bidirectional long short-term memory network which is shared between all words. This enables the model to share statistical strength and to scale well with vocabulary size. The model is trained end-to-end, directly from the raw text to sense labels, and makes effective use of word order. We evaluate our approach on two standard datasets, using identical hyperparameter settings, which are in turn tuned on a third set of held out data. We employ no external resources (e.g. knowledge graphs, part-of-speech tagging, etc), language specific features, or hand crafted rules, but still achieve statistically equivalent results to the best state-of-the-art systems, that employ no such limitations.

Mikael K{\aa}geb\"ack, Hans Salomonsson• 2016

Related benchmarks

Task	Dataset	Result
Word Sense Disambiguation	SensEval-3 (test)	F1 Score73.4	51
Word Sense Disambiguation	SensEval-2 (test)	F1 Score66.9	35
Word Sense Disambiguation	Senseval-2	F1 Score71.1	20
Word Sense Disambiguation	Senseval-3	F1 Score68.4	20
Word Sense Disambiguation	Senseval-3 English Lexical Sample (test)	Accuracy73.4	13
Word Sense Disambiguation	SemEval Task 13 2015	F1 Score68.3	12
Word Sense Disambiguation	Concatenation of Datasets SE2 SE3 SE13 SE15 (test)	Noun Accuracy0.695	12
Word Sense Disambiguation	SemEval Task 12 2013	F1 Score0.648	12
Word Sense Disambiguation	SensEval-3 English lexical samples	F1 Score73.4	11
Word Sense Disambiguation	Senseval-2 English Lexical Sample (test)	Accuracy66.9	11

Showing 10 of 12 rows

Other info

Code

Follow for update

@wizwand_team Discord