Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Word Sense Disambiguation using a Bidirectional LSTM

About

In this paper we present a clean, yet effective, model for word sense disambiguation. Our approach leverage a bidirectional long short-term memory network which is shared between all words. This enables the model to share statistical strength and to scale well with vocabulary size. The model is trained end-to-end, directly from the raw text to sense labels, and makes effective use of word order. We evaluate our approach on two standard datasets, using identical hyperparameter settings, which are in turn tuned on a third set of held out data. We employ no external resources (e.g. knowledge graphs, part-of-speech tagging, etc), language specific features, or hand crafted rules, but still achieve statistically equivalent results to the best state-of-the-art systems, that employ no such limitations.

Mikael K{\aa}geb\"ack, Hans Salomonsson• 2016

Related benchmarks

TaskDatasetResultRank
Word Sense DisambiguationSensEval-3 (test)
F1 Score73.4
51
Word Sense DisambiguationSensEval-2 (test)
F1 Score66.9
35
Word Sense DisambiguationSenseval-2
F1 Score71.1
20
Word Sense DisambiguationSenseval-3
F1 Score68.4
20
Word Sense DisambiguationSenseval-3 English Lexical Sample (test)
Accuracy73.4
13
Word Sense DisambiguationSemEval Task 13 2015
F1 Score68.3
12
Word Sense DisambiguationConcatenation of Datasets SE2 SE3 SE13 SE15 (test)
Noun Accuracy0.695
12
Word Sense DisambiguationSemEval Task 12 2013
F1 Score0.648
12
Word Sense DisambiguationSensEval-3 English lexical samples
F1 Score73.4
11
Word Sense DisambiguationSenseval-2 English Lexical Sample (test)
Accuracy66.9
11
Showing 10 of 12 rows

Other info

Code

Follow for update