Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SANDWiCH: Semantical Analysis of Neighbours for Disambiguating Words in Context ad Hoc

About

The rise of generative chat-based Large Language Models (LLMs) over the past two years has spurred a race to develop systems that promise near-human conversational and reasoning experiences. However, recent studies indicate that the language understanding offered by these models remains limited and far from human-like performance, particularly in grasping the contextual meanings of words, an essential aspect of reasoning. In this paper, we present a simple yet computationally efficient framework for multilingual Word Sense Disambiguation (WSD). Our approach reframes the WSD task as a cluster discrimination analysis over a semantic network refined from BabelNet using group algebra. We validate our methodology across multiple WSD benchmarks, achieving a new state of the art for all languages and tasks, as well as in individual assessments by part of speech. Notably, our model significantly surpasses the performance of current alternatives, even in low-resource languages, while reducing the parameter count by 72%.

Daniel Guzman-Olivares, Lara Quijano-Sanchez, Federico Liberatore• 2025

Related benchmarks

TaskDatasetResultRank
Word Sense DisambiguationEnglish All-Words Average (test)--
19
Word Sense DisambiguationS10
F1 Score87.5
9
Word Sense Disambiguation42D
F1 Score77.1
9
Word Sense DisambiguationsoftEN
F1 Score89.4
9
Word Sense DisambiguationhardEN
F1 Score53.4
9
Word Sense DisambiguationXL-WSD (test)
Accuracy (English)88.9
5
Showing 6 of 6 rows

Other info

Code

Follow for update