Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora

About

A word's sentiment depends on the domain in which it is used. Computational social science research thus requires sentiment lexicons that are specific to the domains being studied. We combine domain-specific word embeddings with a label propagation framework to induce accurate domain-specific sentiment lexicons using small sets of seed words, achieving state-of-the-art performance competitive with approaches that rely on hand-curated resources. Using our framework we perform two large-scale empirical studies to quantify the extent to which sentiment varies across time and between communities. We induce and release historical sentiment lexicons for 150 years of English and community-specific sentiment lexicons for 250 online communities from the social media forum Reddit. The historical lexicons show that more than 5% of sentiment-bearing (non-neutral) English words completely switched polarity during the last 150 years, and the community-specific lexicons highlight how sentiment varies drastically between different communities.

William L. Hamilton, Kevin Clark, Jure Leskovec, Dan Jurafsky• 2016

Related benchmarks

TaskDatasetResultRank
Sentiment AnalysisTwitter
Accuracy68.59
20
Sentiment ClassificationMovie
Accuracy0.7228
16
Sentiment AnalysisSemEval English 2017 (test)
Macro-F163.45
15
Sentiment AnalysisMovie English (test)
F1 Score0.6818
8
Sentiment AnalysisTwitter English (test)
F1 Score64.53
8
Showing 5 of 5 rows

Other info

Follow for update