Adapt or Get Left Behind: Domain Adaptation through BERT Language Model Finetuning for Aspect-Target Sentiment Classification

About

Aspect-Target Sentiment Classification (ATSC) is a subtask of Aspect-Based Sentiment Analysis (ABSA), which has many applications e.g. in e-commerce, where data and insights from reviews can be leveraged to create value for businesses and customers. Recently, deep transfer-learning methods have been applied successfully to a myriad of Natural Language Processing (NLP) tasks, including ATSC. Building on top of the prominent BERT language model, we approach ATSC using a two-step procedure: self-supervised domain-specific BERT language model finetuning, followed by supervised task-specific finetuning. Our findings on how to best exploit domain-specific language model finetuning enable us to produce new state-of-the-art performance on the SemEval 2014 Task 4 restaurants dataset. In addition, to explore the real-world robustness of our models, we perform cross-domain evaluation. We show that a cross-domain adapted BERT language model performs significantly better than strong baseline models like vanilla BERT-base and XLNet-base. Finally, we conduct a case study to interpret model prediction errors.

Alexander Rietzler, Sebastian Stabinger, Paul Opitz, Stefan Engl• 2019

Related benchmarks

Task	Dataset	Result
Aspect-Term Sentiment Analysis	LAPTOP SemEval 2014 (test)	Macro-F178.74	69
Aspect-level sentiment classification	SemEval Restaurant 2014 (test)	Accuracy87.89	67
Aspect-level sentiment classification	SemEval Laptop 2014 (test)	Accuracy80.23	59
Aspect-based Sentiment Analysis	REST 2014 (test)	ABSA F1 Score80.05	37
Aspect-based Sentiment Analysis	SemEval Task 4 Subtask 2 Restaurant domain 2014 (test)	Accuracy87.89	30
Aspect-level Sentiment Analysis	Laptop L (test)	Accuracy78.96	24
Aspect-based Sentiment Analysis	LAP 2014 (test)	All-Aspect Score78.96	22
Aspect-based Sentiment Analysis	Restaurant14 original (test)	Accuracy87.14	20
Aspect-based Sentiment Analysis	Laptop14 original (test)	Accuracy78.96	20
Aspect-based Sentiment Analysis	SemEval Laptop 2014	F1 Score74.18	19

Showing 10 of 18 rows

Other info

Code

Follow for update

@wizwand_team Discord