Adapt or Get Left Behind: Domain Adaptation through BERT Language Model Finetuning for Aspect-Target Sentiment Classification
About
Aspect-Target Sentiment Classification (ATSC) is a subtask of Aspect-Based Sentiment Analysis (ABSA), which has many applications e.g. in e-commerce, where data and insights from reviews can be leveraged to create value for businesses and customers. Recently, deep transfer-learning methods have been applied successfully to a myriad of Natural Language Processing (NLP) tasks, including ATSC. Building on top of the prominent BERT language model, we approach ATSC using a two-step procedure: self-supervised domain-specific BERT language model finetuning, followed by supervised task-specific finetuning. Our findings on how to best exploit domain-specific language model finetuning enable us to produce new state-of-the-art performance on the SemEval 2014 Task 4 restaurants dataset. In addition, to explore the real-world robustness of our models, we perform cross-domain evaluation. We show that a cross-domain adapted BERT language model performs significantly better than strong baseline models like vanilla BERT-base and XLNet-base. Finally, we conduct a case study to interpret model prediction errors.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Aspect-Term Sentiment Analysis | LAPTOP SemEval 2014 (test) | Macro-F178.74 | 69 | |
| Aspect-level sentiment classification | SemEval Restaurant 2014 (test) | Accuracy87.89 | 67 | |
| Aspect-level sentiment classification | SemEval Laptop 2014 (test) | Accuracy80.23 | 59 | |
| Aspect-based Sentiment Analysis | SemEval Task 4 Subtask 2 Restaurant domain 2014 (test) | Accuracy87.89 | 30 | |
| Aspect-level Sentiment Analysis | Laptop L (test) | Accuracy78.96 | 24 | |
| Aspect-based Sentiment Analysis | Restaurant14 original (test) | Accuracy87.14 | 20 | |
| Aspect-based Sentiment Analysis | Laptop14 original (test) | Accuracy78.96 | 20 | |
| Aspect-based Sentiment Analysis | SemEval Laptop 2014 | F1 Score74.18 | 19 | |
| Aspect-based Sentiment Analysis | SemEval Restaurant 2014 (All) | F1 Score80.05 | 19 | |
| Implicit Sentiment Analysis | Restaurant14 ISA original (test) | Accuracy65.92 | 19 |