SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
About
Recently, sentiment analysis has seen remarkable advance with the help of pre-training approaches. However, sentiment knowledge, such as sentiment words and aspect-sentiment pairs, is ignored in the process of pre-training, despite the fact that they are widely used in traditional sentiment analysis approaches. In this paper, we introduce Sentiment Knowledge Enhanced Pre-training (SKEP) in order to learn a unified sentiment representation for multiple sentiment analysis tasks. With the help of automatically-mined knowledge, SKEP conducts sentiment masking and constructs three sentiment knowledge prediction objectives, so as to embed sentiment information at the word, polarity and aspect level into pre-trained sentiment representation. In particular, the prediction of aspect-sentiment pairs is converted into multi-label classification, aiming to capture the dependency between words in a pair. Experiments on three kinds of sentiment tasks show that SKEP significantly outperforms strong pre-training baseline, and achieves new state-of-the-art results on most of the test datasets. We release our code at https://github.com/baidu/Senta.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Sentiment Analysis | SST-2 (test) | Accuracy97 | 136 | |
| Aspect-level sentiment classification | SemEval Restaurant 2014 (test) | Accuracy88.01 | 67 | |
| Stock movement classification | Astock in-distribution 1.0 | Accuracy60.66 | 21 | |
| Aspect-level Sentiment Analysis | SemEval Task 4 Laptop 2014 (test) | Accuracy81.47 | 19 | |
| Stock trend prediction | Astock 2020-10-01 to 2020-12-31 (test) | Accuracy60.66 | 19 | |
| Opinion Role Labeling | MPQA Target 2.0 (test) | -- | 16 | |
| Opinion Role Labeling | MPQA Holder 2.0 (test) | Binary F1 Score85.77 | 5 | |
| Sentiment Analysis (Sentence-Level) | Amazon-2 v1 (test) | Accuracy97.56 | 5 |