FinBERT: Financial Sentiment Analysis with Pre-trained Language Models
About
Financial sentiment analysis is a challenging task due to the specialized language and lack of labeled data in that domain. General-purpose models are not effective enough because of the specialized language used in a financial context. We hypothesize that pre-trained language models can help with this problem because they require fewer labeled examples and they can be further trained on domain-specific corpora. We introduce FinBERT, a language model based on BERT, to tackle NLP tasks in the financial domain. Our results show improvement in every measured metric on current state-of-the-art results for two financial sentiment analysis datasets. We find that even with a smaller training set and fine-tuning only a part of the model, FinBERT outperforms state-of-the-art machine learning methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Named Entity Recognition | NER | -- | 40 | |
| Sentiment Analysis | FOMC | -- | 26 | |
| Disease prediction | Haodf Diabetes | Hit Rate @ 146.35 | 16 | |
| Disease prediction | Haodf Coronary Heart Disease | Hit Rate @ 121.57 | 16 | |
| Disease prediction | Haodf Common Cold | Hit@12.66 | 16 | |
| Disease prediction | Haodf Pneumonia | Hit@115.51 | 16 | |
| Disease prediction | Haodf Depression | Hit Rate @ 10.3302 | 16 | |
| Disease prediction | Haodf Lung | Hit Rate @ 147.78 | 16 | |
| Named Entity Recognition | Payment Transaction Dataset 1.0 (test) | Precision94.8 | 11 | |
| Financial Natural Language Processing | FinDATA | TSA0.2054 | 10 |