WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain
About
Pre-trained language models have shown impressive performance on a variety of tasks and domains. Previous research on financial language models usually employs a generic training scheme to train standard model architectures, without completely leveraging the richness of the financial data. We propose a novel domain specific Financial LANGuage model (FLANG) which uses financial keywords and phrases for better masking, together with span boundary objective and in-filing objective. Additionally, the evaluation benchmarks in the field have been limited. To this end, we contribute the Financial Language Understanding Evaluation (FLUE), an open-source comprehensive suite of benchmarks for the financial domain. These include new benchmarks across 5 NLP tasks in financial domain as well as common benchmarks used in the previous research. Experiments on these benchmarks suggest that our model outperforms those in prior literature on a variety of NLP tasks. Our models, code and benchmark data are publicly available on Github and Huggingface.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Named Entity Recognition | NER | -- | 40 | |
| Sentiment Analysis | FOMC | -- | 26 | |
| Classification | Headline | F1 Score91.06 | 9 | |
| Financial Entity Recognition | FiNER | F1 Score81.52 | 9 | |
| Sentiment Analysis | Financial PhraseBank (FPB) | Accuracy84.76 | 9 | |
| Question Answering | FinQA | Prog Acc49.17 | 9 |