| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Financial Question Answering | FinQA (test) | Accuracy76.05 | 42 | |
| Numerical Question Answering | FinQA (test) | Execution Accuracy91.16 | 33 | |
| Financial Reasoning | FinQA | Accuracy77.1 | 19 | |
| Financial Open-ended QA | FinQA (test) | Token Accuracy29.67 | 16 | |
| Financial Open-ended Question Answering | FinQA (test) | Token Perplexity3.9697 | 16 | |
| Numerical Question Answering | FinQA 1.0 (test) | Execution Accuracy91.16 | 14 | |
| Question Answering | FinQA (val) | Execution Accuracy0.6122 | 10 | |
| Question Answering | FinQA | Prog Acc59.37 | 9 | |
| RAG Poisoning Attack (Document-Level Targeting) | FinQA | RSR@547.1 | 7 | |
| Fact-Level RAG Poisoning Attack | FinQA | RSR@599.8 | 7 | |
| Numerical Reasoning Question Answering | FinQA v1 (dev) | Execution Accuracy72.91 | 7 | |
| Fact Retrieval | FinQA (test) | Recall@393.31 | 7 | |
| Fact Retrieval | FinQA (dev) | R@395.03 | 7 | |
| Table Question Answering | FinQA (dev) | Accuracy59 | 4 |