| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Table Question Answering | TAT-QA | Accuracy 75.88 | 45 |
| Multi-hop Reasoning | TAT-QA | F1 Score 81 | 21 |
| Question Answering | TAT-QA (held-out) | Accuracy 59.33 | 21 |
| Table Question Answering | TAT-QA | Execution Match (EM) 77.78 | 17 |
| Table Question Answering | TAT-QA (test) | Accuracy 60.75 | 15 |
| Financial Question Answering | TAT-QA (test) | Execution Accuracy 74.96 | 9 |
| Question Answering | TAT-QA | HalRate 31.5 | 9 |
| Question Answering | TAT-QA | Accuracy 75.7 | 9 |
| Question Answering | TAT-QA 1.0 (test) | EM 84.1 | 6 |
| Question Answering | TAT-QA 1.0 (dev) | EM 55.2 | 5 |
| Question Answering | TAT-QA (eval) | Risk 8.2 | 4 |
| Numerical Reasoning | TAT-QA (dev) | Exact Match (EM) 59.1 | 4 |
| Question Answering | TAT-QA (dev) | EM 59.1 | 4 |
| Table structure recognition | TAT-QA (held-out) | TEDS Score 70 | 3 |