| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Table Question Answering | TableBench 1.0 (test) | Accuracy 48.46 | 136 |
| Fact Checking | TableBench (test) | Accuracy 85.42 | 136 |
| Complex Tabular Reasoning | TableBench | TB-DA 68.8 | 45 |
| Table Question Answering | TableBench | EM 62 | 40 |
| Table Reasoning | TABLEBENCH | Accuracy 63.48 | 39 |
| Table Reasoning | TableBench | DP 44.54 | 31 |
| Table Reasoning | TableBench | Pass@1 72.52 | 21 |
| Numerical Reasoning | TableBench (test) | Accuracy 64.48 | 13 |
| Table Chain of Thought Reasoning | TableBench | Rge 54.28 | 13 |
| Symbolic Chain of Thought Reasoning | TableBench | Rge 1.99 | 13 |
| Program of Thought Reasoning | TableBench | Rge Score 51.96 | 13 |
| Data Processing | TableBench | Rge 52.18 | 13 |
| Data Analysis | TableBench (test) | Accuracy 26.63 | 10 |
| Table Question Answering | TableBench (test) | Overall Accuracy 0.4887 | 10 |
| STRING | TableBench | LLM Judge Accuracy 32 | 9 |
| Agentic Task Solving | TableBench | Pass@3 43 | 9 |
| String Extraction | TableBench | Exact Match 31.6 | 9 |
| Table-QA | TableBench | Quality Score 33.6 | 4 |