| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| WikiTQ | GPT-5-nano | Accuracy91.84 | 149 | 20d ago | |
| WikiTQ (test) | Commented Reasoning Framework + Answer Selector with Table-R1 | Accuracy84.3 | 140 | 14d ago | |
| TableBench 1.0 (test) | Qwen3-8B | Accuracy48.46 | 136 | 13d ago | |
| HiTab | Accuracy94.1 | 121 | 1mo ago | ||
| WTQ | Accuracy91.25 | 101 | 3mo ago | ||
| TABMWP | Qwen3-VL-8B + TABQAWORLD | Accuracy94.97 | 97 | 1mo ago | |
| WikiTableQuestions (test) | TabLaP | Accuracy76.6 | 86 | 3mo ago | |
| NQTables (test) | ReasonBERTT | F1 Score72.5 | 71 | 1mo ago | |
| NQ-Table | QUIETT | F1 Score80.1 | 63 | 1mo ago | |
| WTQ (test) | T5 + TC + P | Denotation Accuracy64.7 | 62 | 2mo ago | |
| AIT-QA | Accuracy93.5 | 58 | 1mo ago | ||
| WikiSQL (test) | SYNTQA (Oracle) | Accuracy95.1 | 55 | 3mo ago | |
| HiTab | QUIETT | F1 Score84.41 | 50 | 3mo ago | |
| SequentialQA | QUIETT | F1 Score73 | 50 | 3mo ago | |
| WikiTQ | QUIETT | F1 Score79.8 | 50 | 3mo ago | |
| Financial TableQA | Qwen-2.5-7B | Execution Accuracy85.51 | 48 | 1mo ago | |
| WikiSQL | Accuracy92.07 | 47 | 23h ago | ||
| TAT-QA | Qwen3-VL-8B + TABQAWORLD | Accuracy75.88 | 45 | 1mo ago | |
| TableBench | MATA | EM62 | 40 | 3mo ago | |
| Penguins in a Table | SynTQA | EM96.5 | 40 | 3mo ago | |
| HiTab | TableLlama + Oracle | Accuracy64.71 | 30 | 3mo ago | |
| WikiTQ | DataFactory | Accuracy77.2 | 29 | 2mo ago | |
| WikiTable Questions (WTQ) | Qwen3-VL-8B + TABQAWORLD | Accuracy88.89 | 28 | 1mo ago | |
| WTQ (Evaluation Set) | LLAMA-3.1-8B-INST | Alignment Score47 | 24 | 1mo ago | |
| WTQ (train) | QWEN2.5-14B-INST | Alignment Score44 | 24 | 1mo ago |