Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TraceBack: Multi-Agent Decomposition for Fine-Grained Table Attribution

About

Question answering (QA) over structured tables requires not only accurate answers but also transparency about which cells support them. Existing table QA systems rarely provide fine-grained attribution, so even correct answers often lack verifiable grounding, limiting trust in high-stakes settings. We address this with TraceBack, a modular multi-agent framework for scalable, cell-level attribution in single-table QA. TraceBack prunes tables to relevant rows and columns, decomposes questions into semantically coherent sub-questions, and aligns each answer span with its supporting cells, capturing both explicit and implicit evidence used in intermediate reasoning steps. To enable systematic evaluation, we release CITEBench, a benchmark with phrase-to-cell annotations drawn from ToTTo, FetaQA, and AITQA. We further propose FairScore, a reference-less metric that compares atomic facts derived from predicted cells and answers to estimate attribution precision and recall without human cell labels. Experiments show that TraceBack substantially outperforms strong baselines across datasets and granularities, while FairScore closely tracks human judgments and preserves relative method rankings, supporting interpretable and scalable evaluation of table-based QA.

Tejas Anvekar, Junha Park, Rajat Jha, Devanshu Gupta, Poojah Ganesan, Puneeth Mathur, Vivek Gupta• 2026

Related benchmarks

TaskDatasetResultRank
Cell-level attributionToTTo (gold set)
Precision56.89
6
Cell-level attributionFetaQA (gold set)
Precision0.6373
6
Cell-level attributionAITQA (gold set)
Precision63.44
6
Cell-level attributionToTTo
Precision74.2
6
Cell-level attributionFeTaQA
Precision0.8981
6
Cell-level attributionAITQA
Precision73.7
6
Column-Level AttributionFeTaQA
Precision (%)96.39
6
Column-Level AttributionAITQA
Precision (%)79.2
6
Row-Level AttributionToTTo
Precision77
6
Row-Level AttributionFeTaQA
Precision94.3
6
Showing 10 of 12 rows

Other info

Follow for update