EvidFuse: Writing-Time Evidence Learning for Consistent Text-Chart Data Reporting
About
Data-driven reports communicate decision-relevant insights by tightly interleaving narrative text with charts grounded in underlying tables. However, current LLM-based systems typically generate narratives and visualizations in staged pipelines, following either a text-first-graph-second or a graph-first-text-second paradigm. These designs often lead to chart-text inconsistency and insight freezing, where the intermediate evidence space becomes fixed and the model can no longer retrieve or construct new visual evidence as the narrative evolves, resulting in shallow and predefined analysis. To address the limitations, we propose \textbf{EvidFuse}, a training-free multi-agent framework that enables writing-time text-chart interleaved generation for data-driven reports. EvidFuse decouples visualization analysis from long-form drafting via two collaborating components: a \textbf{Data-Augmented Analysis Agent}, equipped with Exploratory Data Analysis (EDA)-derived knowledge and access to raw tables, and a \textbf{Real-Time Evidence Construction Writer} that plans an outline and drafts the report while intermittently issuing fine-grained analysis requests. This design allows visual evidence to be constructed and incorporated exactly when the narrative requires it, directly constraining subsequent claims and enabling on-demand expansion of the evidence space. Experiments demonstrate that EvidFuse attains the top rank in both LLM-as-a-judge and human evaluations on chart quality, chart-text alignment, and report-level usefulness.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Automatic Report Evaluation | Tableau | Readability Score1.85 | 12 | |
| Automatic Report Evaluation | OurWorld InData | Readability1.5 | 12 | |
| Data-driven Report Generation | Tableau | Figure Count8.3 | 8 | |
| Data-driven Report Generation | OurWorldInData | Average Figures Used8.8 | 8 | |
| Data-driven Report Generation | USAFact | Average Figures Used6.55 | 8 | |
| Automatic Report Evaluation | USAFact | Readability1.5 | 8 | |
| Report Generation | Tableau | API Score41.85 | 4 | |
| Report Generation | OurWorldInData | API Interactions43.2 | 4 | |
| Report Generation | USAFact | API Score33.13 | 4 | |
| Data-driven Report Generation | Tableau (test) | Readability1.9 | 4 |