EvidFuse: Writing-Time Evidence Learning for Consistent Text-Chart Data Reporting

About

Data-driven reports communicate decision-relevant insights by tightly interleaving narrative text with charts grounded in underlying tables. However, current LLM-based systems typically generate narratives and visualizations in staged pipelines, following either a text-first-graph-second or a graph-first-text-second paradigm. These designs often lead to chart-text inconsistency and insight freezing, where the intermediate evidence space becomes fixed and the model can no longer retrieve or construct new visual evidence as the narrative evolves, resulting in shallow and predefined analysis. To address the limitations, we propose \textbf{EvidFuse}, a training-free multi-agent framework that enables writing-time text-chart interleaved generation for data-driven reports. EvidFuse decouples visualization analysis from long-form drafting via two collaborating components: a \textbf{Data-Augmented Analysis Agent}, equipped with Exploratory Data Analysis (EDA)-derived knowledge and access to raw tables, and a \textbf{Real-Time Evidence Construction Writer} that plans an outline and drafts the report while intermittently issuing fine-grained analysis requests. This design allows visual evidence to be constructed and incorporated exactly when the narrative requires it, directly constraining subsequent claims and enabling on-demand expansion of the evidence space. Experiments demonstrate that EvidFuse attains the top rank in both LLM-as-a-judge and human evaluations on chart quality, chart-text alignment, and report-level usefulness.

Huanxiang Lin, Qianyue Wang, Jinwu Hu, Bailin Chen, Qing Du, Mingkui Tan• 2026

Related benchmarks

Task	Dataset	Result
Automatic Report Evaluation	Tableau	Readability Score1.85	12
Automatic Report Evaluation	OurWorld InData	Readability1.5	12
Data-driven Report Generation	Tableau	Figure Count8.3	8
Data-driven Report Generation	OurWorldInData	Average Figures Used8.8	8
Data-driven Report Generation	USAFact	Average Figures Used6.55	8
Automatic Report Evaluation	USAFact	Readability1.5	8
Report Generation	Tableau	API Score41.85	4
Report Generation	OurWorldInData	API Interactions43.2	4
Report Generation	USAFact	API Score33.13	4
Data-driven Report Generation	Tableau (test)	Readability1.9	4

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord