EvidFuse: Writing-Time Evidence Learning for Consistent Text-Chart Data Reporting

About

Data-driven reports communicate decision-relevant insights by tightly interleaving narrative text with charts grounded in underlying tables. However, current LLM-based systems typically generate narratives and visualizations in staged pipelines, following either a text-first-graph-second or a graph-first-text-second paradigm. These designs often lead to chart-text inconsistency and insight freezing, where the intermediate evidence space becomes fixed and the model can no longer retrieve or construct new visual evidence as the narrative evolves, resulting in shallow and predefined analysis. To address these limitations, we propose EvidFuse, a training-free multi-agent framework that enables writing-time text-chart interleaved generation for data-driven reports. EvidFuse decouples visualization analysis from long-form drafting via two collaborating components: a Data-Augmented Analysis Agent, equipped with Exploratory Data Analysis (EDA)-derived knowledge and access to raw tables, and a Real-Time Evidence Construction Writer that plans an outline and drafts the report while intermittently issuing fine-grained analysis requests. This design allows visual evidence to be constructed and incorporated exactly when the narrative requires it, directly constraining subsequent claims and enabling on-demand expansion of the evidence space. Experiments demonstrate that EvidFuse attains the top rank in both LLM-as-a-judge and human evaluations on chart quality, chart-text alignment, and report-level usefulness.
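
The writer/analyst split described in the abstract can be illustrated with a small control-flow sketch. This is not the authors' implementation: the class and function names (`AnalysisAgent`, `Evidence`, `write_report`), the toy table, and the rule-based "analysis" are all hypothetical stand-ins for what would be LLM calls in the real system. It only shows the idea that evidence is requested while drafting, and that each claim is built from the evidence returned.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Evidence:
    """A chart specification plus the finding it supports (hypothetical type)."""
    chart_spec: dict
    finding: str


class AnalysisAgent:
    """Toy stand-in for an analysis agent with access to the raw table."""

    def __init__(self, table: list[dict]):
        self.table = table
        # Toy "EDA-derived knowledge": the mean of each numeric column.
        self.eda = {
            col: mean(row[col] for row in table)
            for col, value in table[0].items()
            if isinstance(value, (int, float))
        }

    def analyze(self, x: str, y: str) -> Evidence:
        """Answer a fine-grained request: how does column y vary across x?"""
        peak = max(self.table, key=lambda row: row[y])
        chart_spec = {
            "type": "bar",
            "x": [row[x] for row in self.table],
            "y": [row[y] for row in self.table],
        }
        finding = f"{y} peaks at {peak[y]} for {x}={peak[x]} (mean {self.eda[y]:.1f})"
        return Evidence(chart_spec, finding)


def write_report(outline, agent: AnalysisAgent) -> list[str]:
    """Draft section by section, requesting evidence only when a section needs it."""
    report = []
    for title, request in outline:
        if request is None:
            # Narrative-only section: no analysis request is issued.
            report.append(f"{title}: (narrative only)")
            continue
        # Writing-time request: the chart is built now, and the claim is
        # written from the returned finding rather than from a fixed plan.
        evidence = agent.analyze(*request)
        report.append(f"{title}: {evidence.finding} [chart: {evidence.chart_spec['type']}]")
    return report


table = [
    {"region": "North", "sales": 120},
    {"region": "South", "sales": 90},
    {"region": "West", "sales": 150},
]
outline = [("Introduction", None), ("Regional sales", ("region", "sales"))]
report = write_report(outline, AnalysisAgent(table))
for line in report:
    print(line)
```

In this sketch the outline fixes which sections exist, but the evidence itself is produced only when the writer reaches a section that asks for it, which is the contrast with a staged text-first or chart-first pipeline.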

Huanxiang Lin, Qianyue Wang, Jinwu Hu, Bailin Chen, Qing Du, Mingkui Tan • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Automatic Report Evaluation | Tableau | Readability Score: 1.85 | 12 |
| Automatic Report Evaluation | OurWorldInData | Readability: 1.5 | 12 |
| Data-driven Report Generation | Tableau | Figure Count: 8.3 | 8 |
| Data-driven Report Generation | OurWorldInData | Average Figures Used: 8.8 | 8 |
| Data-driven Report Generation | USAFact | Average Figures Used: 6.55 | 8 |
| Automatic Report Evaluation | USAFact | Readability: 1.5 | 8 |
| Report Generation | Tableau | API Score: 41.85 | 4 |
| Report Generation | OurWorldInData | API Interactions: 43.2 | 4 |
| Report Generation | USAFact | API Score: 33.13 | 4 |
| Data-driven Report Generation | Tableau (test) | Readability: 1.9 | 4 |
Showing 10 of 13 rows
