Data-driven Report Generation on OurWorldInData (test)
[Chart: Readability Score. Highlighted point: Direct, 3.7 (Jan 9, 2026).]
Updated 4d ago
Evaluation Results
| Method | Base Model | Date | Readability Score | Layout Quality Score | Text-Content Consistency Score | Analysis Depth Score | Information Completeness Score | Visual Consistency Score |
|---|---|---|---|---|---|---|---|---|
| Direct | Qwen3-VL-32... | 2026.01 | 3.7 | 3.9 | 3 | 3.9 | 3.8 | 3.4 |
| DataNarrative | Qwen3-VL-32... | 2026.01 | 2.7 | 2.3 | 2.8 | 2.3 | 2.3 | 2.3 |
| DeepAnalyze | Qwen3-VL-32... | 2026.01 | 2.1 | 2.2 | 2.9 | 2.1 | 2 | 2.3 |
| EvidFuse | Qwen3-VL-32... | 2026.01 | 1.5 | 1.6 | 1.3 | 1.7 | 1.9 | 2 |