Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Explanation generation on AIChartClaim (test)
Loading...
42.9
ROUGE-1
MEVER
24.804
29.502
34.2
38.898
Feb 10, 2026
ROUGE-1
ROUGE-2
BLEU-4
Updated 4d ago
Evaluation Results
Method
Method
Links
ROUGE-1
ROUGE-2
BLEU-4
MEVER
evidence_setting=gold...
2026.02
42.9
21.9
11.6
ChartGemma
evidence_setting=gold...
2026.02
42.2
21.3
10.4
MEVER
evidence_setting=gold...
2026.02
41.8
20.8
10.6
UniChart
evidence_setting=gold...
2026.02
41.6
20.9
11
ECENet
evidence_setting=gold...
2026.02
41.5
20.4
10.2
DePlot+FlanT5
evidence_setting=gold...
2026.02
39.7
19.4
8.9
MochegModel
evidence_setting=gold...
2026.02
39.5
19.3
8.9
JustiLM
evidence_setting=gold...
2026.02
31.4
14
7
GPT-4o
evidence_setting=gold...
2026.02
25.5
18.1
6.8
Feedback
Search any
task
Search any
task