Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Factual Inconsistency Detection on CHOCOLATE (FT)
Loading...
0.291
Kendall's Tau
Bard
0.0258
0.09465
0.1635
0.23235
Dec 15, 2023
Kendall's Tau
Updated 4d ago
Evaluation Results
Method
Method
Links
Kendall's Tau
Bard
2023.12
0.291
GPT-4V
2023.12
0.215
CHARTVE
2023.12
0.215
LLaVA-1.5-13B
2023.12
0.211
ChartLlama
2023.12
0.141
DePlot + GPT-4
2023.12
0.109
QAFACTEVAL
2023.12
0.054
SUMMAC
2023.12
0.036
ChartAssistant-S
2023.12
0.036
Feedback
Search any
task
Search any
task