Factual Inconsistency Detection on CHOCOLATE LLM
[Chart: Kendall's Tau by date. Highlighted point: GPT-4V, 0.205, Dec 15, 2023.]
Evaluation Results
| Method | Date | Kendall's Tau |
| --- | --- | --- |
| GPT-4V | 2023.12 | 0.205 |
| DePlot + GPT-4 | 2023.12 | 0.117 |
| Bard | 2023.12 | 0.105 |
| CHARTVE | 2023.12 | 0.091 |
| LLaVA-1.5-13B | 2023.12 | 0.057 |
| ChartLlama | 2023.12 | 0.057 |
| ChartAssistant-S | 2023.12 | 0.057 |
| QAFACTEVAL | 2023.12 | 0.045 |
| SUMMAC | 2023.12 | 0.023 |
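
For readers unfamiliar with the metric, the following is a minimal sketch of how a Kendall's Tau score can be computed between human factuality judgments and a method's scores. It assumes the tau-b variant, which corrects for ties; this page does not state which variant the benchmark uses. The function name `kendall_tau_b` and the sample `labels` and `scores` are hypothetical and are not taken from the benchmark data.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_b(xs, ys):
    """Kendall's tau-b between two equal-length sequences of numbers.

    Assumption: tau-b (tie-corrected) is used here for illustration only;
    the leaderboard does not say which variant it reports.
    """
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")

    concordant = discordant = ties_x = ties_y = 0
    # Compare every unordered pair of items once.
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        dx = x1 - x2
        dy = y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied in both rankings: contributes to neither term
        elif dx == 0:
            ties_x += 1  # tied only in xs
        elif dy == 0:
            ties_y += 1  # tied only in ys
        elif dx * dy > 0:
            concordant += 1  # both rankings order the pair the same way
        else:
            discordant += 1  # the rankings disagree on the pair

    untied = concordant + discordant
    denom = sqrt((untied + ties_x) * (untied + ties_y))
    if denom == 0:
        return float("nan")  # undefined when one sequence is constant
    return (concordant - discordant) / denom


# Hypothetical data: 1 = caption judged consistent, 0 = inconsistent.
labels = [1, 0, 1, 1, 0]
# Hypothetical consistency scores from some detection method.
scores = [0.9, 0.2, 0.6, 0.4, 0.7]

tau = kendall_tau_b(labels, scores)
print(f"Kendall's tau-b: {tau:.3f}")
```

A value of 1 means the method ranks every pair of items in the same order as the human judgments, 0 means no association, and -1 means the order is fully reversed. The pairwise loop is quadratic in the number of items, which is fine for a sketch; `scipy.stats.kendalltau` provides a faster implementation for larger inputs.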