Share your thoughts, 1 month free Claude Pro on usSee more

Factual Inconsistency Detection on CHOCOLATE LLM

0.205Kendall's Tau

GPT-4V

Updated 5mo ago

Evaluation Results

Method	Links
GPT-4V 2023.12		0.205
DePlot + GPT-4 2023.12		0.117
Bard 2023.12		0.105
CHARTVE 2023.12		0.091
LLaVA-1.5-13B 2023.12		0.057
ChartLlama 2023.12		0.057
ChartAssistant-S 2023.12		0.057
QAFACTEVAL 2023.12		0.045
SUMMAC 2023.12		0.023