Share your thoughts, 1 month free Claude Pro on usSee more

Factual Error Detection on CHOCOLATE 1.0 (LLM)

73.8ROC AUC

GPT-4V

Updated 5mo ago

Evaluation Results

Method	Links
GPT-4V 2023.12		73.8
DePlot + GPT-4 2023.12		62.9
Bard 2023.12		61.7
CHARTVE 2023.12		59.5
LLaVA-1.5-13B 2023.12		56.6