Factual Inconsistency Detection on CHOCOLATE LLM

0.205 Kendall's Tau

GPT-4V

[Chart: Kendall's Tau by date; Dec 15, 2023]
Updated 1mo ago

Evaluation Results

Method  Links
2023.12  0.205  0.117
2023.12  0.105
2023.12  0.091  0.057
2023.12  0.057  0.057
2023.12  0.045
2023.12  0.023
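
The scores on this page are Kendall's Tau values, a rank correlation between a method's scores and reference judgments. As a rough illustration of how such a value can be computed, here is a minimal sketch of the tau-b variant in plain Python. The page does not say which variant the benchmark uses, so tau-b is an assumption, and the function name `kendall_tau_b` and the sample data are hypothetical.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_b(x, y):
    """Kendall's tau-b between two equal-length sequences of numbers.

    Hypothetical helper for illustration; not the benchmark's own code.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")

    concordant = discordant = ties_x_only = ties_y_only = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied in both sequences: excluded from every count
        if dx == 0:
            ties_x_only += 1
        elif dy == 0:
            ties_y_only += 1
        elif dx * dy > 0:
            concordant += 1  # the pair is ordered the same way in x and y
        else:
            discordant += 1  # the pair is ordered in opposite ways

    untied = concordant + discordant
    denom = sqrt((untied + ties_x_only) * (untied + ties_y_only))
    if denom == 0:
        return float("nan")  # undefined when one sequence is constant
    return (concordant - discordant) / denom


# Made-up example: metric scores vs. human consistency labels (1 = consistent).
metric_scores = [0.91, 0.35, 0.78, 0.12, 0.66]
human_labels = [1, 0, 1, 0, 0]
tau = kendall_tau_b(metric_scores, human_labels)
print(f"Kendall's tau-b: {tau:.3f}")
```

A value near 1 means the method ranks items the same way the reference judgments do, a value near 0 means little agreement in ranking, and a negative value means the rankings tend to be reversed.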