Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Factual Inconsistency Detection on CHOCOLATE LLM

0.205Kendall's Tau

GPT-4V

0.015720.064860.1140.16314Dec 15, 2023
Updated 4d ago

Evaluation Results

MethodLinks
2023.12
0.205
0.117
2023.12
0.105
2023.12
0.091
0.057
2023.12
0.057
0.057
2023.12
0.045
2023.12
0.023