
Factual Error Correction on CHOCOLATE LLM 1.0

GPT-4V Score: 52.35

[Chart: GPT-4V score over time. Y-axis ticks: 21.254, 29.327, 37.4, 45.473. Date shown: Dec 15, 2023]
Updated 4d ago

Evaluation Results

Date      GPT-4V   (unlabeled score)
2023.12   52.35    50.57
2023.12   39.29    53.11
2023.12   31.77    77.63
2023.12   25.51    55.35
2023.12   23.47    0
2023.12   22.45    9.2
2023.12   22.45    21.25