
Factual Correction on CHOCOLATE LLM

GPT-4V Score: 52.35 (Dec 15, 2023)
Updated 1mo ago

Evaluation Results

Date     Score
2023.12  52.35
2023.12  39.29
2023.12  31.77
2023.12  23.47
2023.12  22.45
2023.12  22.45