Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Hallucination Assessment on MHumanEval

72.6Response Rate

LLaVA-RLHF

27.77639.41351.0562.687May 27, 2024
Updated 3d ago

Evaluation Results

MethodLinks
2024.05
72.6
2024.05
68.5
2024.05
67.8
2024.05
67.1
2024.05
67.1
2024.05
63.7
2024.05
63
2024.05
61
2024.05
59.6
2024.05
55.5
2024.05
54.8
2024.05
54.1
2024.05
53.4
2024.05
53.4
2024.05
52.7
2024.05
45.9
2024.05
44.5
2024.05
39.7
2024.05
35.6
2024.05
29.5