
HallusionBench

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Hallucination Evaluation | HallusionBench | Average Score: 93.1 | 93 |
| Hallucination and Visual Reasoning Evaluation | HallusionBench | Score: 59.2 | 37 |
| Hallucination Robustness | HallusionBench | Score: 57.8 | 32 |
| Hallucination Assessment | HallusionBench | Question Accuracy (qAcc): 49 | 30 |
| Visual Hallucination Evaluation | HallusionBench | Accuracy (Q): 31.42 | 19 |
| Multimodal Reasoning | HallusionBench | Accuracy: 0.709 | 17 |
| Hallucination | HallusionBench | Pass@1: 74 | 16 |
| Multimodal Hallucination Evaluation | HallusionBench | Hallucination Score: 70.7 | 14 |
| Hallucination Evaluation | HallusionBench 2024 | Score: 52.2 | 13 |
| Visual Illusion and Hallucination Evaluation | HallusionBench (HallB) | HallB Score: 41.7 | 13 |
| Hallucination Evaluation | HallusionBench GPT4-assisted (All) | Accuracy (All): 49.94 | 11 |
| Visual Hallucination Evaluation | HallusionBench visual questions | Accuracy: 65.8 | 10 |
| Vision-Language Reasoning | HallusionBench (test) | Simple Accuracy: 53.31 | 7 |
| General visual question answering | HallusionBench | Pass@1: 63.7 | 7 |
| Hallucination control | HallusionBench | General Score: 60.5 | 6 |
| General VQA | HallusionBench | Accuracy: 73.48 | 5 |
| Hallucination Analysis | HallusionBench | fACC: 18.7 | 4 |
| Hallucination Evaluation | HallusionBench (test) | Question Pair Accuracy: 17.8 | 4 |
| Visual Question Answering | HallusionBench HBI (all) | Score: 45.21 | 4 |
| Paired-prompt evaluation | HallusionBench | Simple Accuracy: 52.89 | 2 |
| Visual Question Answering | HallusionBench | Simple Accuracy: 51.31 | 2 |