Multimodal Hallucination Assessment on HallusionBench
[Chart: Accuracy over time; highlighted point: Qwen3.5-27B, Accuracy 70, Apr 9, 2026]
Updated 6d ago
Evaluation Results
Method               Details                    Date      Accuracy
Qwen3.5-27B          Mode=REASONING, Archit...  2026.04   70
Qwen3-VL-32B         Mode=Thinking, Archite...  2026.04   67.4
Qwen3-VL-235B-A22B   Mode=Thinking, Archite...  2026.04   66.7
EXAONE 4.5 33B       Mode=REASONING, Archit...  2026.04   63.7
GPT-5 mini           Mode=REASONING: HIGH,...   2026.04   63.2