Hallucination Robustness on HallusionBench (fAcc)
[Chart: fAcc over time; highlighted entry: Qwen2.5-VL + DRScaffold, 37.3 fAcc; date shown: May 25, 2026]
Updated 8d ago
Evaluation Results
| Method | Scale | Date | fAcc |
|---|---|---|---|
| Qwen2.5-VL + DRScaffold | 3B | 2026.05 | 37.3 |
| Qwen2.5-VL | 3B | 2026.05 | 35.2 |
| Phi4-multimodal + DRScaffold | 5.6B | 2026.05 | 34.4 |
| InternVL2.5 + DRScaffold | 2B | 2026.05 | 33.8 |
| InternVL2.5 | 2B | 2026.05 | 32.6 |
| Qwen2.5-VL | 32B | 2026.05 | 31.2 |
| Phi4-multimodal | 5.6B | 2026.05 | 25.1 |
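
The results listed above pair several base models with a "+ DRScaffold" variant at the same scale, so the per-model difference in fAcc can be read off with simple arithmetic. The snippet below is a minimal sketch of that comparison. The scores are copied from the results above; the names `RESULTS`, `SUFFIX`, and `scaffold_gains` are hypothetical helpers introduced for illustration and are not part of the benchmark or of DRScaffold.

```python
# Scores copied from the results above: (method, scale) -> fAcc.
RESULTS = {
    ("Qwen2.5-VL + DRScaffold", "3B"): 37.3,
    ("Qwen2.5-VL", "3B"): 35.2,
    ("Phi4-multimodal + DRScaffold", "5.6B"): 34.4,
    ("InternVL2.5 + DRScaffold", "2B"): 33.8,
    ("InternVL2.5", "2B"): 32.6,
    ("Qwen2.5-VL", "32B"): 31.2,
    ("Phi4-multimodal", "5.6B"): 25.1,
}

# Assumption: a scaffolded entry is identified by this name suffix.
SUFFIX = " + DRScaffold"


def scaffold_gains(results):
    """Return the fAcc difference (scaffolded minus baseline) per (model, scale).

    Only pairs where both the baseline and the scaffolded entry are listed
    at the same scale are included.
    """
    gains = {}
    for (method, scale), score in results.items():
        if not method.endswith(SUFFIX):
            continue
        base = method[: -len(SUFFIX)]
        baseline = results.get((base, scale))
        if baseline is not None:
            # Round to one decimal, matching the precision of the listed scores.
            gains[(base, scale)] = round(score - baseline, 1)
    return gains


if __name__ == "__main__":
    for (base, scale), gain in scaffold_gains(RESULTS).items():
        print(f"{base} ({scale}): {gain:+.1f} fAcc")
```

Qwen2.5-VL at 32B has no scaffolded counterpart in the listing, so it is skipped rather than compared against the 3B entry.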