Hallucination Robustness on HallusionBench (fAcc)

37.3 fAcc

Qwen2.5-VL + DRScaffold

[Chart: fAcc axis ticks 24.612, 27.906, 31.2, 34.494; date label May 25, 2026]
Updated 8d ago

Evaluation Results

Method    Links
37.3    2026.05
35.2
34.4
33.8    2026.05
32.6    2026.05
31.2    2026.05
25.1