Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-image hallucination evaluation on MUIRBench
Loading...
62
Accuracy
Qwen2.5-VL + CAPL
28.564
37.2445
45.925
54.6055
Mar 7, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen2.5-VL + CAPL
Params=7B
2026.03
62
GLM4.1VBase + CAPL
Params=9B
2026.03
60.57
Qwen2.5-VL
Params=7B
2026.03
58.42
GLM4.1VBase
Params=9B
2026.03
57.84
InternVL2.5 + CAPL
Params=8B
2026.03
52.12
InternVL2.5
Params=8B
2026.03
48.54
InternVL2
Params=7B
2026.03
45.61
Qwen2VL
Params=7B
2026.03
39.57
Idefics3
Params=8B
2026.03
30.96
LLaVA-OV
Params=7B
2026.03
30.85
LLaVA-Next
Params=7B
2026.03
30.5
Idefics2
Params=8B
2026.03
29.85
Feedback
Search any
task
Search any
task