Multi-image Hallucination Evaluation on BLINK
[Chart: Accuracy. Top entry: GLM4.1VBase + CAPL, 61.33 (Mar 7, 2026).]
Evaluation Results
| Method | Params | Date | Accuracy |
|---|---|---|---|
| GLM4.1VBase + CAPL | 9B | 2026.03 | 61.33 |
| GLM4.1VBase | 9B | 2026.03 | 58.17 |
| Qwen2.5-VL + CAPL | 7B | 2026.03 | 57.76 |
| InternVL2.5 + CAPL | 8B | 2026.03 | 55.76 |
| InternVL2.5 | 8B | 2026.03 | 54.81 |
| Qwen2.5-VL | 7B | 2026.03 | 54.6 |
| Qwen2VL | 7B | 2026.03 | 53.17 |
| InternVL2 | 7B | 2026.03 | 50.34 |
| Idefics3 | 8B | 2026.03 | 48.34 |
| Idefics2 | 8B | 2026.03 | 45.24 |
| LLaVA-OV | 7B | 2026.03 | 44.77 |
| LLaVA-Next | 7B | 2026.03 | 41.19 |