Hallucination Detection on VisionHall
[Chart: F1 Score over time. Best result: ZINA, 45.15 F1 Score (Jun 16, 2025).]
Evaluation Results
Method                          Settings                   Date      F1 Score
ZINA                            Shot Setting=3-shot        2025.06   45.15
GPT-4o                          Shot Setting=3-shot        2025.06   29.37
GPT-4o (w/o images)             Shot Setting=3-shot, I...  2025.06   27.02
LLaVA-OV-Qwen2-72B              Shot Setting=3-shot        2025.06   25.7
Qwen2.5-VL-72B-Instruct         Shot Setting=3-shot        2025.06   21.31
LLaVA-NeXT-Qwen-32B             Shot Setting=3-shot        2025.06   19.09
Llama-3.2-90B-Vision-Instruct   Shot Setting=3-shot        2025.06   16.92
LLaVA-1.5-13B                   Shot Setting=3-shot        2025.06   4.73
LLaVA-OV-Qwen2-7B               Shot Setting=3-shot        2025.06   3.39
Qwen2-VL-7B                     Shot Setting=3-shot        2025.06   3.36
LLaVA-1.5-7B                    Shot Setting=3-shot        2025.06   0.82