Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Video Hallucination Evaluation on VidHal
Loading...
80.9
Accuracy
InternVL3-8B
49.7
57.8
65.9
74
May 31, 2025
Accuracy
Average Score
Updated 12d ago
Evaluation Results
Method
Method
Links
Accuracy
Average Score
InternVL3-8B
Access=Open-source, Pa...
2025.05
80.9
67
CoF-InternVL3-8B
Parameters=8B
2025.05
79.5
72.1
CoF-InternVL2.5-4B
Parameters=4B
2025.05
79.2
64.6
GPT-4o
Access=Closed-source
2025.05
77.2
-
InternVL2.5-4B
Access=Open-source, Pa...
2025.05
77
60.8
Qwen2-VL-72B
Access=Open-source, Pa...
2025.05
76.2
62.7
Qwen2-VL-7B
Access=Open-source, Pa...
2025.05
69.6
58
Gemini-1.5-Pro
Access=Closed-source
2025.05
67.1
-
LLaVA-OneVision-72B
Access=Open-source, Pa...
2025.05
64.7
58
LLaVA-OneVision-7B
Access=Open-source, Pa...
2025.05
58.4
53.2
LLaVA-NeXT-Video-7B
Access=Open-source, Pa...
2025.05
50.9
50.2
Feedback
Search any
task
Search any
task