Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Hallucination Evaluation on HalBench
Loading...
76.7
Score
Qwen3-VL-8B-Thinking + DiG
49.036
56.218
63.4
70.582
Dec 14, 2025
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
Qwen3-VL-8B-Thinking + DiG
Parameter Scale=8B, Th...
2025.12
76.7
Qwen3-VL-4B-Thinking + DiG
Parameter Scale=4B, Th...
2025.12
73.8
Qwen3-VL-8B-Thinking
Parameter Scale=8B, Th...
2025.12
73.3
Qwen3-VL-4B-Thinking
Parameter Scale=4B, Th...
2025.12
70.1
InternVL2.5-78B
Parameter Scale=78B
2025.12
57.1
Qwen2.5-VL-72B
Parameter Scale=72B
2025.12
55.2
Qwen2.5-VL-7B
Parameter Scale=7B
2025.12
50.1
Feedback
Search any
task
Search any
task