Discriminative Hallucination Evaluation on AMBER-d
Best result: Qwen2VL-7B + ACT, 89.2 Accuracy (Apr 1, 2026)
Evaluation Results
| Method | Architecture | Date | Accuracy |
|---|---|---|---|
| Qwen2VL-7B + ACT | Qwen2VL-7... | 2026.04 | 89.2 |
| Qwen2VL-7B | Qwen2VL-7... | 2026.04 | 86.6 |
| InternVL2.5-8B + ACT | InternVL2... | 2026.04 | 84.8 |
| InternVL2.5-8B | InternVL2... | 2026.04 | 83.4 |
| InternVL2-8B + ACT | InternVL2... | 2026.04 | 81.8 |
| Qwen2.5VL-7B + ACT | Qwen2.5VL... | 2026.04 | 78.9 |
| Qwen2.5VL-7B | Qwen2.5VL... | 2026.04 | 77.8 |
| LLaVA-1.5-7B + ACT | LLaVA-1.5... | 2026.04 | 77 |
| LLaVA-1.5-13B + ACT | LLaVA-1.5... | 2026.04 | 74 |
| LLaVA-1.5-7B | LLaVA-1.5... | 2026.04 | 71.6 |
| InternVL2-8B | InternVL2... | 2026.04 | 71.2 |
| LLaVA-1.5-13B | LLaVA-1.5... | 2026.04 | 68.9 |