Discriminative Hallucination Detection on AMBER
Best reported accuracy: 89.4 (InternVL-3.5 + FINER-Tuning, Mar 18, 2026)
Evaluation Results
Method                        Size  Date     Accuracy
InternVL-3.5 + FINER-Tuning   14B   2026.03  89.4
InternVL-3.5 + FINER-Tuning   8B    2026.03  88.6
InternVL-3.5                  8B    2026.03  88.2
InternVL-3.5                  14B   2026.03  88
OmniLMM + RLAIF-V             12B   2026.03  87.4
OmniLMM                       12B   2026.03  86.9
Qwen2.5-VL + FINER-Tuning     7B    2026.03  85.8
Qwen2.5-VL                    7B    2026.03  85.2
LLaVA-1.6 + FINER-Tuning      7B    2026.03  85
LLaVA-1.6                     7B    2026.03  78.1