Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Object Hallucination Detection on POPE popular
Loading...
86.5
F1 Score
RSP
83.796
84.498
85.2
85.902
May 27, 2026
F1 Score
Trigger Percentage
Updated 6d ago
Evaluation Results
Method
Method
Links
F1 Score
Trigger Percentage
RSP
Model=LLaVA-1.5, Routi...
2026.05
86.5
5
Baseline
Model=LLaVA-1.5
2026.05
86.4
-
Always-on
Model=LLaVA-1.5
2026.05
86.2
-
RSP
Model=InstructBLIP, Pr...
2026.05
85.2
14
Always-on
Model=InstructBLIP, Pr...
2026.05
84.2
-
Baseline
Model=InstructBLIP
2026.05
83.9
-
Feedback
Search any
task
Search any
task