Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Hallucination Robustness on POPE random
Loading...
80.56
Accuracy
FTibVLM
46.2088
55.1269
64.045
72.9631
May 26, 2026
Accuracy
F1 Score
Updated 7d ago
Evaluation Results
Method
Method
Links
Accuracy
F1 Score
FTibVLM
2026.05
80.56
80.51
Base
2026.05
47.53
43.62
Feedback
Search any
task
Search any
task