Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Hallucination Benchmark on POPE-R
Loading...
88.6
Accuracy
V-STAR
81.84
83.595
85.35
87.105
Apr 11, 2026
Accuracy
Updated 5d ago
Evaluation Results
Method
Method
Links
Accuracy
V-STAR
Data Size=40k
2026.04
88.6
Vision-R1
Data Size=210k
2026.04
88
ThinkLite-VL
Data Size=11k
2026.04
86.9
VL-Rethinker
Data Size=39k
2026.04
85.5
VL-Cogito
Data Size=80k
2026.04
85
R1-Onevision
Data Size=155k
2026.04
84.6
OpenVLThinker
Data Size=59.2k
2026.04
82.4
Qwen2.5VL
2026.04
82.1
Feedback
Search any
task
Search any
task