
Hallucination Benchmark on POPE-R

88.6 Accuracy

V-STAR

[Chart: accuracy by date; axis label Apr 11, 2026]
Updated 5d ago

Evaluation Results

Date      Accuracy
2026.04   88.6
2026.04   88
2026.04   86.9
2026.04   85.5
2026.04   85
2026.04   84.6
2026.04   82.4
2026.04   82.1