Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Object Hallucination Assessment on A-OKVQA POPE (Adversarial)

0.8363Accuracy

SIRA

0.665740.710020.75430.79858Jan 3, 2025Mar 26, 2025Jun 17, 2025Sep 8, 2025Nov 29, 2025Feb 20, 2026May 14, 2026
Updated 2d ago

Evaluation Results

MethodLinks
2026.05
0.8363-0.8461-
2026.05
0.836-0.8302-
2026.05
0.8275-0.8395-
2026.05
0.8183-0.8133-
2026.05
0.8127-0.8238-
2025.12
0.81260.80970.8104-
2026.05
0.8084-0.8151-
2026.05
0.8082-0.8154-
2025.12
0.80750.80370.8046-
2026.05
0.8037-0.7968-
2025.12
0.8020.79080.8058-
2025.12
0.7950.77540.8021-
2025.12
0.79130.76040.803-
2025.01
0.79130.76450.80140.842
2026.05
0.773-0.7943-
2025.01
0.7720.75470.77950.806
2025.12
0.7690.75590.7748-
2026.05
0.759-0.784-
2025.01
0.7570.71380.77930.858
2026.05
0.7556-0.7868-
2026.05
0.755-0.7794-
2026.05
0.7497-0.7773-
2025.01
0.74970.70010.77730.8736
2025.01
0.74420.70240.78480.8893
2025.01
0.74330.69460.77190.8687
2026.05
0.7404-0.7515-
2025.01
0.74040.72080.75150.7849
2026.05
0.7382-0.7791-
2025.12
0.73440.66970.7716-
2025.12
0.71870.65650.7596-
2025.12
0.7130.6810.7826-
2025.12
0.710.65410.7545-
2025.01
0.70710.65910.75560.8583
2025.12
0.7070.6670.7686-
2025.12
0.70270.64150.7555-
2025.12
0.7010.64280.7516-
2025.12
0.69870.64540.7454-
2025.12
0.6860.62220.7511-
2025.12
0.68570.62260.7499-
2026.05
0.6803-0.7449-
2025.12
0.6740.61390.7421-
2025.12
0.67230.61560.737-