
Object Hallucination on A-OKVQA POPE (test)

90.13 Accuracy (Random)

OSGA

[Chart: Accuracy (Random) over time, Aug 3, 2025 to Jan 30, 2026]
Updated 1mo ago

Evaluation Results

Method  Links
2026.01  90.13  89.75  85.27  85.43  79.9   78.9   85.1   84.69  -      -      -      -      -      -
2025.08  89.03  89.7   82.7   84.7   72.77  77.84  -      -      -      -      -      -      -      -
2025.08  88.6   88.34  79.47  81.85  71.37  76.38  -      -      -      -      -      -      -      -
2025.08  88.6   88.41  80.2   81.85  73.03  76.8   -      -      -      -      -      -      -      -
2025.08  88.57  88.11  79.6   81.97  71.47  76.47  -      -      -      -      -      -      -      -
2025.08  88.57  88.35  79.3   81.77  71.33  76.4   -      -      -      -      -      -      -      -
2025.08  88.33  88.26  84.57  85.04  77.57  79.64  -      -      -      -      -      -      -      -
2025.08  88.33  88.32  84.77  85.27  77.7   79.82  -      -      -      -      -      -      -      -
2026.01  88.27  87.54  85.17  84.74  79.37  79.97  84.27  84.08  -      -      -      -      -      -
2025.08  88.27  88.23  84.73  85.21  77.57  79.67  -      -      -      -      -      -      -      -
2025.08  88.13  88.1   84.43  84.95  77.23  79.42  -      -      -      -      -      -      -      -
2025.08  88.13  87.96  84.43  84.77  77.6   79.46  -      -      -      -      -      -      -      -
2025.08  88.13  88.28  79.83  81.59  72.37  76.39  -      -      -      -      -      -      -      -
2025.08  87.93  88.98  81.13  83.78  70.23  76.6   -      -      -      -      -      -      -      -
2025.08  87.87  88.91  81.3   83.87  70.53  76.75  -      -      -      -      -      -      -      -
2025.08  87.87  88.23  83.77  84.82  75.33  78.63  -      -      -      -      -      -      -      -
2025.08  87.7   88.79  80.63  83.41  69.97  76.43  -      -      -      -      -      -      -      -
2025.08  87.37  88.5   80.37  83.2   70     76.42  -      -      -      -      -      -      -      -
2025.08  87.33  87.77  79.07  81.29  71.4   76.07  -      -      -      -      -      -      -      -
2025.08  87.1   88.3   80.1   83.08  69.03  75.9   -      -      -      -      -      -      -      -
2025.08  86.97  87.23  78.9   81.82  67.9   75.01  -      -      -      -      -      -      -      -
2025.08  86.83  86.93  82.07  82.96  75.5   78.13  -      -      -      -      -      -      -      -
2026.01  86.3   86.03  80.93  81.56  73.93  76.14  -      -      87.79  84.33  78.96  84.33  70.19  83.2
2026.01  86.15  86.34  81.85  82.82  74.97  77.73  80.99  82.3   -      -      -      -      -      -
2026.01  85.57  85.06  81.93  81.95  77.43  78.99  81.64  82     -      -      -      -      -      -
2026.01  85.03  84.61  80.6   80.92  74.9   76.59  -      -      87.09  82.27  79.61  82.27  71.75  82.13
2026.01  84.03  83.22  80.23  80.03  74.27  75.34  -      -      87.68  79.2   80.87  79.2   72.33  78.6
2026.01  83.45  82.56  79.9   79.59  74.04  75.15  79.13  79.1   -      -      -      -      -      -
2025.08  82.03  82.36  76.9   78.4   71.67  74.93  -      -      -      -      -      -      -      -