Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Object Hallucination on MSCOCO POPE (test)

90.7Accuracy (Random)

HGAI

-2.75970421.50387345.7674570.031027Aug 3, 2025Sep 2, 2025Oct 2, 2025Nov 1, 2025Dec 1, 2025Dec 31, 2025Jan 30, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2025.08
90.790.7787.6788.128182.8--------
2025.08
90.189.8687.487.4481.882.82--------
2025.08
90.0390.0286.7387.1380.4782.15--------
2025.08
90.0389.9986.7387.180.782.28--------
2025.08
89.6389.6786.4386.979.981.74--------
2025.08
89.5789.6686.186.779.581.48--------
2025.08
89.078985.6386.0379.2781.01--------
2025.08
88.888.1686.786.2584.3784.21--------
2025.08
88.5387.9986.0386.0582.6783.25--------
2025.08
88.4787.7186.6386.0384.3784.04--------
2025.08
88.4387.6786.4785.8784.383.97--------
2025.08
88.487.7586.485.9283.983.75--------
2025.08
88.487.5783.9782.4482.7781.37--------
2025.08
88.387.4385.4384.8282.9782.14--------
2025.08
88.2787.5686.3785.8284.0783.82--------
2025.08
88.2387.4684.9384.4982.582.31--------
2025.08
87.385.8784.7383.4982.881.79--------
2025.08
87.1385.7283.584.0980.7781.93--------
2025.08
86.7786.1183.9783.5982.182.07--------
2025.08
86.3384.6184.182.5382.8381.4--------
2026.01
85.984.5683.4382.3380.8780.12--93.4677.288.1977.283.3677.13
2026.01
85.6383.3885.0782.8983.9381.76--98.972.0797.0572.3394.5772
2026.01
84.8783.3782.4381.279.979.06--92.5275.8787.3475.8782.5275.87
2025.08
84.183.0780.1779.2778.1778.13--------
2026.01
82.9380.8781.179.2478.6377.13--92.0172.1387.972.1382.9672.07
2026.01
82.6779.2382.9779.8981.7778.35--98.866.1397.567.6796.466
2026.01
81.7377.9481.2377.3380.8377.08--98.3764.5397.666495.8464.47
2026.01
0.87870.86430.86370.850.8450.83290.86290.8491------
2026.01
0.87530.86450.84210.8350.80880.80690.84210.8355------
2026.01
0.86840.86830.82650.83370.77310.79280.82270.8316------
2026.01
0.84870.83270.82930.81450.81070.79960.82960.8156------
2026.01
0.83490.82280.79980.79340.76030.76260.79830.7929------