| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| CHAIR | M3ID | CHAIR_s72.8 | 252 | 23d ago | |
| MMHal-Bench | Qwen2.5-VL + FINER-Tuning | MMHal Score4.7 | 216 | 3d ago | |
| AMBER | AvisC | CHAIR14.2 | 172 | 5d ago | |
| POPE | MIRROR(ours) | Accuracy94.42 | 153 | 11d ago | |
| HallusionBench | Average Score93.1 | 108 | 4d ago | ||
| CHAIR MSCOCO | VCD | CHAIR_S59.4 | 64 | 23d ago | |
| Object HalBench | baseline | CHAIR Score (s)52.7 | 46 | 18d ago | |
| CHAIR MSCOCO 2014 (val) | Dola | CHAIRi26.2 | 39 | 1mo ago | |
| MME Hallucination | DAC | Existence Score195 | 39 | 16d ago | |
| MMHal | Score4.2 | 37 | 10d ago | ||
| MSCOCO (val) | SparseVLM | CHAIR_i23.04 | 36 | 1mo ago | |
| HallBench | ToR-DAPO | Accuracy73.6 | 31 | 22d ago | |
| POPE Adversarial v1.0 (test) | ResDec | Accuracy88.96 | 31 | 1mo ago | |
| POPE Popular v1.0 (test) | ResDec | Accuracy90.34 | 31 | 1mo ago | |
| POPE Random v1.0 (test) | ResDec | Accuracy91.17 | 31 | 1mo ago | |
| CHAIR MSCOCO 2014 | CHAIRs Score51.3 | 28 | 17d ago | ||
| COCO | CS53 | 28 | 1mo ago | ||
| MME hallucination (test) | VASparse | Existence Score180 | 24 | 1mo ago | |
| CRPE relation | InternVL2.5-26B | Accuracy79.1 | 23 | 1mo ago | |
| MSCOCO | VCD | CS Score55.2 | 21 | 1mo ago | |
| MOH | baseline | HR^D69.5 | 21 | 1mo ago | |
| MMLU-Pro Law (test) | HALL%12.1 | 21 | 1mo ago | ||
| AMBER-D | Qwen2.5-VL-32B | Score89.4 | 20 | 15d ago | |
| HaloQuest | Qwen2.5-VL + FINER-Tuning | Score S80.8 | 19 | 1mo ago | |
| HallB | SAIL | Score54.2 | 19 | 1mo ago |