| Dataset Name | SOTA Method | Metric | Value | Count | Updated |
|---|---|---|---|---|---|
| RefCOCO+ (val) | | Accuracy | 91.4 | 253 | 5d ago |
| RefCOCO+ (testA) | InternVL2.5 | Accuracy | 94.7 | 245 | 5d ago |
| RefCOCO+ (testB) | | Accuracy | 87.9 | 219 | 5d ago |
| RefCOCO (val) | | Accuracy | 95.2 | 172 | 23d ago |
| RefCOCO (testA) | | Accuracy | 96.5 | 162 | 5d ago |
| RefCOCO (testB) | | Accuracy | 92.6 | 159 | 26d ago |
| RefCOCOg (val) | | Accuracy | 93.2 | 158 | 5d ago |
| RefCOCOg (test) | | Accuracy | 93.3 | 155 | 5d ago |
| HRBench8K | ERASE | Accuracy | 75.63 | 51 | 21d ago |
| RefCOCOg | InternVL2-26B | Accuracy | 88.44 | 45 | 19d ago |
| RefCOCO+ | GiT-H universal | Accuracy @ 0.5 IoU | 88.3 | 38 | 1mo ago |
| ScanRefer v1 (val) | Proxy3D | Acc@0.5 (Unique) | 84 | 35 | 23d ago |
| DIOR-RSVG | LAE-DINO | Accuracy@0.5 | 86.7 | 34 | 1mo ago |
| Who's Waldo (test) | Who's Waldo | Accuracy | 63.5 | 31 | 3mo ago |
| ReferCOCO v1 (testB) | mPLUG | Acc @ 0.5 | 88.42 | 30 | 3mo ago |
| V* | | Accuracy | 83.77 | 29 | 21d ago |
| RefFLIR 1.0 (val) | RGBT-VGNet | Accuracy @ 0.5 IoU | 73.68 | 29 | 3mo ago |
| Flickr30k Entities (test) | AMC | Accuracy | 86.59 | 29 | 3mo ago |
| BLINK | ERASE | Accuracy | 64.49 | 27 | 14d ago |
| VIEW2SPACE v1 | Ours (Grounded CoT) | mIoU | 69.34 | 27 | 2mo ago |
| ScreenSpot-Pro 1.0 (test) | Qwen3VL-32B-Instruct | Development Score | 71.6 | 27 | 3mo ago |
| ReferitGame (test) | RefTR | Pr@0.5 | 0.7142 | 26 | 3mo ago |
| ReferCOCO+ v1 (testA) | mPLUG | Acc@0.5 | 90.17 | 24 | 3mo ago |
| RefCOCO (test) | Original Grounding CogVLM | Accuracy | 91.4 | 23 | 14d ago |
| MMB v1.1 | | Accuracy | 85.76 | 22 | 21d ago |