| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| RefCOCO+ (val) | | Accuracy | 91.4 | 171 | 2d ago |
| RefCOCO+ (testB) | | Accuracy | 87.9 | 169 | 2d ago |
| RefCOCO+ (testA) | InternVL2.5 | Accuracy | 94.7 | 168 | 2d ago |
| RefCOCO (TestB) | | Accuracy | 92.6 | 125 | 2d ago |
| RefCOCO (val) | | Accuracy | 95.2 | 119 | 2d ago |
| RefCOCO (TestA) | | Accuracy | 96.5 | 117 | 2d ago |
| RefCOCOg (test) | | Accuracy | 93.3 | 96 | 2d ago |
| RefCOCOg (val) | | Accuracy | 93.2 | 93 | 2d ago |
| Who's Waldo (test) | Who's Waldo | Accuracy | 63.5 | 31 | 4d ago |
| ScanRefer v1 (val) | GPT4Scene | Acc@0.5 (All) | 57 | 30 | 4d ago |
| ReferCOCO v1 (testB) | mPLUG | Acc@0.5 | 88.42 | 30 | 4d ago |
| RefFLIR 1.0 (val) | RGBT-VGNet | Accuracy @ 0.5 IoU | 73.68 | 29 | 4d ago |
| Flickr30k Entities (test) | AMC | Accuracy | 86.59 | 29 | 4d ago |
| ScreenSpot-Pro 1.0 (test) | Qwen3VL-32B-Instruct | Development Score | 71.6 | 27 | 4d ago |
| ReferitGame (test) | RefTR | Pr@0.5 | 0.7142 | 26 | 4d ago |
| DIOR-RSVG | LAE-DINO | Accuracy@0.5 | 86.7 | 25 | 4d ago |
| ReferCOCO+ v1 (testA) | mPLUG | Acc@0.5 | 90.17 | 24 | 4d ago |
| RefCOCO+ | GiT-H universal | Accuracy @ 0.5 IoU | 88.3 | 20 | 4d ago |
| ReferCOCO+ v1 (val) | mPLUG | Accuracy @ IoU 0.5 | 86.02 | 20 | 4d ago |
| Chest X-ray Visual Grounding | | Aortic Enlargement Score | 69.29 | 19 | 4d ago |
| ReferCOCOg UMD (test-u) | mPLUG | Acc@0.5 | 86.42 | 19 | 4d ago |
| ReferCOCO v1 (val) | mPLUG | Acc@0.5 | 92.4 | 19 | 4d ago |
| Lisa Grounding | Visual Jigsaw | Accuracy | 75.69 | 18 | 4d ago |
| RefCOCOg | InternVL2-26B | Accuracy | 88.44 | 17 | 4d ago |
| ReferCOCOg UMD (val) | mPLUG | Acc@0.5 | 85.88 | 17 | 4d ago |