| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| HR-Bench 8K | Qwen2.5-VL-72B | Accuracy76.3 | 23 | 1mo ago | |
| HR-Bench 4K | Qwen2.5-VL-72B | Accuracy79.4 | 23 | 1mo ago | |
| V* Bench | Deepeyes-7B | Accuracy90.4 | 23 | 1mo ago | |
| VStarBench | Vero Q3I-8B | Score89.5 | 11 | 11d ago | |
| V* | Zoomeye | Average Success90.6 | 11 | 9d ago | |
| V* bench (test) | IVM-Enhanced GPT4-V | Attribute Rate87 | 10 | 1mo ago | |
| 1k image (test) | Taxonomy-decoupled | Rel P@k94.4 | 9 | 1mo ago | |
| COCO-Search18 cross-task | Accuracy (%)27.5 | 7 | 19d ago | ||
| Visual Shopping (Offline) | ViT-B/16 384x | P@154.7 | 6 | 1mo ago | |
| V-star | Penguin-VL | Accuracy83.8 | 5 | 1mo ago | |
| V* benchmark | RegionReasoner-7B | Attribute Success Rate75.65 | 5 | 1mo ago | |
| HLE-VL | Pass@136 | 4 | 25d ago | ||
| MM-BrowseComp | Seed1.8 | Pass@146.3 | 4 | 25d ago | |
| V*Bench | SEAL | Success Rate75.3 | 2 | 1mo ago |