Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Visual Grounding on RefCOCO+ (testA+)
Loading...
93.9
Accuracy
Youtu-VL
78.092
82.196
86.3
90.404
Jan 27, 2026
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
Youtu-VL
Model Size=4B
2026.01
93.9
Florence-2
Backbone=DaViT-B
2026.01
92.9
InternVL-3.5
Model Size=4B
2026.01
92.3
UFO
Backbone=InternVL2.5-8B
2026.01
92.1
Griffon
Backbone=LLama2-13B
2026.01
90.5
Qwen3-VL
Model Size=4B
2026.01
89.4
Grounding DINO
Backbone=Swin-L
2026.01
89
MDETR
Backbone=ENB3
2026.01
85.5
VisionLLM v2
Backbone=Swin-T
2026.01
83.8
GLaMM
Backbone=Vicuna-7B
2026.01
78.7
Feedback
Search any
task
Search any
task