Cross-modal Vision-Language Retrieval on Pitts30
[Chart: R@1 over time. Metrics available: R@1, R@5, R@10, R@20. Latest point: La-BLIP, R@1 50.2, Feb 3, 2026.]
Updated 4d ago
Evaluation Results
| Method | Augmentation | Date | R@1 | R@5 | R@10 | R@20 |
|---|---|---|---|---|---|---|
| La-BLIP | Language-... | 2026.02 | 50.2 | 74.2 | 82.8 | 89.6 |
| La-CLIP | Language-... | 2026.02 | 49.3 | 74.4 | 82.8 | 89.7 |
| La-SigLIP-V2 | Language-... | 2026.02 | 49.3 | 74.6 | 82.8 | 89.2 |
| La-EVA-V2 | Language-... | 2026.02 | 41.5 | 66.8 | 76.6 | 85.1 |
| EVA-CLIP-V2 | None | 2026.02 | 11.4 | 30.9 | 44.2 | 59.9 |
| CLIP | None | 2026.02 | 10.9 | 29.6 | 43.5 | 59.6 |
| SigLIP-V2 | None | 2026.02 | 10.1 | 30.1 | 44.3 | 61.9 |
| BLIP | None | 2026.02 | 8.1 | 26.8 | 41.1 | 56.7 |
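
The R@1, R@5, R@10, and R@20 columns report Recall@K. The page does not describe its evaluation protocol, so the following is only a minimal sketch under the common definition: a query counts as a hit at K if at least one relevant item appears among its top K retrieved results, and the score is the percentage of queries that are hits. The function name `recall_at_k`, its parameters, and the toy data are hypothetical and are not taken from the benchmark.

```python
def recall_at_k(ranked_ids, relevant_ids, ks=(1, 5, 10, 20)):
    """Return {k: percentage of queries with a relevant item in the top k}.

    ranked_ids:   one list per query, retrieved item ids ordered best first.
    relevant_ids: one set per query, ids counted as correct for that query.
    Assumption: a single relevant item in the top k is enough for a hit.
    """
    if len(ranked_ids) != len(relevant_ids):
        raise ValueError("ranked_ids and relevant_ids must have equal length")
    if not ranked_ids:
        raise ValueError("at least one query is required")

    n_queries = len(ranked_ids)
    results = {}
    for k in ks:
        # Count queries whose top-k list contains any relevant id.
        hits = sum(
            1
            for ranking, relevant in zip(ranked_ids, relevant_ids)
            if any(item in relevant for item in ranking[:k])
        )
        results[k] = 100.0 * hits / n_queries
    return results


# Toy data (hypothetical): four queries, three retrieved ids each.
ranked = [[3, 7, 1], [2, 9, 4], [5, 6, 8], [0, 1, 2]]
relevant = [{3}, {4}, {1}, {1, 2}]

scores = recall_at_k(ranked, relevant, ks=(1, 2, 3))
print(scores)  # {1: 25.0, 2: 50.0, 3: 75.0}
```

Under this definition Recall@K cannot decrease as K grows, which matches the pattern in every row of the table above.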