Cross-modal Vision-Language Retrieval on Amstertime
[Chart: best R@1 is 13.8 (La-SigLIP-V2, Feb 3, 2026). Metrics shown: R@1, R@5, R@10, R@20.]
Evaluation Results
Method        Augmentation  Date     R@1   R@5   R@10  R@20
La-SigLIP-V2  Language-...  2026.02  13.8  29    40    49.5
La-BLIP       Language-...  2026.02  12.8  31.4  41.1  53.2
La-CLIP       Language-...  2026.02  11.5  27.5  37.7  50.4
La-EVA-V2     Language-...  2026.02  11.5  27.9  36.8  47.8
EVA-CLIP-V2   None          2026.02  2.8   8     12.7  18.1
CLIP          None          2026.02  2     7.5   12.1  18
BLIP          None          2026.02  1.3   4.1   7     12.3
SigLIP-V2     None          2026.02  0.9   3.1   5.5   8.8
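
For readers unfamiliar with the R@K columns, the following is a minimal sketch of how Recall@K is commonly computed in retrieval benchmarks: the share of queries for which at least one correct item appears among the top K retrieved results. It assumes this common definition and assumes the scores above are percentages; the page itself does not state either. The function name `recall_at_k` and the toy data are hypothetical and unrelated to the leaderboard numbers.

```python
from typing import Sequence, Set


def recall_at_k(
    ranked_ids: Sequence[Sequence[int]],
    relevant_ids: Sequence[Set[int]],
    k: int,
) -> float:
    """Percentage of queries with at least one relevant item in the top k.

    ranked_ids[i] is the retrieval ranking for query i (best match first);
    relevant_ids[i] is the set of gallery ids that count as correct for it.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    if len(ranked_ids) != len(relevant_ids):
        raise ValueError("need one ranking and one relevant set per query")
    if not ranked_ids:
        raise ValueError("need at least one query")

    # A query counts as a hit if any of its top-k results is relevant.
    hits = sum(
        1
        for ranking, relevant in zip(ranked_ids, relevant_ids)
        if any(item in relevant for item in ranking[:k])
    )
    return 100.0 * hits / len(ranked_ids)


# Toy data (hypothetical): four queries, each with one correct gallery id.
rankings = [[3, 1, 2], [0, 2, 1], [2, 0, 1], [1, 3, 0]]
relevant = [{1}, {0}, {1}, {2}]

r1 = recall_at_k(rankings, relevant, k=1)
r2 = recall_at_k(rankings, relevant, k=2)
r3 = recall_at_k(rankings, relevant, k=3)
print(r1, r2, r3)
```

Under this definition the score can only stay the same or rise as K grows, which matches the pattern in every row of the table (R@1 ≤ R@5 ≤ R@10 ≤ R@20).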