Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-modal Retrieval (Image-Text to Text/Image-Text) on OVEN QS
Loading...
8.39
Recall@5
VISTA
-0.2732
1.9759
4.225
6.4741
Jun 6, 2024
Recall@5
Updated 4d ago
Evaluation Results
Method
Method
Links
Recall@5
VISTA
#Params=196M, Zero-sho...
2024.06
8.39
CLIP
#Params=149M, Zero-sho...
2024.06
1.06
Pic2Word
#Params=224M, Zero-sho...
2024.06
0.97
Pic2Word-MM
#Params=224M, Zero-sho...
2024.06
0.82
CLIP-MM
#Params=149M, Zero-sho...
2024.06
0.4
BLIP-MM
#Params=224M, Zero-sho...
2024.06
0.27
BLIP
#Params=224M, Zero-sho...
2024.06
0.06
Feedback
Search any
task
Search any
task