Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Image-to-Text Generation on One-to-one evaluation benchmarks Image-to-Text (CLIP)
Loading...
32.14
CLIP Score
LLaVA-NeXT
26.004
27.597
29.19
30.783
Dec 17, 2025
CLIP Score
Updated 4d ago
Evaluation Results
Method
Method
Links
CLIP Score
LLaVA-NeXT
Category=Specialists
2025.12
32.14
UnifiedIO2-L
Category=Generalists
2025.12
30.73
FlowBind
Category=Generalists
2025.12
29.74
OmniFlow
Category=Generalists
2025.12
27.71
CoDi
Category=Generalists
2025.12
26.24
Feedback
Search any
task
Search any
task