
Image-to-Text Generation on One-to-one evaluation benchmarks Image-to-Text (CLIP)

32.14 CLIP Score

LLaVA-NeXT

[Chart: CLIP Score by date; y-axis ticks 26.004, 27.597, 29.19, 30.783; x-axis Dec 17, 2025]
Updated 4d ago

Evaluation Results

Date      CLIP Score
2025.12   32.14
2025.12   30.73
2025.12   29.74
2025.12   27.71
2025.12   26.24