Share your thoughts, 1 month free Claude Pro on usSee more

Image-to-Text Generation on One-to-one evaluation benchmarks Image-to-Text (CLIP)

32.14CLIP Score

LLaVA-NeXT

Updated 5mo ago

Evaluation Results

Method	Links
LLaVA-NeXT 2025.12		32.14
UnifiedIO2-L 2025.12		30.73
FlowBind 2025.12		29.74
OmniFlow 2025.12		27.71
CoDi 2025.12		26.24