Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Image Captioning on Caption
Loading...
23.35
BLEU-4
Ours
12.17
15.0725
17.975
20.8775
Apr 20, 2026
BLEU-4
Average Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
BLEU-4
Average Score
Ours
Model=Qwen2.5-VL
2026.04
23.35
64.03
Grid-Search
Model=Qwen2.5-VL
2026.04
22.9
63.11
Fixed-Amp
Model=Qwen2.5-VL
2026.04
22.8
62.91
Ours w/o KL
Model=Qwen2.5-VL
2026.04
22.5
62.81
Base
Model=Qwen2.5-VL
2026.04
22.4
62.59
RandNeuron
Model=Qwen2.5-VL
2026.04
22.1
62.53
Ours
Model=LLaVA-1.5
2026.04
14.07
59.46
Ours w/o KL
Model=LLaVA-1.5
2026.04
13.85
57.8
Grid-Search
Model=LLaVA-1.5
2026.04
13.15
58.7
Fixed-Amp
Model=LLaVA-1.5
2026.04
12.9
57.73
Base
Model=LLaVA-1.5
2026.04
12.85
57.23
RandNeuron
Model=LLaVA-1.5
2026.04
12.6
57.18
Feedback
Search any
task
Search any
task