Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multimodal Explanation on VQA-X
Loading...
94.11
F1 Score
LLaVA
32.6876
48.6338
64.58
80.5262
Dec 7, 2025
F1 Score
BLEU-1
BLEU-2
BLEU-3
BLEU-4
BERTScore
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
BLEU-1
BLEU-2
BLEU-3
BLEU-4
BERTScore
LLaVA
fine-tuned=true
2025.12
94.11
0.07
0.022
0.011
0.008
0.679
PaLiGemma
fine-tuned=true
2025.12
94.08
0.112
0.034
0.026
0.024
0.712
LLaVA
shots=2, fine-tuned=false
2025.12
91.11
0.033
0.014
0.007
0.004
0.653
LLaVA
shots=0, fine-tuned=false
2025.12
88.12
0.045
0.022
0.011
0.002
0.642
PaLiGemma
shots=2, fine-tuned=false
2025.12
35.74
0.004
0.001
0.001
0
0.651
PaLiGemma
shots=0, fine-tuned=false
2025.12
35.05
0.004
0.001
0.001
0
0.653
Feedback
Search any
task
Search any
task