Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Visual Question Answering on PlantVillage VQA (test)
Loading...
2
BLEU-4
AgriChat
-0.0488
0.4831
1.015
1.5469
Mar 14, 2026
BLEU-4
ROUGE-2
METEOR
BERTScore
LongCLIP
T5 Cosine Similarity
SBERT
LLM Judge Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
BLEU-4
ROUGE-2
METEOR
BERTScore
LongCLIP
T5 Cosine Similarity
SBERT
LLM Judge Accuracy
AgriChat
Model Scale=7B
2026.03
2
3.18
19.52
83.58
75.2
44.8
32.13
74.26
LLaVA-OneVision
Model Scale=7B
2026.03
0.14
0.65
17.25
86.02
80.65
51.65
41.64
57.41
Llama-3.2
Model Scale=11B
2026.03
0.08
0.5
6.72
80.37
79.11
37.85
20.95
54.44
Qwen-2.5
Model Scale=7B
2026.03
0.03
0.22
3.43
79.12
79.89
28.2
10.9
53.21
Feedback
Search any
task
Search any
task