Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Open-ended Visual Question Answering on LLaVA Eval v1 (test)
Loading...
77.67
Conversation Score
DRESS
54.01
60.1525
66.295
72.4375
Nov 16, 2023
Conversation Score
Description Score
Reasoning Score
Average Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Conversation Score
Description Score
Reasoning Score
Average Score
DRESS
prefix=<excellent> [Ni...
2023.11
77.67
62.17
84.27
74.7
InstructBLIP
LLM=Vicuna-13B
2023.11
74.08
61.67
82.17
72.64
LLaVA-HF
LLM=Vicuna-13B
2023.11
69.74
60.87
85.33
71.98
BLIP-2
LLM=T5-XXL
2023.11
66.08
31.33
22
39.8
mPLUG
LLM=LLaMA-7B
2023.11
66.08
44.17
75.83
62.03
LLaVA
LLM=LLaMA-13B
2023.11
65.17
42.17
61.5
56.28
miniGPT4
LLM=Vicuna-13B
2023.11
54.92
51.5
74.67
60.36
Feedback
Search any
task
Search any
task