Open-ended Visual Chat on LLaVA-Bench In-the-Wild (full)
[Chart: Reasoning Score. Top entry: Bing-Chat-0629, 90.1 (Sep 18, 2023). Metrics: Reasoning Score, Conversation Score, Detail Score, Overall Score.]
Updated 3d ago
Evaluation Results
| Method | Links | Reasoning Score | Conversation Score | Detail Score | Overall Score |
|---|---|---|---|---|---|
| Bing-Chat-0629 | 2023.09 | 90.1 | 59.6 | 52.2 | 71.5 |
| LLaVA-65B (beam search size=5) | 2023.09 | 88.7 | 59.4 | 65.7 | 74.4 |
| LLaVA-65B (beam search size=1) | 2023.09 | 87.3 | 63.8 | 62.3 | 74.2 |
| LLaVA-13B (beam search size=5) | 2023.09 | 84.3 | 68.4 | 59.9 | 73.5 |
| LLaVA-33B (beam search size=5) | 2023.09 | 83.5 | 72.6 | 61.9 | 74.8 |
| LLaVA-33B (beam search size=1) | 2023.09 | 82.9 | 70.2 | 62.6 | 73.9 |
| LLaVA-13B (beam search size=1) | 2023.09 | 81.7 | 64.3 | 55.9 | 70.1 |
| Bard-0718 | 2023.09 | 78.7 | 83.7 | 69.7 | 77.8 |