Large Multi-modal Model Evaluation on LLaVA-Bench In-the-Wild v1
[Chart: Conversational Score, Detail Retention Score, Reasoning Score, and Overall Score by method. Highlighted point: LLaVA-Plus, Conversational Score 65.5, Nov 9, 2023.]
Updated 3d ago
Evaluation Results
Method      Setting                    Date     Conversational Score  Detail Retention Score  Reasoning Score  Overall Score
LLaVA-Plus  tool_usage=All Tools       2023.11  65.5                  56.8                    79.1             69.5
LLaVA-Plus  tool_usage=Fly             2023.11  45.2                  50.4                    72.6             59.1
LLaVA       -                          2023.11  42.6                  51.9                    68.9             57.1
LLaVA       evaluation_protocol=To...  2023.11  40.7                  48.1                    51.2             47.5
LLaVA-Plus  tool_usage=Fly, reason...  2023.11  38.8                  39.8                    59.8             48.7
GPT4Tools   -                          2023.11  31.1                  27.1                    54.1             40.7