Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Utility Evaluation on LLaVA-Bench Coco
Loading...
92.3
Score
ShareGPT4V
76.7
80.75
84.8
88.85
Oct 11, 2024
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
ShareGPT4V
2024.10
92.3
CMRM_sample
Backbone=ShareGPT4V
2024.10
91.4
CMRM_dataset
Backbone=LLaVA-v1.5-13B
2024.10
90.5
VLGuard Mixed
Backbone=LLaVA-v1.5-13B
2024.10
90.4
CMRM_dataset
Backbone=ShareGPT4V
2024.10
90.1
LLaVA-v1.5-13B
2024.10
89.7
CMRM_sample
Backbone=LLaVA-v1.5-13B
2024.10
89.6
VLGuard PH
Backbone=LLaVA-v1.5-7B
2024.10
88.1
VLGuard Mixed
Backbone=LLaVA-v1.5-7B
2024.10
87.8
VLGuard PH
Backbone=LLaVA-v1.5-13B
2024.10
87.5
LLaVA-v1.5-7B
2024.10
79.2
CMRM_dataset
Backbone=LLaVA-v1.5-7B
2024.10
78.7
CMRM_sample
Backbone=LLaVA-v1.5-7B
2024.10
77.3
Feedback
Search any
task
Search any
task