Utility Assessment on LLaVA-Bench In-the-Wild
Utility Score: 7.88 (Base), Apr 17, 2026
Evaluation Results
Method    Model                  Date      Utility Score
Base      LLaVA-v1.6-Mistr...    2026.04   7.88
CB        LLaVA-v1.6-Mistr...    2026.04   7.06
Beam      LLaVA-v1.6-Mistr...    2026.04   6.45
Safety    LLaVA-v1.6-Mistr...    2026.04   6.36