Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multimodal Capability Evaluation on MM-Star
Loading...
51.6
Average Score
LLaVA-NeXT
31.32
36.585
41.85
47.115
May 27, 2024
Average Score
Updated 3d ago
Evaluation Results
Method
Method
Links
Average Score
LLaVA-NeXT
Size=34B, Feedback=X
2024.05
51.6
GPT-4V
Size=Unknown, Feedback...
2024.05
50.4
MiniGemini
Size=34B, Feedback=X
2024.05
45.5
OmniLMM + RLAIF-V
Size=12B, Feedback=self
2024.05
40.9
OmniLMM
Size=12B, Feedback=X
2024.05
39.7
LLaVA 1.5 + RLAIF-V
Size=7B, Feedback=LLaV...
2024.05
35.4
AMP-MEG
Size=13B, Feedback=Rule
2024.05
34.8
Qwen-VL-Chat
Size=10B, Feedback=X
2024.05
34.5
POVID
Size=7B, Feedback=Rule
2024.05
34.3
LLaVA-RLHF
Size=13B, Feedback=Human
2024.05
34.2
VCD
Size=7B, Feedback=X
2024.05
33.8
Silkie
Size=10B, Feedback=GPT-4V
2024.05
33.6
LLaVA 1.5
Size=7B, Feedback=X
2024.05
33.3
RLHF-V
Size=13B, Feedback=Human
2024.05
33.2
Less-is-more
Size=7B, Feedback=X
2024.05
32.9
OPERA
Size=7B, Feedback=X
2024.05
32.9
HA-DPO
Size=7B, Feedback=Rule
2024.05
32.9
CCA-LLaVA
Size=7B, Feedback=X
2024.05
32.1
Feedback
Search any
task
Search any
task