Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Human Preferences on WildVision 0617
Loading...
89.4
Score
GPT-4o
8.592
29.571
50.55
71.529
Dec 6, 2024
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
GPT-4o
2024.12
89.4
LLaVA-OV-7B
Model Scale=7B
2024.12
53.8
Qwen2-VL-72B
Model Scale=72B
2024.12
52.3
LLaVA-OV-72B
Model Scale=72B
2024.12
52.3
MAmmoTH-VL-8B
Model Scale=8B, Single...
2024.12
51.9
InternVL-2-8B
Model Scale=8B
2024.12
51.5
MAmmoTH-VL-8B
Model Scale=8B
2024.12
51.1
Claude-3.5-Sonnet
2024.12
50
Llama-3.2-11B-Vision-Ins
Model Scale=11B
2024.12
49.7
LLaVA-OV-72B
Model Scale=72B, Singl...
2024.12
49.5
Qwen2-VL-7B-Ins
Model Scale=7B
2024.12
44
Molmo-7B-D
Model Scale=7B
2024.12
40
LLaVA-OV-7B
Model Scale=7B, Single...
2024.12
39.2
MiniCPM-V-2.6-7B
Model Scale=7B
2024.12
11.7
Feedback
Search any
task
Search any
task