Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Human Preferences on L-Wilder small
Loading...
85.9
Preference Score
GPT-4o
32.028
46.014
60
73.986
Dec 6, 2024
Preference Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Preference Score
GPT-4o
2024.12
85.9
Claude-3.5-Sonnet
2024.12
83.1
LLaVA-OV-72B
Model Scale=72B, Singl...
2024.12
72.9
LLaVA-OV-72B
Model Scale=72B
2024.12
72
MAmmoTH-VL-8B
Model Scale=8B, Single...
2024.12
71.3
MAmmoTH-VL-8B
Model Scale=8B
2024.12
70.8
LLaVA-OV-7B
Model Scale=7B, Single...
2024.12
69.1
LLaVA-OV-7B
Model Scale=7B
2024.12
67.8
Qwen2-VL-7B-Ins
Model Scale=7B
2024.12
66.3
InternVL-2-8B
Model Scale=8B
2024.12
62.5
Llama-3.2-11B-Vision-Ins
Model Scale=11B
2024.12
62
INXComp-2.5-7B
Model Scale=2.5B
2024.12
61.4
Qwen2-VL-72B
Model Scale=72B
2024.12
53.6
Cambrian-1-8B
Model Scale=8B
2024.12
34.1
Feedback
Search any
task
Search any
task