Multi-modal Understanding on LLaVA-Bench Wild

Best LLaVA^W Score: 91.2 (GPT4V)

Updated 3mo ago

Evaluation Results

Method (date)	LLaVA^W Score
GPT4V 2023.11	91.2	-	-
GPT4V 2023.11	88.8	-	-
Qwen-VL-Max + AutoV 2025.06	88.4	-	-
GPT4V 2023.11	88.2	-	-
VILA2-8B 2024.07	86.6	-	-
Qwen-VL-Max 2025.06	85.8	-	-
ASPO 2025.05	82	68.8	-
MM1-7B-Chat 2024.07	81.5	-	-
Gemini-1.5-Pro + AutoV 2025.06	81.1	-	-
Gemini-1.5-Pro 2025.06	77.9	-	-
Honeybee 2023.12	77.5	-	-
DOP-OBC 2026.04	77.3	-	-
DOP-OBC 2026.04	76.4	-	-
DOP-OBC 2026.04	75.9	-	-
Honeybee 2023.12	75.7	-	-
ASPO 2025.05	75.7	65.16	-
AlignGPT 2024.05	75.2	-	-
CAMD 2026.03	75	-	-
CAMD 2026.03	75	-	-
LLaVA-1.5-13B 2023.11	74.9	-	-
DPO 2025.05	74.7	67.03	-
FarSight 2026.03	74.7	-	-
FarSight 2026.04	74.7	-	-
FarSight 2026.03	74.5	-	-
CAMD 2026.03	73.6	-	-
LLaVA-1.5-13B 2023.11	73.5	-	-
Video-LLaVA 2026.03	73.1	-	-
Base 2026.04	73.1	-	-
Honeybee 2023.12	72.9	-	-
FarSight 2026.03	72.6	-	-
LLaVA-1.5 2026.03	72.5	-	-
Base 2026.04	72.5	-	-
OPERA 2026.03	72	-	-
OPERA 2026.04	72	-	-
VisionZip 2024.12	71.3	-	106.7
CGD 2026.03	71.3	-	-
Sphinx 2023.11	71	-	-
GPT-4o + AutoV 2025.06	70.9	-	-
VCD 2026.03	70.9	-	-
VCD 2026.04	70.9	-	-
LLaVA-1.5 2023.12	70.7	-	-
LLaVA-v1.5 2023.12	70.7	-	-
LLaVA-1.5-13B 2024.06	70.7	-	-
LLaVA-1.5 2024.05	70.7	-	-
LLaVA-1.5-13B 2025.05	70.7	65.93	-
Chat-UniVi 2026.03	70.4	-	-
Base 2026.04	70.4	-	-
LLaVA-Instruct-13B 2024.06	70.1	-	-
Sphinx 2023.11	70	-	-
Sphinx 2023.11	69.8	-	-
ICD 2026.03	69.7	-	-
ICD 2026.04	69.7	-	-
LLaVA-1.5-13B 2023.11	68.5	-	-
AlignGPT 2024.05	68.4	-	-
GPT-4o 2025.06	68	-	-
ASPO 2025.05	67.4	45.84	-
Honeybee 2023.12	67.1	-	-
Vanilla 2024.12	66.8	-	100
VisionZip 2024.12	66.7	-	99.9
Honeybee 2023.12	66.3	-	-
DPO 2025.05	65.7	63.12	-
LLaVA-Instruct-7B 2024.06	65.1	-	-
DOP-OBC 2026.04	64.3	-	-
VisionZip 2024.12	63.5	-	95.1
LLaVA-1.5 2023.12	63.4	-	-
LLaVA-1.5-7B 2024.06	63.4	-	-
LLaVA-1.5 2024.05	63.4	-	-
LLaVA-1.5-7B 2025.05	63.4	62.59	-
LLaVA-7B 2025.05	63	-	-
DPO 2025.05	62.7	44.5	-
CAMD 2026.03	61.2	-	-
FarSight 2026.03	61	-	-
InstructBLIP 2023.12	60.9	-	-
InstructBLIP-7B 2024.06	60.9	-	-
InstructBLIP 2024.05	60.9	-	-
InstructBLIP 2023.12	58.2	-	-
InstructBLIP 2024.05	58.2	-	-
InstructBLIP-13B 2025.05	58.2	43.86	-
InstructBLIP 2026.03	58.2	-	-
Base 2026.04	58.2	-	-
InstructBLIP-13B 2023.11	47.9	-	-
InstructBLIP-13B 2023.11	47.2	-	-
InstructBLIP-13B 2023.11	45.4	-	-
BLIP-2 2023.12	38.1	-	-
BLIP-2 2024.05	38.1	-	-
BLIP-2-7B 2025.05	38.1	-	-