Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-modal Understanding on LLaVA-Bench Wild

91.2LLaVA^W Score

GPT4V

35.97650.31364.6578.987Nov 27, 2023Feb 25, 2024May 26, 2024Aug 25, 2024Nov 24, 2024Feb 23, 2025May 25, 2025
Updated 2d ago

Evaluation Results

MethodLinks
2023.11
91.2--
2023.11
88.8--
2023.11
88.2--
2024.07
86.6--
2025.05
8268.8-
2024.07
81.5--
2023.12
77.5--
2023.12
75.7--
2025.05
75.765.16-
2024.05
75.2--
2023.11
74.9--
2025.05
74.767.03-
2023.11
73.5--
2023.12
72.9--
2024.12
71.3-106.7
2023.11
71--
2023.12
70.7--
2023.12
70.7--
2024.06
70.7--
2024.05
70.7--
2025.05
70.765.93-
2024.06
70.1--
2023.11
70--
2023.11
69.8--
2023.11
68.5--
2024.05
68.4--
2025.05
67.445.84-
2023.12
67.1--
2024.12
66.8-100
2024.12
66.7-99.9
2023.12
66.3--
2025.05
65.763.12-
2024.06
65.1--
2024.12
63.5-95.1
2023.12
63.4--
2024.06
63.4--
2024.05
63.4--
2025.05
63.462.59-
2025.05
63--
2025.05
62.744.5-
2023.12
60.9--
2024.06
60.9--
2024.05
60.9--
2023.12
58.2--
2024.05
58.2--
2025.05
58.243.86-
2023.11
47.9--
2023.11
47.2--
2023.11
45.4--
2023.12
38.1--
2024.05
38.1--
2025.05
38.1--