Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Benchmarking on MMBench

83.4Score

LongVILA-7B (S3)

3.52824.2644565.736Nov 16, 2023Apr 15, 2024Sep 14, 2024Feb 12, 2025Jul 14, 2025Dec 12, 2025May 13, 2026
Updated 20d ago

Evaluation Results

MethodLinks
2024.08
83.4
2024.05
82.9
2024.02
77.34
2024.02
76.67
2024.02
76.58
2024.08
75.9
2024.07
74.38
2024.02
74.29
2026.05
73.8
2026.05
73.7
2026.05
73.6
2024.02
73.27
2024.02
73.11
2026.05
73.1
2024.08
72.7
2026.05
72.6
2026.05
71.5
2024.08
71.3
2026.05
71.1
2026.05
71.1
2024.07
70.35
2024.02
70.34
2024.08
70.3
2026.05
70.2
2024.07
70.1
2024.02
69.68
2024.07
69.5
2026.05
69.5
2026.05
69.3
2024.08
68.9
2024.07
68.7
2024.07
68.4
2024.08
68
2024.08
67.8
2024.08
67.7
2024.08
67.7
2024.08
65.9
2024.07
65.8
2024.05
65.7
2024.02
65.24
2024.08
65.2
2024.07
65
2024.08
64.6
2024.07
64.3
2024.08
64.3
2024.08
64.3
2023.11
60.9
2023.11
60.9
2024.08
60.6
2024.08
60.6
2024.07
60
2024.07
59.8
2023.11
59.5
2024.07
58.8
2024.08
58.8
2024.07
58.3
2024.07
58.2
2023.11
58.2
2024.05
57
2024.08
54.5
2024.02
48.3
2024.08
48.2
2024.07
38.7
2024.07
38.2
2024.08
38.2
2024.08
38.2
2023.11
38.1
2024.02
36.2
2024.07
36
2024.08
36
2024.08
36
2023.11
23
2024.07
6.6