Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-modal Question Answering on MMStar (test)
Loading...
72.7
Accuracy
Proprietary API SOTA (SenseTime, 2024)
11.132
27.116
43.1
59.084
Jan 21, 2025
Apr 6, 2025
Jun 20, 2025
Sep 4, 2025
Nov 18, 2025
Feb 1, 2026
Apr 18, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Proprietary API SOTA (SenseTime, 2024)
Model Type=Proprietary...
2025.01
72.7
Open-Source SOTA (Chen et al., 2024d)
Model Type=Open-Source...
2025.01
63.2
IXC-2.5
Model Type=Open-Source...
2025.01
59.9
IXC-2.5-Chat
Model Type=Open-Source...
2025.01
59.6
PivotMerge
Partitioning=Clustering
2026.04
27.5
TSV-M
Partitioning=Clustering
2026.04
27.3
CC12M Split 5
Partitioning=Clustering
2026.04
26.7
CC12M Split 3
Partitioning=Clustering
2026.04
26.5
CC12M Split 2
Partitioning=Clustering
2026.04
25.7
Task Arithmetic
Partitioning=Clustering
2026.04
25.7
TIES Merging
Partitioning=Clustering
2026.04
25.1
Weight Average
Partitioning=Clustering
2026.04
23.9
TIES w/ DARE
Partitioning=Clustering
2026.04
23.7
MetaGPT
Partitioning=Clustering
2026.04
23.3
CC12M Split 4
Partitioning=Clustering
2026.04
22.9
CC12M Split 1
Partitioning=Clustering
2026.04
20
Iso-C
Partitioning=Clustering
2026.04
13.5
Feedback
Search any
task
Search any
task