Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-modal Understanding on MMMU Pro (Overall)
Loading...
38.3
Score
Vanilla
33.828
34.989
36.15
37.311
Dec 9, 2024
Score
Updated 25d ago
Evaluation Results
Method
Method
Links
Score
Vanilla
Mode=Upper bound
2024.12
38.3
SparseVLM
Image token reduction...
2024.12
38.3
VisionZip
Image token reduction...
2024.12
38.2
iLLaVA
Image token reduction...
2024.12
38.1
PyramidDrop
Image token reduction...
2024.12
37.8
iLLaVA
Image token reduction...
2024.12
37.8
FasterVLM
Image token reduction...
2024.12
37.7
VisionZip
Image token reduction...
2024.12
37.5
SparseVLM
Image token reduction...
2024.12
36.6
iLLaVA
Image token reduction...
2024.12
36.6
FasterVLM
Image token reduction...
2024.12
36.5
PyramidDrop
Image token reduction...
2024.12
36.4
FasterVLM
Image token reduction...
2024.12
36.4
SparseVLM
Image token reduction...
2024.12
36.2
VisionZip
Image token reduction...
2024.12
35.9
PyramidDrop
Image token reduction...
2024.12
34
Feedback
Search any
task
Search any
task