Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Real-World Understanding on MME RealWorld
Loading...
71.6
Score
Gemini 2.5 Pro
56.728
60.589
64.45
68.311
Nov 7, 2025
Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Score
Gemini 2.5 Pro
Tool=Code, Param Size=-
2025.11
71.6
DeepEyesV2
Tool=General, Param Si...
2025.11
64.9
DeepEyesV2
Tool=General, Param Si...
2025.11
64.9
Thyme
Tool=Code, Param Size=7B
2025.11
64.8
Thyme
Tool=Code, Param Size=7B
2025.11
64.8
Pixel-Reasoner
Tool=Crop, Param Size=7B
2025.11
64.4
Pixel-Reasoner
Tool=Crop, Param Size=7B
2025.11
64.4
LLaVA-OV
Tool=✗, Param Size=7B
2025.11
57.4
LLaVA-OV
Tool=✗, Param Size=7B
2025.11
57.4
Qwen2.5-VL
Tool=✗, Param Size=7B
2025.11
57.3
Qwen2.5-VL
Tool=✗, Param Size=7B
2025.11
57.3
Feedback
Search any
task
Search any
task