
General Perception and Reasoning on MME-RealWorld Lite

54.3 Overall Accuracy

Deepeyes

[Chart: scores over time. Y-axis ticks: 38.492, 42.596, 46.7, 50.804. X-axis: May 20, 2025 to May 12, 2026.]
Updated 21d ago

Evaluation Results

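Method | Links

Date | Overall Accuracy | remaining scores
2026.03 | 54.3 | - | - | - | - | - | - | - | - | - | 50.5 | 56.6
2026.05 | 54.3 | - | - | - | - | - | - | - | - | - | 50.5 | 56.6
2026.03 | 53.9 | - | - | - | - | - | - | - | - | - | 45 | 58.5
2025.05 | 53.2 | 90 | 52.7 | 89 | 43.3 | 33.4 | 76 | 69 | 44 | 35 | - | -
2026.03 | 52 | - | - | - | - | - | - | - | - | - | 48.3 | 54.4
2026.05 | 52 | - | - | - | - | - | - | - | - | - | 48.3 | 54.4
2026.05 | 50.7 | - | - | - | - | - | - | - | - | - | 44.7 | 54.5
2026.03 | 50.1 | - | - | - | - | - | - | - | - | - | 42.5 | 54.1
2025.05 | 49.7 | 89.6 | 52 | 86 | 38.9 | 30.9 | 71 | 72 | 46 | 32.5 | - | -
2026.05 | 49.7 | - | - | - | - | - | - | - | - | - | 44.5 | 53.1
2026.03 | 48.6 | - | - | - | - | - | - | - | - | - | 44.8 | 51
2026.05 | 48.2 | - | - | - | - | - | - | - | - | - | 42.9 | 51.6
2026.05 | 46.9 | - | - | - | - | - | - | - | - | - | 40.3 | 51.2
2026.05 | 46.2 | - | - | - | - | - | - | - | - | - | 43.1 | 48.2
2026.03 | 45.8 | - | - | - | - | - | - | - | - | - | 39.7 | 49.6
2025.05 | 45.6 | 87.2 | 40.7 | 83 | 29.5 | 40.7 | 74 | 60 | 27.3 | 29.5 | - | -
2026.05 | 45.6 | - | - | - | - | - | - | - | - | - | 36.3 | 51.6
2026.05 | 45.5 | - | - | - | - | - | - | - | - | - | 39.9 | 49.1
2025.05 | 43.7 | 80 | 40 | 56 | 31.7 | 39.4 | 65 | 33 | 38 | 32 | - | -
2025.05 | 42.3 | 87.6 | 32.7 | 83 | 27.3 | 30 | 72 | 62 | 28.7 | 23 | - | -
2026.05 | 39.1 | - | - | - | - | - | - | - | - | - | 37.5 | 40.1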