Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Large Language Model Evaluation on MME-RealWorld

43.7Reasoning

Thyme

32.98835.76938.5541.331Oct 1, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.10
43.760.158.1
2025.10
35.158.555.7
2025.10
34.253.451.1
2025.10
33.753.350.9
33.451.549.3