Multimodal Large Language Model Evaluation on MME-RealWorld

43.7 Reasoning

Thyme

Updated 4mo ago

Evaluation Results

Method	Links	Reasoning		
Thyme 2025.10		43.7	60.1	58.1
UG-Search 2025.10		35.1	58.5	55.7
ViCrop 2025.10		34.2	53.4	51.1
TextCoT 2025.10		33.7	53.3	50.9
QwenVL2.5-7B 2025.10		33.4	51.5	49.3