Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-task Multimodal Understanding on MMT-Bench (val)

72.7Score

GPT-5

60.42863.61466.869.986Apr 30, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
72.7
2026.04
70.9
2026.04
70.7
2026.04
70.4
2026.04
70
2026.04
69.7
2026.04
69.7
2026.04
68.1
2026.04
66.7
2026.04
60.9