Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Benchmarking on MMB 1.1

82.2Accuracy

GPT-4o

27.28841.54455.870.056Sep 25, 2025
Updated 13d ago

Evaluation Results

MethodLinks
2025.09
82.2
2025.09
81.2
78.5
2025.09
77.4
75.3
2025.09
29.4