Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General Multimodal Reasoning on General Benchmarks

57.8Top-1 Accuracy

MUPO

51.0452.79554.5556.305Apr 1, 2026
Updated 16d ago

Evaluation Results

MethodLinks
2026.04
57.865.2
2026.04
55.459.2
2026.04
55.159.3
2026.04
53.957.7
2026.04
53.158.9
2026.04
51.362.3