Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Evaluation on MMT-Bench

62.65Accuracy

Mix + CL + CARE

57.564458.884760.20561.5253Dec 16, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.12
62.65
2025.12
62.62
2025.12
62.62
2025.12
62.52
2025.12
61.91
2025.12
61.18
2025.12
60.63
2025.12
60.47
2025.12
59.96
2025.12
59.83
2025.12
59.64
2025.12
58.94
2025.12
57.76