Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Mathematical Reasoning on Aggregate Math Benchmarks

87.47Overall Macro Score

Qwen3.5-397B-A17B*

61.490868.235474.9881.7246May 10, 2026
Updated 15d ago

Evaluation Results

MethodLinks
2026.05
87.47
2026.05
79.51
2026.05
76.55
2026.05
73.42
2026.05
63.18
2026.05
62.49