Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multimodal Reasoning on R1-Onevision-Bench (Overall)

39.2Accuracy

MSSR

2125.72530.4535.175Dec 20, 2025Dec 28, 2025Jan 6, 2026Jan 15, 2026Jan 23, 2026Feb 1, 2026Feb 10, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2025.12
39.2--
2025.12
39.1--
2025.12
38.5--
2025.12
38.4--
2025.12
37.7--
2025.12
35.7--
2025.12
35.2--
2025.12
34.7--
2025.12
34.6--
2026.02
34.1442.613
2025.12
34--
2026.02
33.8621.218.4
2026.02
33.3112.93.4
2026.02
32384.912
2025.12
30.2--
2026.02
29.1656.222.5
2025.12
29--
2026.02
28.9111.33.9
2025.12
28.8--
2026.02
28.7446.915.6
2026.02
2838013.6
2025.12
27.6--
2025.12
21.7--