Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Visual Understanding on RealWorldQA

68.23Accuracy (Clean)

Robust-R1 (SFT)

38.870846.492954.11561.7371Dec 19, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
68.2367.5867.3263.92
2025.12
67.7166.467.0563.26
2025.12
65.2264.9663.3960.65
2025.12
57.3858.1657.6454.9
2025.12
55.4254.7753.7252.81
2025.12
43.2642.4842.6141.43
2025.12
4039.7339.4738.69