Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Subjective Evaluation on WildBench

0.8604Score

STEP3-VL-10B

0.31960.460.60040.7408Jan 14, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
0.8604
2026.01
0.7236
2026.01
0.6309
2026.01
0.5645
2026.01
0.3404