Evaluator Accuracy on AndroidWorld

87.9 Overall Acc

StepCritic

[Chart: Overall Acc by date; Apr 27, 2025]
Updated 4d ago

Evaluation Results

Method  Links
2025.04  87.9  92.8  82.3
2025.04  84.6  -  -
2025.04  82.4  -  -