
GUI Agent Task Planning on AITW

General Step Accuracy: 36.34

Gemini-2.0-flash-exp + Qwen2-VL-7B (AutoGUI)

Feb 4, 2025
Updated 4d ago

Evaluation Results

Method  Links
2025.02  36.34  36.54  50.95  48.95  40.99  40.52  50.95  48.95  32.83  43.52  39.23
2025.02  26.37  18.16  28.49  26.91  30.3   22.88  41.94  28.95  20.22  22.65  29.5
2025.02  20.43  20.56  25.59  22.49  15.25  12.33  25.59  22.49  16.15  20.53  18.37
2025.02  14.85   9.58  11.17   5.76  12.08   6.85  21.09  11.24  10.89  11.22  14.01