
AndroidControl

Benchmarks

| Task Name | Dataset Name | Metric | SOTA Result | Trend |
| --- | --- | --- | --- | --- |
| GUI planning | AndroidControl Low | SR (%) | 86.4 | 31 |
| GUI Automation | AndroidControl High | Task Match (TM) | 83.7 | 27 |
| GUI reasoning | AndroidControl Low | SR | 91.8 | 24 |
| UI Navigation | AndroidControl (offline) | Step Success Rate | 79.1 | 23 |
| GUI Understanding | AndroidControl High | Task Match Rate (TM) | 83.7 | 22 |
| Mobile Agent Evaluation | AndroidControl Low (test) | Task Success Rate | 93.7 | 22 |
| GUI planning | AndroidControl High | SR | 67.5 | 21 |
| Step success rate | AndroidControl Task-Unseen | SSR | 62.3 | 20 |
| GUI Navigation | AndroidControl High | SR (Success Rate) | 76.3 | 17 |
| General Agent Capability | AndroidControl High | Type Rate | 84.7 | 17 |
| High-level instruction following | AndroidControl | Step Accuracy | 72.7 | 16 |
| Short-term planning | AndroidControl Low | Type | 85.17 | 16 |
| General Agent Capability | AndroidControl Low | Type Score | 97.2 | 15 |
| Step Execution | AndroidControl High | Step Success Rate | 75.3 | 15 |
| Mobile Agent Navigation | AndroidControl 1.0 (test) | Step Success Rate (SR) | 72.9 | 15 |
| Mobile GUI Agent Execution | AndroidControl Curated Easy | Type Success Rate | 88.6 | 15 |
| Mobile GUI Agent Execution | AndroidControl Curated-Hard | Type Rate | 80.8 | 15 |
| Mobile UI Automation | AndroidControl High (test) | Success Rate (SR) | 80.3 | 14 |
| Mobile navigation | AndroidControl | Step Accuracy | 76.1 | 14 |
| Long-term Planning | AndroidControl High | Type Rate | 71.79 | 14 |
| GUI Automation | AndroidControl AC-Real | PG | 32.4 | 13 |
| Step success rate | AndroidControl (Cat-Unseen) | Step Success Rate (SSR) | 61.2 | 10 |
| Step success rate | AndroidControl (IDD) | Step Success Rate (SSR) | 69.4 | 10 |
| GUI Interaction Control | AndroidControl (High) | Type Score | 86.7 | 10 |
| High-level instruction execution | AndroidControl task-UN | Step Accuracy | 72.2 | 8 |
Showing 25 of 40 rows