
AndroidWorld

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| GUI Agent Task | AndroidWorld | Success Rate: 80 | 104 |
| Mobile Task Automation | AndroidWorld (test) | Average Success Rate: 1 | 75 |
| GUI Agent | AndroidWorld | Accuracy: 62 | 70 |
| GUI navigation | AndroidWorld latest (test) | Success Rate: 76.7 | 35 |
| End-to-end GUI Navigation | AndroidWorld | Success Rate: 77.6 | 21 |
| Mobile GUI Agents | AndroidWorld 138 tasks (test) | Success Rate: 71.1 | 18 |
| Reward Modeling | AndroidWorld | Precision: 92.5 | 14 |
| End-to-End Environment Interaction | AndroidWorld (test) | Pass@1: 80.2 | 14 |
| Mobile GUI Agent Decision Making | AndroidWorld | Success Rate: 59.5 | 5 |
| Mobile operating system task execution | AndroidWorld (AW) | AUV: 43.2 | 4 |
| Evaluator Accuracy | AndroidWorld | Overall Acc: 87.9 | 3 |