
Android GUI Evaluation Benchmark

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Mobile GUI Interaction | Android GUI Evaluation Benchmark 500 human-annotated trajectories (test) | Accuracy: 78.55 | 11