
Mobile GUI Interaction on Android GUI Evaluation Benchmark: 500 human-annotated trajectories (test)

Mobile-R1: 78.55 Accuracy (Jun 25, 2025)

Evaluation Results

Method  Links
2025.06  78.55  30.6  37.4  241
2025.06  77.69  29.4  36    255
2025.06  75.9   30.4  -     280
2025.06  75.68  24.4  29.8  280
2025.06  71.72  32.6  -     298
2025.06  71.65  30    -     338
2025.06  63.46  12.8  21.6  523
2025.06  61.29  12    -     461
2025.06  59.12  9.4   -     473
2025.06  56.13  17.2  -     451
2025.06  54.49  7.2   16.33651