Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mobile Agent Evaluation on GUI-Odyssey (test)
Loading...
87.02
Grounding
Aria-UIIH
16.9656
35.1528
53.34
71.5272
Dec 20, 2024
Grounding
Task Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Grounding
Task Success Rate
Aria-UIIH
Evaluation Protocol=W....
2024.12
87.02
3,730
Aria-UITH
Evaluation Protocol=W....
2024.12
86.75
3,647
Aria-UI
Evaluation Protocol=W....
2024.12
84.57
3,187
Aria-UI
Evaluation Protocol=Ze...
2024.12
64.81
528
UGround
Evaluation Protocol=Ze...
2024.12
50.25
202
Qwen2-VL
Evaluation Protocol=Ze...
2024.12
49.56
200
SeeClick
Evaluation Protocol=Ze...
2024.12
45.19
145
GPT-4o
Evaluation Protocol=Ze...
2024.12
19.66
5
Feedback
Search any
task
Search any
task