GUI Agent Navigation on GUI-Odyssey (test)
[Chart: Type Accuracy on GUI-Odyssey (test); top entry WebFactory-3B at 66, Mar 5, 2026. Selectable metrics: Type Accuracy, Grounding Rate (GR), Success Rate (SR).]
Evaluation Results
Method         Setting          Date      Type Accuracy   Grounding Rate (GR)   Success Rate (SR)
WebFactory-3B  RL Fine-Tuning   2026.03   66              48.1                  40.9
GUI-R1-3B      RL Fine-Tuning   2026.03   54.8            41.5                  41.3
QwenVL2.5-3B   Zero-Shot        2026.03   38.4            27.2                  27.2
GPT-4o         Zero-Shot        2026.03   37.5            14.2                  5.4