Computer Use on OSWorld (test)
[Chart: Success Rate over time. Highlighted point: UI-TARS 1.5, Success Rate 42.5, May 11, 2025]
Updated 4d ago
Evaluation Results
Method                              Date      Success Rate
UI-TARS 1.5                         2025.05   42.5
OpenAI CUA                          2025.05   38.1
Seed 1.5-VL                         2025.05   36.7
Claude 3.7 Sonnet                   2025.05   28
Qwen 2.5 VL 72B (Parameters=72B)    2025.05   8.8
Kimi VL-A3B                         2025.05   8.2