GUI Automation on TreeCUA OOD benchmark 1.0 (test)
[Chart: SR by date. TreeCUA-DPO-7B: 3,080 SR (Feb 10, 2026)]
Updated 4d ago
Evaluation Results
Method          Details                     Date     SR
TreeCUA-DPO-7B  Backbone=Qwen2.5-VL-7B...   2026.02  3,080
TreeCUA-7B      Backbone=Qwen2.5-VL-7B...   2026.02  2,670
Qwen2.5-VL-7B   Model Family=Qwen, Par...   2026.02  80