UI Operations on AITZ
[Chart: Task Metric (TM). Top result: 89.6, Qwen2-VL-72B, Sep 18, 2024.]
Updated 4d ago
Evaluation Results
Method                                Date     Task Metric (TM)  Exact Match (EM)
Qwen2-VL-72B (Model Parameters=72B)   2024.09  89.6              72.1
Previous SOTA                         2024.09  83                47.7
GPT-4o                                2024.09  70                35.3