| Dataset Name | SOTA Method | Metric | Score | Trend | Updated |
|---|---|---|---|---|---|
| Multimodal-Mind2Web Cross-Task | UGround-V1-7B | Element Accuracy | 50.7 | 16 | 4d ago |
| VWB EG (test) | UGround-V1-7B | Grounding Accuracy | 0.927 | 13 | 4d ago |
| MOTIF (test) | OS-ATLAS | Grounding Accuracy | 78.8 | 13 | 4d ago |
| ScreenSpot v2 (test) | UGround-V1-7B | Grounding Accuracy | 88 | 13 | 4d ago |
| ScreenSpot (test) | UGround-V1-7B | Grounding Accuracy | 85.9 | 13 | 4d ago |
| FuncPred (test) | Qwen2-VL-AutoGUI702k | Grounding Accuracy | 65 | 13 | 4d ago |
| Multimodal-Mind2Web (out-of-distribution) | Aria-UITH | Cross-Task Generalization | 57.6 | 10 | 4d ago |
| Multimodal-Mind2Web | UGround-V1-7B | Element Accuracy | 49.1 | 8 | 4d ago |
| Multimodal-Mind2Web Cross-Website | UGround-V1-7B | Element Accuracy | 48.1 | 8 | 4d ago |
| ScreenSpot mobile | SphAgent | Icon/Widget Grounding Score | 72.6 | 6 | 4d ago |