GUI reasoning on Aggregate (GUI-Act-Web, OmniAct-Web, AndroidControl)

74.6 Overall Accuracy

CAPO

Updated 4mo ago

Evaluation Results

Method	Date	Overall Accuracy
CAPO	2025.12	74.6
GRPO	2025.12	70.79
QwenVL2.5-3B	2025.12	54.09
Os-Atlas-4B	2025.12	49.75