Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MaskClaw: Edge-Side Personalized Privacy Arbitration for GUI Agents with Behavior-Driven Skill Evolution

About

GUI agents rely on screenshots to infer intent and operate across applications, but these screenshots often contain private messages, medical records, payment credentials, and workplace-specific workflows. Privacy decisions in this setting depend on task, recipient, application state, and user role, yet static PII detectors miss these boundaries and cloud-side VLM reasoning can upload the raw screen before deciding what should be protected. We present MaskClaw, an edge-side privacy arbitrator for GUI agents. MaskClaw extracts local visual evidence, retrieves user- and task-specific policy memory, and decides Allow, Mask, or Ask before raw screenshots leave a trusted user- or organization-controlled environment. In five designed skill-evolution scenarios, it turns corrections, cancellations, and edits into reusable privacy skills checked by a sandbox gate. We introduce P-GUI-Evo, a benchmark built from real UI patterns, reconstructed HTML screens, and sanitized labels. Experiments show that pattern matching, cloud reasoning, and routing alone tend to over-confirm, over-mask, or expose raw screenshots under the same protocol. The artifact is available at https://github.com/Theodora-Y/MaskClaw.

Yanqiu Zhao, Dongying Zheng, Kaibo Huang, Yukun Wei, Zhongliang Yang, Linna Zhou• 2026

Related benchmarks

TaskDatasetResultRank
Contextual Exposure Policy Arbitration832-sample benchmark explicit routing exposure
Accuracy717
7
Skill EvolutioniCloud photo cleanup
Best@2086.67
1
Skill EvolutionApp permission
Best@2093.67
1
Skill EvolutionHigh-value transfer
Best@2092.96
1
Skill EvolutionCalendar merge
Best@2077.7
1
Skill EvolutionDriving mode
Best@2092.78
1
Showing 6 of 6 rows

Other info

Follow for update