Autonomous Continual Learning for Environment Adaptation of Computer-Use Agents

About

Real-world digital environments are highly diverse and dynamic. These characteristics cause agents to frequently encounter unseen environments and distribution shifts, making continual learning in such environments essential for computer-use agents (CUAs). However, a key challenge lies in obtaining high-quality and environment-grounded training data without relying on costly human annotation. In this work, we introduce ACuRL, an Autonomous Curriculum Reinforcement Learning framework that continually adapts agents to specific environments with zero human data. The agent first explores an environment to acquire initial experiences. During subsequent iterative training, a curriculum task generator leverages these experiences together with feedback from the previous iteration to synthesize new tasks tailored for the agent's current capabilities. To provide reliable reward signals, we introduce CUAJudge, a robust automatic evaluator for CUAs that achieves 93% agreement with human judgments. Empirically, our method effectively enables both intra-environment and cross-environment continual learning, yielding 3-29% absolute performance gains on the target environments without catastrophic forgetting on others. We also show that it can mitigate performance degradation under environment changes (e.g., version updates, platform migration, and resolution shifts). Further analyses show highly sparse updates (e.g., only 20% parameters), which helps explain the effective and robust adaptation.

Tianci Xue, Zeyi Liao, Tianneng Shi, Zilu Wang, Kai Zhang, Dawn Song, Yu Su, Huan Sun• 2026

Related benchmarks

Task	Dataset	Result
Computer-use Task Execution	Impress	Success Rate40.7	19
Computer-use Task Execution	Calc	Success Rate9	19
Computer-use Task Execution	Writer	Success Rate15.6	19
Computer-use Task Execution	Thunderbird	Success Rate57.8	19
Computer-use Task Execution	KAlgebra	Success Rate28	19
Computer-use Task Execution	Celestia	Success Rate24.2	19
Computer-use Task Execution	All Environments Overall	Success Rate24.5	19
GUI task automation	Cross-Environment Adaptation Suite (Impress, Calc, Writer, Thunderbird, KAlgebra, Celestia) 1.0 (test)	Impress Score40.7	7

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord