Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction

About

Large language models (LLMs) are moving beyond static uses and are now powering agents that learn continually during their interaction with external environments. For example, agents can learn reusable skills while navigating web pages or toggling new tools. However, existing methods for skill learning often create skills that are over-specialized to a single website and fail to generalize. We introduce PolySkill, a new framework that enables agents to learn generalizable and compositional skills. The core idea, inspired by polymorphism in software engineering, is to decouple a skill's abstract goal (what it accomplishes) and its concrete implementation (how it is executed). Experiments show that our method (1) improves skill reuse by 1.7x on seen websites and (2) boosts success rates by up to 9.4% on Mind2Web and 13.9% on unseen websites, while reducing steps by over 20%. (3) In self-exploration settings without specified tasks, our framework improves the quality of proposed tasks and enables agents to learn generalizable skills that work across different sites. By enabling the agent to identify and refine its own goals, the PolySkill enhances the agent's ability to learn a better curriculum, leading to the acquisition of more generalizable skills compared to baseline methods. This work provides a practical path toward building agents capable of continual learning in adaptive environments. Our findings show that separating a skill's goal from its execution is a crucial step toward developing autonomous agents that can learn and generalize across the open web continuously. Our code can be found in https://github.com/simonucl/PolySkill.

Simon Yu, Gang Li, Weiyan Shi, Peng Qi• 2025

Related benchmarks

TaskDatasetResultRank
Web navigation and task completionWebArena (test)
Average Task Completion59.5
137
Web Agent NavigationMIND2WEB Cross-Task 1.0
Success Rate49.2
26
Web Agent NavigationMIND2WEB Cross-Domain 1.0
Success Rate46
26
Web Navigation Task CompletionMind2Web Cross-Task
Success Rate64.6
18
Web Navigation Task CompletionMind2Web (Cross-website 177)
Success Rate66.2
14
Task-free explorationWA Shopping
Success Rate43.1
10
Task-free explorationAMZ
SR (%)70.5
10
Task-free explorationTarget
SR (%)77.3
10
Web navigationMIND2WEB Cross-Website 1.0
Success Rate47.1
10
Web Navigation Task CompletionMind2Web Cross-Domain
Success Rate68.3
10
Showing 10 of 15 rows

Other info

Follow for update