
Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents

About

Memory-based self-evolution has emerged as a promising paradigm for coding agents. However, existing approaches typically restrict memory utilization to homogeneous task domains, failing to leverage the shared infrastructural foundations, such as runtime environments and programming languages, that exist across diverse real-world coding problems. To address this limitation, we investigate Memory Transfer Learning (MTL) by harnessing a unified memory pool from heterogeneous domains. We evaluate performance across six coding benchmarks using four memory representations, ranging from concrete traces to abstract insights. Our experiments demonstrate that cross-domain memory improves average performance by 3.7%, primarily by transferring meta-knowledge, such as validation routines, rather than task-specific code. Importantly, we find that abstraction dictates transferability; high-level insights generalize well, whereas low-level traces often induce negative transfer due to excessive specificity. Furthermore, we show that transfer effectiveness scales with the size of the memory pool, and memory can be transferred even between different models. Our work establishes empirical design principles for expanding memory utilization beyond single-domain silos. Project page: https://memorytransfer.github.io/
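To make the idea of a unified memory pool more concrete, the following is a minimal sketch and not the authors' implementation. The `Memory` record, the `level` labels ("trace" and "insight"), and the `retrieve` function are all hypothetical names introduced for illustration, and the token-overlap (Jaccard) scoring is a stand-in for whatever retrieval method the paper actually uses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Memory:
    """One entry in a shared pool (hypothetical schema, not from the paper)."""
    domain: str  # benchmark or task domain the memory was collected from
    level: str   # assumed labels: "trace" (concrete) or "insight" (abstract)
    text: str    # the stored content


def tokens(s: str) -> set[str]:
    """Lowercased whitespace tokens; a deliberately crude text representation."""
    return set(s.lower().split())


def retrieve(pool: list[Memory], query: str, k: int = 2) -> list[Memory]:
    """Return the k memories most similar to the query, ignoring domain.

    Similarity is Jaccard overlap of tokens, an assumption made here
    only to keep the sketch self-contained.
    """
    q = tokens(query)

    def score(m: Memory) -> float:
        t = tokens(m.text)
        union = q | t
        return len(q & t) / len(union) if union else 0.0

    return sorted(pool, key=score, reverse=True)[:k]


pool = [
    Memory("terminal", "insight", "run the tests before submitting any change"),
    Memory("ml", "trace", "pip install torch==2.1 then python train.py"),
    Memory("swe", "insight", "read the failing test first"),
]

# A query from one domain can surface an insight collected in another.
best = retrieve(pool, "run the tests before submitting the patch", k=1)[0]
print(best.domain, best.level)
```

Because retrieval does not filter on `domain`, an abstract insight recorded in one domain can be returned for a task in another, which is the kind of cross-domain reuse the abstract describes.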

Kangsan Kim, Minki Kang, Taeil Kim, Yanlai Yang, Mengye Ren, Sung Ju Hwang • 2026

Related benchmarks

Task                              Dataset             Result        Rank
Terminal task completion          Terminal-bench 2.0  Pass@1 28.8   52
Code Generation                   Aider-Polyglot      Pass@1 44.7   19
Software Engineering              SWE-bench Verified  --            18
Code Generation                   ReplicationBench    Pass@3 27.8   13
Code Generation                   LiveCodeBench v6    Pass@3 94     9
Code Generation                   SWE-bench Verified  Pass@3 77     9
Code Generation                   TerminalBench 2     Pass@3 39.3   9
Code Generation                   MLGym-Bench         Pass@3 75     9
Machine Learning Task Automation  MLGym-Bench         Pass@1 63.9   9
Research Replication              ReplicationBench    Pass@1 17.8   9
Showing 10 of 13 rows
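
The results above are reported as Pass@1 and Pass@3. This page does not say how those scores were computed, so the sketch below is only one common choice: the widely used unbiased pass@k estimator, which gives the probability that at least one of k samples drawn from n attempts is correct when c of the n attempts passed. The function name `pass_at_k` is an assumption for illustration.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate: 1 - C(n - c, k) / C(n, k).

    n: total attempts sampled for a task
    c: number of those attempts that passed
    k: number of attempts the metric allows
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    if n - c < k:
        # Fewer than k failures exist, so any k draws include a success.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


# One passing attempt out of three:
print(round(pass_at_k(n=3, c=1, k=1), 4))  # 0.3333
print(pass_at_k(n=3, c=1, k=3))            # 1.0
```

A benchmark-level score would then be the mean of this per-task value over all tasks, usually expressed as a percentage.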
