Memory Transfer Continuity on Pilot study N=50 tasks
[Chart: Coding Accuracy. Portable Agent Memory: 87]
May 10, 2026
Updated 21d ago
Evaluation Results
| Method | Configuration | Date | Coding Accuracy | Q&A Accuracy | Planning Accuracy | Mean Score |
|---|---|---|---|---|---|---|
| Portable Agent Memory | Transfer Pair=Claude-3... | 2026.05 | 87 | 92 | 85 | 88 |
| Portable Agent Memory | Transfer Pair=Gemini-1... | 2026.05 | 85 | 91 | 83 | 86 |
| Portable Agent Memory | Transfer Pair=GPT-4-Tu... | 2026.05 | 83 | 89 | 81 | 84 |
| No memory (baseline) | context=no historical... | 2026.05 | 31 | 45 | 28 | 35 |
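The page does not say how Mean Score is derived. As a rough check, the listed values are consistent with an unweighted mean of the three accuracy columns, rounded to the nearest integer. The sketch below works under that assumption; the function name `mean_score` and the row labels are illustrative and not taken from the benchmark.

```python
# Scores copied from the results above: (coding, qa, planning, listed_mean).
# Row labels keep the truncated configuration names exactly as shown on the page.
ROWS = {
    "Portable Agent Memory, Transfer Pair=Claude-3...": (87, 92, 85, 88),
    "Portable Agent Memory, Transfer Pair=Gemini-1...": (85, 91, 83, 86),
    "Portable Agent Memory, Transfer Pair=GPT-4-Tu...": (83, 89, 81, 84),
    "No memory (baseline), context=no historical...": (31, 45, 28, 35),
}


def mean_score(coding: int, qa: int, planning: int) -> int:
    """Unweighted mean of the three accuracies, rounded to the nearest integer.

    This formula is an assumption; the source only lists the resulting values.
    """
    return round((coding + qa + planning) / 3)


def check_rows(rows: dict) -> dict:
    """Map each row label to True if the assumed formula matches the listed mean."""
    return {
        label: mean_score(c, q, p) == listed
        for label, (c, q, p, listed) in rows.items()
    }


if __name__ == "__main__":
    for label, ok in check_rows(ROWS).items():
        print(f"{label}: {'matches' if ok else 'differs'}")
```

If the benchmark actually weights the three task types differently, the agreement here would be a coincidence of rounding, so treat this only as a sanity check on the table.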