MemGPT: Towards LLMs as Operating Systems

About

Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis. To enable the use of context beyond these limited windows, we propose virtual context management, a technique drawing inspiration from hierarchical memory systems in traditional operating systems, which provide the appearance of large memory resources through data movement between fast and slow memory. Using this technique, we introduce MemGPT (Memory-GPT), a system that intelligently manages different memory tiers in order to effectively provide extended context within the LLM's limited context window, and utilizes interrupts to manage control flow between itself and the user. We evaluate our OS-inspired design in two domains where the limited context windows of modern LLMs severely handicap their performance: document analysis, where MemGPT is able to analyze large documents that far exceed the underlying LLM's context window, and multi-session chat, where MemGPT can create conversational agents that remember, reflect, and evolve dynamically through long-term interactions with their users. We release MemGPT code and data for our experiments at https://memgpt.ai.
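
The idea of moving data between a small "fast" tier and a large "slow" tier can be illustrated with a toy sketch. This is not the MemGPT implementation: the class `VirtualContext`, its methods, the whitespace-based token count, and the substring-based recall are all hypothetical simplifications chosen only to show eviction from a bounded main context and paging back in on demand.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class VirtualContext:
    """Toy two-tier memory: a bounded main context plus unbounded external storage."""

    max_tokens: int
    main_context: deque = field(default_factory=deque)  # "fast" tier, inside the window
    external: list = field(default_factory=list)        # "slow" tier, outside the window

    @staticmethod
    def count_tokens(text: str) -> int:
        # Assumption: whitespace-separated words stand in for real tokens.
        return len(text.split())

    def used(self) -> int:
        """Total token count of everything currently in main context."""
        return sum(self.count_tokens(m) for m in self.main_context)

    def append(self, message: str) -> None:
        """Add a message; on overflow, evict the oldest messages to external storage."""
        self.main_context.append(message)
        while self.used() > self.max_tokens and len(self.main_context) > 1:
            self.external.append(self.main_context.popleft())

    def recall(self, query: str, limit: int = 1) -> list:
        """Page evicted messages that contain `query` back into main context."""
        hits = [m for m in self.external if query.lower() in m.lower()][:limit]
        for m in hits:
            self.external.remove(m)
            self.append(m)  # may evict older messages to make room
        return hits


ctx = VirtualContext(max_tokens=8)
ctx.append("my dog is named Biscuit")  # 5 tokens
ctx.append("I live in Lisbon")         # overflow: the first message is evicted
ctx.append("what a rainy day")         # fits exactly
print(list(ctx.main_context), ctx.external)
print(ctx.recall("biscuit"))           # pages the evicted message back in
```

In this sketch the caller decides when to call `recall`; the abstract instead describes a system that manages the tiers itself, so the explicit call here is only a stand-in for that behavior.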

Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G. Patil, Ion Stoica, Joseph E. Gonzalez • 2023

Related benchmarks

Task	Dataset	Result
Long-context Question Answering	Locomo	F1 (Multi Hop) 30.36	171
Long-term memory evaluation	Locomo	Overall F1 40.4	128
Multi-hop Question Answering	Locomo	F1 30.36	125
Single-hop Question Answering	Locomo	F1 0.6018	111
Open-domain Question Answering	Locomo	F1 0.1224	111
Long-context Memory Evaluation	LongMemEval	Average Score 29.16	103
Temporal Question Answering	Locomo	F1 0.2552	85
Long-context Reasoning	Locomo	Average F1 30.88	75
Multi-hop Reasoning	Locomo	F1 Score 26.65	68
Long-context Conversational Question Answering	Locomo	Multi-Hop F1 30.36	59

Showing 10 of 88 rows
