MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models

About

Large Language Models (LLMs) have emerged as foundational infrastructure in the pursuit of Artificial General Intelligence (AGI). Despite their remarkable capabilities in language perception and generation, current LLMs fundamentally lack a unified and structured architecture for handling memory. They primarily rely on parametric memory (knowledge encoded in model weights) and ephemeral activation memory (context-limited runtime states). While emerging methods like Retrieval-Augmented Generation (RAG) incorporate plaintext memory, they lack lifecycle management and multi-modal integration, limiting their capacity for long-term knowledge evolution. To address this, we introduce MemOS, a memory operating system designed for LLMs that, for the first time, elevates memory to a first-class operational resource. It builds unified mechanisms for representation, organization, and governance across three core memory types: parametric, activation, and plaintext. At its core is the MemCube, a standardized memory abstraction that enables tracking, fusion, and migration of heterogeneous memory, while offering structured, traceable access across tasks and contexts. MemOS establishes a memory-centric execution framework with strong controllability, adaptability, and evolvability. It fills a critical gap in current LLM infrastructure and lays the groundwork for continual adaptation, personalized intelligence, and cross-platform coordination in next-generation intelligent systems.

Zhiyu Li, Shichao Song, Hanyu Wang, Simin Niu, Ding Chen, Jiawei Yang, Chenyang Xi, Huayi Lai, Jihao Zhao, Yezhaohui Wang, Junpeng Ren, Zehao Lin, Jiahao Huo, Tianyi Chen, Kai Chen, Kehang Li, Zhiqiang Yin, Qingchen Yu, Bo Tang, Hongkang Yang, Zhi-Qin John Xu, Feiyu Xiong• 2025

Related benchmarks

Task	Dataset	Result
Long-Term Conversational Memory	Locomo	Overall Acc (LoCoMo)75.8	65
Long-context Question Answering	Locomo	--	45
Multi-task Language Understanding	MMLU-Pro	MMLU Pro Engineering Acc64	41
Long-term memory evaluation	LongMemEval S (test)	KU (Knowledge Update)76.67	30
Mathematical Reasoning	AIME 25	Exact Match47	28
Single-turn Reasoning	AIME, GPQA, MMLU-Pro, ToolBench Aggregate	Average Score59	28
Mathematical Reasoning	AIME24	Exact Match47	28
Science Question Answering	GPQA	Exact Match55	28
Tool Use Reasoning	ToolBench	API Success Rate76	28
Long-term Agent Memory Evaluation	LongMemEval	SS-U93.7	15

Showing 10 of 26 rows

Other info

Follow for update

@wizwand_team Discord