SwiftMem: Fast Agentic Memory via Query-aware Indexing

About

Agentic memory systems have become critical for enabling LLM agents to maintain long-term context and retrieve relevant information efficiently. However, existing memory frameworks suffer from a fundamental limitation: they perform exhaustive retrieval across the entire storage layer regardless of query characteristics. This brute-force approach creates severe latency bottlenecks as memory grows, hindering real-time agent interactions. We propose SwiftMem, a query-aware agentic memory system that achieves sub-linear retrieval through specialized indexing over temporal and semantic dimensions. Our temporal index enables logarithmic-time range queries for time-sensitive retrieval, while the semantic DAG-Tag index maps queries to relevant topics through hierarchical tag structures. To address memory fragmentation during growth, we introduce an embedding-tag co-consolidation mechanism that reorganizes storage based on semantic clusters to improve cache locality. Experiments on LoCoMo and LongMemEval benchmarks demonstrate that SwiftMem achieves 47$\times$ faster search compared to state-of-the-art baselines while maintaining competitive accuracy, enabling practical deployment of memory-augmented LLM agents.

Anxin Tian, Yiming Li, Xing Li, Hui-Ling Zhen, Lei Chen, Xianzhi Yu, Zhenhua Dong, Mingxuan Yuan• 2026

Related benchmarks

Task	Dataset	Result
Multi-hop Question Answering	Locomo	F132	125
Open-domain Question Answering	Locomo	F10.258	111
Single-hop Question Answering	Locomo	F10.455	111
Temporal Question Answering	Locomo	F10.507	85
Long-context Reasoning	LoCoMo (test)	LLM Score65.2	7
Open Domain	LoCoMo (test)	LLM Score55.2	7
Temporal Reasoning	LoCoMo (test)	LLM Score0.685	7
Overall reasoning performance	LoCoMo (test)	LLM Score70.4	7
Single-Hop	LoCoMo (test)	LLM Score76.7	7

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord