AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents

About

Advancements in the capabilities of Large Language Models (LLMs) have created a promising foundation for developing autonomous agents. With the right tools, these agents could learn to solve tasks in new environments by accumulating and updating their knowledge. Current LLM-based agents process past experiences using a full history of observations, summarization, retrieval augmentation. However, these unstructured memory representations do not facilitate the reasoning and planning essential for complex decision-making. In our study, we introduce AriGraph, a novel method wherein the agent constructs and updates a memory graph that integrates semantic and episodic memories while exploring the environment. We demonstrate that our Ariadne LLM agent, consisting of the proposed memory architecture augmented with planning and decision-making, effectively handles complex tasks within interactive text game environments difficult even for human players. Results show that our approach markedly outperforms other established memory methods and strong RL baselines in a range of problems of varying complexity. Additionally, AriGraph demonstrates competitive performance compared to dedicated knowledge graph-based methods in static multi-hop question-answering.

Petr Anokhin, Nikita Semenov, Artyom Sorokin, Dmitry Evseev, Andrey Kravchenko, Mikhail Burtsev, Evgeny Burnaev• 2024

Related benchmarks

Task	Dataset	Result
Question Answering	MuSiQue (test)	EM45	76
Long-context Reasoning	Locomo	Average F124.44	75
Long-context Question Answering	Locomo	Single-Hop LLJ Score45.5	45
Question Answering	HotpotQA (test)	EM68	18
Question Answering	SpecsQA (test)	F1 (Factual Correctness)7.2	13
Question Answering	SpecsQA	FC F17.2	13
LLM Agent Memory Retrieval	LoCoMo v1 (full)	F1 (Multi-Hop)28.5	12

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord