Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents

About

Advancements in the capabilities of Large Language Models (LLMs) have created a promising foundation for developing autonomous agents. With the right tools, these agents could learn to solve tasks in new environments by accumulating and updating their knowledge. Current LLM-based agents process past experiences using a full history of observations, summarization, retrieval augmentation. However, these unstructured memory representations do not facilitate the reasoning and planning essential for complex decision-making. In our study, we introduce AriGraph, a novel method wherein the agent constructs and updates a memory graph that integrates semantic and episodic memories while exploring the environment. We demonstrate that our Ariadne LLM agent, consisting of the proposed memory architecture augmented with planning and decision-making, effectively handles complex tasks within interactive text game environments difficult even for human players. Results show that our approach markedly outperforms other established memory methods and strong RL baselines in a range of problems of varying complexity. Additionally, AriGraph demonstrates competitive performance compared to dedicated knowledge graph-based methods in static multi-hop question-answering.

Petr Anokhin, Nikita Semenov, Artyom Sorokin, Dmitry Evseev, Andrey Kravchenko, Mikhail Burtsev, Evgeny Burnaev• 2024

Related benchmarks

TaskDatasetResultRank
Question AnsweringMuSiQue (test)
EM45
76
Long-context ReasoningLocomo
Average F124.44
75
Long-context Question AnsweringLocomo
Single-Hop LLJ Score45.5
45
Question AnsweringHotpotQA (test)
EM68
18
Question AnsweringSpecsQA (test)
F1 (Factual Correctness)7.2
13
Question AnsweringSpecsQA
FC F17.2
13
LLM Agent Memory RetrievalLoCoMo v1 (full)
F1 (Multi-Hop)28.5
12
Showing 7 of 7 rows

Other info

Follow for update