Experience-based Knowledge Correction for Robust Planning in Minecraft

About

Large Language Model (LLM)-based planning has advanced embodied agents in long-horizon environments such as Minecraft, where acquiring latent knowledge of goal (or item) dependencies and feasible actions is critical. However, LLMs often begin with flawed priors and fail to correct them through prompting, even with feedback. We present XENON (eXpErience-based kNOwledge correctioN), an agent that algorithmically revises knowledge from experience, enabling robustness to flawed priors and sparse binary feedback. XENON integrates two mechanisms: Adaptive Dependency Graph, which corrects item dependencies using past successes, and Failure-aware Action Memory, which corrects action knowledge using past failures. Together, these components allow XENON to acquire complex dependencies despite limited guidance. Experiments across multiple Minecraft benchmarks show that XENON outperforms prior agents in both knowledge learning and long-horizon planning. Remarkably, with only a 7B open-weight LLM, XENON surpasses agents that rely on much larger proprietary models. Project page: https://sjlee-me.github.io/XENON
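The two mechanisms named in the abstract can be illustrated with a small sketch. This is not XENON's actual implementation: the class names, the overwrite-on-success rule, and the `max_failures` threshold are all assumptions chosen only to show the general idea of correcting dependency knowledge from successes and action knowledge from failures.

```python
class DependencyGraph:
    """Hypothetical stand-in: maps an item to the items it is believed to require."""

    def __init__(self, prior):
        # The prior may be wrong (e.g. produced by an LLM).
        self.requires = {item: set(reqs) for item, reqs in prior.items()}
        self.verified = set()

    def record_success(self, item, consumed):
        # Assumption: a successful attempt shows what was actually used,
        # so it replaces the prior entry for that item.
        self.requires[item] = set(consumed)
        self.verified.add(item)


class FailureMemory:
    """Hypothetical stand-in: counts failed (item, action) pairs."""

    def __init__(self, max_failures=2):
        self.max_failures = max_failures
        self.failures = {}

    def record_failure(self, item, action):
        key = (item, action)
        self.failures[key] = self.failures.get(key, 0) + 1

    def allowed(self, item, candidates):
        # Drop actions that have already failed max_failures times for this item.
        return [
            a for a in candidates
            if self.failures.get((item, a), 0) < self.max_failures
        ]


# Illustrative usage with made-up prior knowledge.
graph = DependencyGraph({"stone_pickaxe": ["planks"]})  # flawed prior
graph.record_success("stone_pickaxe", ["cobblestone", "stick"])

memory = FailureMemory(max_failures=2)
memory.record_failure("iron_ore", "craft")
memory.record_failure("iron_ore", "craft")
remaining = memory.allowed("iron_ore", ["craft", "mine", "smelt"])
```

In this sketch only binary outcomes (success or failure) drive the updates, which mirrors the sparse binary feedback setting described above; how XENON itself revises its graph and memory is specified in the paper, not here.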

Seungjoon Lee, Suhwan Kim, Minhyeon Oh, Youngsik Yoon, Jungseul Ok • 2025

Related benchmarks

Task	Dataset	Result
Long-horizon Task Execution	Minecraft Long-horizon Tasks	Wood: 95	15
Long-horizon task success rate	Standard MineRL	SR (Iron): 24	4
Long-horizon task success rate	Modified MineRL	Success Rate (Iron): 83	3
