Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Experience-based Knowledge Correction for Robust Planning in Minecraft

About

Large Language Model (LLM)-based planning has advanced embodied agents in long-horizon environments such as Minecraft, where acquiring latent knowledge of goal (or item) dependencies and feasible actions is critical. However, LLMs often begin with flawed priors and fail to correct them through prompting, even with feedback. We present XENON (eXpErience-based kNOwledge correctioN), an agent that algorithmically revises knowledge from experience, enabling robustness to flawed priors and sparse binary feedback. XENON integrates two mechanisms: Adaptive Dependency Graph, which corrects item dependencies using past successes, and Failure-aware Action Memory, which corrects action knowledge using past failures. Together, these components allow XENON to acquire complex dependencies despite limited guidance. Experiments across multiple Minecraft benchmarks show that XENON outperforms prior agents in both knowledge learning and long-horizon planning. Remarkably, with only a 7B open-weight LLM, XENON surpasses agents that rely on much larger proprietary models. Project page: https://sjlee-me.github.io/XENON

Seungjoon Lee, Suhwan Kim, Minhyeon Oh, Youngsik Yoon, Jungseul Ok• 2025

Related benchmarks

TaskDatasetResultRank
Long-horizon Task ExecutionMinecraft Long-horizon Tasks
Wood95
15
Long-horizon task success rateStandard MineRL
SR (Iron)24
4
Long-horizon task success rateModified MineRL
Success Rate (Iron)83
3
Showing 3 of 3 rows

Other info

Follow for update