[ICLR 2026]
Experience-based Knowledge Correction
for Robust Planning in Minecraft

Seungjoon Lee¹, Suhwan Kim¹, Minhyeon Oh¹, Youngsik Yoon¹, Jungseul Ok^1,2

Department of Computer Science & Engineering, POSTECH¹
Graduate School of Artificial Intelligence, POSTECH²

OpenReview Code arXiv

TL;DR

We introduce XENON, an LLM-based agent that robustly learns knowledge to achieve diverse goals, by autonomously interacting with an environment and correcting its own knowledge.

Why XENON?

1. LLMs often have flawed prior knowledge, resulting in failures to achieve goals (i.e., items in Minecraft).

LLMs often begin with flawed knowledge about dependencies (i.e., requirements) and actions, both of which are essential knowledge for successful planning to achieve goals.

2. LLMs cannot robustly correct their flawed knowledge through simple prompting.

Even when prompted to self-correct with failure feedback, LLMs fail to correct their flawed knowledge.

XENON: Experience-based knowledge correction

XENON corrects its knowledge using its own experience, without additional LLM inference for knowledge correction.
[left] Adaptive Dependency Graph (ADG) is initialized with an LLM's dependency predictions for goal items and revises flawed dependency knowledge. Case 1: If an item is permanently failed to be obtained, it is identified as a hallucinated item and its descendants are recursively revised to remove the flawed dependency. Case 2: A flawed dependency is revised by referencing similar, successfully obtained items.
[right] Failure-aware Action Memory (FAM) corrects action knowledge using past failures, preventing the agent from repeating failed actions.

Dependency and action learning experiments

The y-axis shows the fraction of goal items whose dependencies are correctly learned.
XENON robustly learns knowledge against flawed priors, outperforming previous state-of-the-art agents in dependency and action learning.

Long-horizon goal planning experiments

The numbers in the table show average success rate to achieve goals in MineRL environment.
XENON significantly outperforms previous state-of-the-art agents in long-horizon goal planning.

Citation


        @inproceedings{lee2026experience,
            title={Experience-based Knowledge Correction for Robust Planning in Minecraft},
            author={Seungjoon Lee and Suhwan Kim and Minhyeon Oh and Youngsik Yoon and Jungseul Ok},
            booktitle={The Fourteenth International Conference on Learning Representations},
            year={2026},
            url={https://openreview.net/forum?id=N22lDHYrXe}
        }

[ICLR 2026]Experience-based Knowledge Correctionfor Robust Planning in Minecraft

TL;DR

We introduce XENON, an LLM-based agent that robustly learns knowledge to achieve diverse goals, by autonomously interacting with an environment and correcting its own knowledge.

Why XENON?

1. LLMs often have flawed prior knowledge, resulting in failures to achieve goals (i.e., items in Minecraft).

LLMs often begin with flawed knowledge about dependencies (i.e., requirements) and actions, both of which are essential knowledge for successful planning to achieve goals.

2. LLMs cannot robustly correct their flawed knowledge through simple prompting.

Even when prompted to self-correct with failure feedback, LLMs fail to correct their flawed knowledge.

XENON: Experience-based knowledge correction

Dependency and action learning experiments

The y-axis shows the fraction of goal items whose dependencies are correctly learned. XENON robustly learns knowledge against flawed priors, outperforming previous state-of-the-art agents in dependency and action learning.

Long-horizon goal planning experiments

The numbers in the table show average success rate to achieve goals in MineRL environment. XENON significantly outperforms previous state-of-the-art agents in long-horizon goal planning.

Citation

[ICLR 2026]
Experience-based Knowledge Correction
for Robust Planning in Minecraft

The y-axis shows the fraction of goal items whose dependencies are correctly learned.
XENON robustly learns knowledge against flawed priors, outperforming previous state-of-the-art agents in dependency and action learning.

The numbers in the table show average success rate to achieve goals in MineRL environment.
XENON significantly outperforms previous state-of-the-art agents in long-horizon goal planning.