Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Zero-Shot Instruction Following in RL via Structured LTL Representations

About

Linear temporal logic (LTL) is a compelling framework for specifying complex, structured tasks for reinforcement learning (RL) agents. Recent work has shown that interpreting LTL instructions as finite automata, which can be seen as high-level programs monitoring task progress, enables learning a single generalist policy capable of executing arbitrary instructions at test time. However, existing approaches fall short in environments where multiple high-level events (i.e., atomic propositions) can be true at the same time and potentially interact in complicated ways. In this work, we propose a novel approach to learning a multi-task policy for following arbitrary LTL instructions that addresses this shortcoming. Our method conditions the policy on sequences of simple Boolean formulae, which directly align with transitions in the automaton, and are encoded via a graph neural network (GNN) to yield structured task representations. Experiments in a complex chess-based environment demonstrate the advantages of our approach.

Mattia Giuri, Mathias Jackermeier, Alessandro Abate• 2025

Related benchmarks

TaskDatasetResultRank
LTL Instruction FollowingChessWorld infinite-horizon ϕ∞ 1
Success Rate86
3
LTL Instruction FollowingChessWorld infinite-horizon ϕ∞ 2
Success Rate0.767
3
LTL Instruction FollowingChessWorld finite-horizon phi1 (test)
SR99.3
3
LTL Instruction FollowingChessWorld finite-horizon phi2 (test)
Success Rate95.2
3
LTL Instruction FollowingChessWorld finite-horizon phi3 (test)
SR82.6
3
LTL Instruction FollowingChessWorld finite-horizon phi4 (test)
SR0.927
3
LTL Instruction FollowingChessWorld finite-horizon phi5 (test)
Success Rate74.3
3
LTL Instruction FollowingChessWorld finite-horizon, phi6 (test)
Success Rate93.6
3
LTL Instruction FollowingChessWorld infinite-horizon ϕ∞ GF
Success Rate92.8
3
LTL Instruction FollowingChessWorld finite-horizon phi7 (test)
Success Rate91
3
Showing 10 of 10 rows

Other info

Follow for update