Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Semantically Labelled Automata for Multi-Task Reinforcement Learning with LTL Instructions

About

We study multi-task reinforcement learning (RL), a setting in which an agent learns a single, universal policy capable of generalising to arbitrary, possibly unseen tasks. We consider tasks specified as linear temporal logic (LTL) formulae, which are commonly used in formal methods to specify properties of systems, and have recently been successfully adopted in RL. In this setting, we present a novel task embedding technique leveraging a new generation of semantic LTL-to-automata translations, originally developed for temporal synthesis. The resulting semantically labelled automata contain rich, structured information in each state that allow us to (i) compute the automaton efficiently on-the-fly, (ii) extract expressive task embeddings used to condition the policy, and (iii) naturally support full LTL. Experimental results in a variety of domains demonstrate that our approach achieves state-of-the-art performance and is able to scale to complex specifications where existing methods fail.

Alessandro Abate, Giuseppe De Giacomo, Mathias Jackermeier, Jan Kret\'insk\'y, Maximilian Prokop, Christoph Weinhuber• 2026

Related benchmarks

TaskDatasetResultRank
LTL Instruction FollowingLetter Finite-horizon (full)
Success Rate (SR)100
19
LTL Instruction FollowingZoneEnv Finite Horizon
Success Rate (SR)96
18
LTL Instruction FollowingZones Infinite-horizon (full)
µacc18.65
14
LTL Instruction FollowingLetterWorld Finite-horizon
Success Rate (SR)100
12
LTL Instruction FollowingLetter Infinite-horizon (full)
µAcc7.13
10
LTL-guided Reinforcement LearningZones Finite-horizon (test)
Success Rate98
10
LTL-guided Reinforcement LearningLetter Finite-horizon (test)
Success Rate (SR)100
9
LTL-guided Reinforcement LearningZones Infinite-horizon (test)
µacc18.65
7
LTL-guided Reinforcement LearningLetter Infinite-horizon (test)
µAcc7.13
5
LTL Instruction FollowingLetterWorld Infinite-horizon
µAcc11.67
4
Showing 10 of 11 rows

Other info

Follow for update