Grounding Complex Natural Language Commands for Temporal Tasks in Unseen Environments

About

Grounding navigational commands to linear temporal logic (LTL) leverages its unambiguous semantics for reasoning about long-horizon tasks and verifying the satisfaction of temporal constraints. Existing approaches require training data from the specific environment and landmarks that will be used in natural language to understand commands in those environments. We propose Lang2LTL, a modular system and a software package that leverages large language models (LLMs) to ground temporal navigational commands to LTL specifications in environments without prior language data. We comprehensively evaluate Lang2LTL for five well-defined generalization behaviors. Lang2LTL demonstrates the state-of-the-art ability of a single model to ground navigational commands to diverse temporal specifications in 21 city-scaled environments. Finally, we demonstrate a physical robot using Lang2LTL can follow 52 semantically diverse navigational commands in two indoor environments.

Jason Xinyu Liu, Ziyi Yang, Ifrah Idrees, Sam Liang, Benjamin Schornstein, Stefanie Tellex, Ankit Shah• 2023

Related benchmarks

Task	Dataset	Result
Contextually grounded translation	VERIFY corpus	SemEq1.4	11
Predicate Grounding	Traffic Light domain	F1 Score86.2	9
Predicate Grounding	Search and Rescue domain	F1 Score77.6	9
Argument Grounding	Warehouse domain	F1 Score61.8	5
NL to LTL translation	182 NL tasks distribution D	Success Rate82.87	5
NL-to-TL Translation	Traffic Light	LE Accuracy100	4
NL-to-TL Translation	Warehouse	LE Accuracy100	4
NL-to-TL Translation	Search and Rescue	LE100	4

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord