Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GinSign: Grounding Natural Language Into System Signatures for Temporal Logic Translation

About

Natural language (NL) to temporal logic (TL) translation enables engineers to specify, verify, and enforce system behaviors without manually crafting formal specifications-an essential capability for building trustworthy autonomous systems. While existing NL-to-TL translation frameworks have demonstrated encouraging initial results, these systems either explicitly assume access to accurate atom grounding or suffer from low grounded translation accuracy. In this paper, we propose a framework for Grounding Natural Language Into System Signatures for Temporal Logic translation called GinSign. The framework introduces a grounding model that learns the abstract task of mapping NL spans onto a given system signature: given a lifted NL specification and a system signature $\mathcal{S}$, the classifier must assign each lifted atomic proposition to an element of the set of signature-defined atoms $\mathcal{P}$. We decompose the grounding task hierarchically -- first predicting predicate labels, then selecting the appropriately typed constant arguments. Decomposing this task from a free-form generation problem into a structured classification problem permits the use of smaller masked language models and eliminates the reliance on expensive LLMs. Experiments across multiple domains show that frameworks which omit grounding tend to produce syntactically correct lifted LTL that is semantically nonequivalent to grounded target expressions, whereas our framework supports downstream model checking and achieves grounded logical-equivalence scores of $95.5\%$, a $1.4\times$ improvement over SOTA.

William English, Chase Walker, Dominic Simon, Rickard Ewetz• 2025

Related benchmarks

TaskDatasetResultRank
Predicate GroundingTraffic Light domain
F1 Score100
9
Predicate GroundingSearch and Rescue domain
F1 Score100
9
Argument GroundingWarehouse domain
F1 Score94.2
5
Predicate GroundingWarehouse domain
F1 Score (%)100
4
NL-to-TL TranslationTraffic Light
LE Accuracy100
4
NL-to-TL TranslationWarehouse
LE Accuracy100
4
NL-to-TL TranslationSearch and Rescue
LE100
4
Showing 7 of 7 rows

Other info

Follow for update