
$\texttt{SEM-CTRL}$: Semantically Controlled Decoding

About

Ensuring both syntactic and semantic correctness in Large Language Model (LLM) outputs remains a significant challenge, despite being critical for real-world deployment. In this paper, we introduce $\texttt{SEM-CTRL}$, a unified approach that enforces rich context-sensitive constraints, along with task- and instance-specific semantics, directly on the LLM decoder. Our approach integrates token-level Monte Carlo Tree Search (MCTS) guided by specific syntactic and semantic constraints. The constraints over desired outputs are expressed using Answer Set Grammars, a logic-based formalism that generalizes context-sensitive grammars while incorporating background knowledge to represent task-specific semantics. We show that our approach helps guarantee valid completions for any off-the-shelf LLM without the need for fine-tuning. We evaluate $\texttt{SEM-CTRL}$ on a range of tasks, including synthetic grammar synthesis, combinatorial reasoning, JSON parsing, and planning. Our experimental results demonstrate that $\texttt{SEM-CTRL}$ allows even small pre-trained LLMs to efficiently outperform larger variants and state-of-the-art reasoning models (e.g., $\textit{o4-mini}$) while simultaneously guaranteeing semantic validity.
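
To make the idea of enforcing a context-sensitive constraint at decoding time concrete, the following is a minimal Python sketch for the language $a^n b^n c^n$. It is not the $\texttt{SEM-CTRL}$ implementation: a hand-written prefix checker stands in for an Answer Set Grammar, a fixed toy scoring function stands in for the LLM, and greedy selection over the allowed tokens replaces token-level MCTS. All names (`is_valid_prefix`, `is_complete`, `toy_score`, `constrained_decode`, `max_n`) are hypothetical and chosen only for illustration.

```python
from typing import Callable, List, Optional, Tuple

SYMBOLS = ["a", "b", "c"]
EOS = "<eos>"


def _counts(tokens: List[str]) -> Optional[Tuple[int, int, int]]:
    """Return (i, j, k) if tokens spell a^i b^j c^k, else None."""
    s = "".join(tokens)
    i = len(s) - len(s.lstrip("a"))
    rest = s[i:]
    j = len(rest) - len(rest.lstrip("b"))
    rest = rest[j:]
    k = len(rest) - len(rest.lstrip("c"))
    if rest[k:]:  # symbols left over, so the order a..b..c was violated
        return None
    return i, j, k


def is_valid_prefix(tokens: List[str], max_n: int) -> bool:
    """True if tokens can still be extended to a^n b^n c^n with 1 <= n <= max_n."""
    counts = _counts(tokens)
    if counts is None:
        return False
    i, j, k = counts
    if i > max_n or j > i:
        return False
    if k > 0 and (j != i or k > j):
        return False
    return True


def is_complete(tokens: List[str]) -> bool:
    """True if tokens spell exactly a^n b^n c^n for some n >= 1."""
    counts = _counts(tokens)
    return counts is not None and counts[0] >= 1 and counts[0] == counts[1] == counts[2]


def toy_score(prefix: List[str], token: str) -> float:
    """Stand-in for an LLM: a fixed preference that ignores the prefix (assumption)."""
    return {"a": 0.5, "b": 0.3, "c": 0.15, EOS: 0.05}[token]


def constrained_decode(
    score: Callable[[List[str], str], float], max_n: int = 3, max_len: int = 32
) -> str:
    """Greedy decoding that only ever picks tokens keeping the prefix valid."""
    out: List[str] = []
    while len(out) < max_len:
        # Mask out every token that would make the prefix unextendable.
        allowed = [t for t in SYMBOLS if is_valid_prefix(out + [t], max_n)]
        if is_complete(out):
            allowed.append(EOS)
        if not allowed:
            break
        best = max(allowed, key=lambda t: score(out, t))
        if best == EOS:
            break
        out.append(best)
    return "".join(out)


if __name__ == "__main__":
    print(constrained_decode(toy_score, max_n=3))  # prints aaabbbccc
```

Left unconstrained, the toy scorer would emit `a` forever; with the mask it is forced through `b` and `c` in matching counts. For this particular language every valid prefix has a valid continuation, so greedy selection never reaches a dead end. Richer semantic constraints can admit prefixes that later turn out to be unsatisfiable, which is one reason a search procedure such as MCTS is useful in place of the greedy step shown here.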

Mohammad Albinhassan, Pranava Madhyastha, Alessandra Russo • 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Synthetic Grammar Synthesis | Synthetic Grammar Synthesis $a^n b^n c^n$ | Accuracy | 100 | 29 |
| Combinatorial Reasoning | Combinatorial Reasoning | Graph Accuracy | 100 | 12 |
| Grammar Synthesis | Synthetic Grammar Synthesis | Accuracy ($a^n b^n c^n$) | 100 | 12 |
| Planning | Blocks (Blocksworld) | Accuracy | 96.8 | 12 |
| Synthetic Grammar Synthesis | Synthetic Grammar Synthesis ($a^m b^n c^m d^n$) | Accuracy | 100 | 9 |
| Combinatorial Reasoning | Graph Coloring | Accuracy | 100 | 9 |
| Combinatorial Reasoning | Sudoku 3x3 | Accuracy | 100 | 9 |
| Combinatorial Reasoning | Sudoku 4x4 | Accuracy | 100 | 9 |
| Planning Efficiency | Blocksworld Planning | Ntokens | 589.3 | 4 |
| Reasoning Efficiency | Syllogism Generation Task (SGS) | Ntokens | 250.2 | 4 |
