Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Formal Semantic Control over Language Models

About

This thesis advances semantic representation learning to render language representations or models more semantically and geometrically interpretable, and to enable localised, quasi-symbolic, compositional control through deliberate shaping of their latent space geometry. We pursue this goal within a VAE framework, exploring two complementary research directions: (i) Sentence-level learning and control: disentangling and manipulating specific semantic features in the latent space to guide sentence generation, with explanatory text serving as the testbed; and (ii) Reasoning-level learning and control: isolating and steering inference behaviours in the latent space to control NLI. In this direction, we focus on Explanatory NLI tasks, in which two premises (explanations) are provided to infer a conclusion. The overarching objective is to move toward language models whose internal semantic representations can be systematically interpreted, precisely structured, and reliably directed. We introduce a set of novel theoretical frameworks and practical methodologies, together with corresponding experiments, to demonstrate that our approaches enhance both the interpretability and controllability of latent spaces for natural language across the thesis.

Yingji Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Conclusion GenerationEntailmentBank (test)
BLEU42
26
Mathematical ReasoningMathematics out-of-domain (test)--
26
Sentence Interpolation SmoothnessARGO randomly sampled 200 sentence pairs
Average IS0.282
22
AutoencodingMathematical expressions EVAL (test)
BLEU98
22
Natural Language InferenceEntailmentBank (test)--
20
Language modellingExplanatory sentences
BLEU65
19
Language modellingMathematical expression EVAL (test)
Exact Match100
19
DisentanglementARG0
Accuracy98
18
AutoencodingExplanatory sentences (test)
BLEU82
13
Explanatory InferenceEntailmentBank
BLEU46
12
Showing 10 of 22 rows

Other info

Follow for update