Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Sense Representations Are Inducible Interfaces

About

Sense representations (explicit, per-token meaning decompositions) are useful for disambiguation, steering, and cross-lingual alignment, but existing approaches require models to be pretrained with sense structure baked in. We introduce ACROS, which induces an explicit sense pathway into a frozen pretrained decoder LM through a gated residual addition. On SmolLM2-360M, ACROS preserves base LM quality while supporting three uses of the same induced variables: zero-shot word-sense disambiguation (64.95 F1 on Raganato ALL, competitive with the WordNet first-sense heuristic), low-KL lexical steering across 5,161 CoInCo cases where a simple non-oracle proxy recovers about 90% of positive shifts, and SENSIA cross-lingual adaptation to four languages (mean R@1 0.988, target FLORES PPL 7.94). ACROS makes sense representations an inducible interface for ordinary pretrained LMs.

Jan Christian Blaise Cruz, Alham Fikri Aji• 2026

Related benchmarks

TaskDatasetResultRank
Causal ReasoningXCOPA
Accuracy55.8
55
Story completionXStory
Likelihood Accuracy54.2
5
SummarizationXL-Sum (test)
ROUGE-L11
5
Word Sense DisambiguationRaganato ALL (test)
F1 Score64.95
5
Reading ComprehensionBelebele
Likelihood Accuracy23.1
5
Showing 5 of 5 rows

Other info

Follow for update