Sense Representations Are Inducible Interfaces
About
Sense representations (explicit, per-token meaning decompositions) are useful for disambiguation, steering, and cross-lingual alignment, but existing approaches require models to be pretrained with sense structure baked in. We introduce ACROS, which induces an explicit sense pathway into a frozen pretrained decoder LM through a gated residual addition. On SmolLM2-360M, ACROS preserves base LM quality while supporting three uses of the same induced variables: zero-shot word-sense disambiguation (64.95 F1 on Raganato ALL, competitive with the WordNet first-sense heuristic), low-KL lexical steering across 5,161 CoInCo cases where a simple non-oracle proxy recovers about 90% of positive shifts, and SENSIA cross-lingual adaptation to four languages (mean R@1 0.988, target FLORES PPL 7.94). ACROS makes sense representations an inducible interface for ordinary pretrained LMs.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Causal Reasoning | XCOPA | Accuracy55.8 | 55 | |
| Story completion | XStory | Likelihood Accuracy54.2 | 5 | |
| Summarization | XL-Sum (test) | ROUGE-L11 | 5 | |
| Word Sense Disambiguation | Raganato ALL (test) | F1 Score64.95 | 5 | |
| Reading Comprehension | Belebele | Likelihood Accuracy23.1 | 5 |