Sense Representations Are Inducible Interfaces

About

Sense representations (explicit, per-token meaning decompositions) are useful for disambiguation, steering, and cross-lingual alignment, but existing approaches require models to be pretrained with sense structure baked in. We introduce ACROS, which induces an explicit sense pathway into a frozen pretrained decoder LM through a gated residual addition. On SmolLM2-360M, ACROS preserves base LM quality while supporting three uses of the same induced variables: zero-shot word-sense disambiguation (64.95 F1 on Raganato ALL, competitive with the WordNet first-sense heuristic), low-KL lexical steering across 5,161 CoInCo cases where a simple non-oracle proxy recovers about 90% of positive shifts, and SENSIA cross-lingual adaptation to four languages (mean R@1 0.988, target FLORES PPL 7.94). ACROS makes sense representations an inducible interface for ordinary pretrained LMs.

Jan Christian Blaise Cruz, Alham Fikri Aji• 2026

Related benchmarks

Task	Dataset	Result
Causal Reasoning	XCOPA	Accuracy55.8	55
Story completion	XStory	Likelihood Accuracy54.2	5
Summarization	XL-Sum (test)	ROUGE-L11	5
Word Sense Disambiguation	Raganato ALL (test)	F1 Score64.95	5
Reading Comprehension	Belebele	Likelihood Accuracy23.1	5

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord