Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training

About

Language tasks involving character-level manipulations (e.g., spelling corrections, arithmetic operations, word games) are challenging for models operating on subword units. To address this, we develop a causal intervention framework to learn robust and interpretable character representations inside subword-based language models. Our method treats each character as a typed variable in a causal model and learns such causal structures by adapting the interchange intervention training method of Geiger et al. (2021). We additionally introduce a suite of character-level tasks that systematically vary in their dependence on meaning and sequence-level context. While character-level models still perform best on purely form-based tasks like string reversal, our method outperforms character-level models on more complex tasks that blend form, meaning, and context, such as spelling correction in context and word search games. Compared with standard subword-based models, our approach also significantly improves robustness on unseen token sequences and leads to human-interpretable internal representations of characters.

Jing Huang, Zhengxuan Wu, Kyle Mahowald, Christopher Potts• 2022

Related benchmarks

TaskDatasetResultRank
Spelling CorrectionSingle Word Spelling Correction (IV)
Sequence Accuracy77.02
9
UnscrambleUnscramble (IV)
Accuracy (Seq Level)98.97
9
UnscrambleUnscramble OOV
Sequence Accuracy72.63
9
Spelling CorrectionSingle Word Spelling Correction (Real)
Accuracy (Seq-Level)0.5179
9
Spelling CorrectionSingle Word Spelling Correction (OOV)
Accuracy (Seq-Level)70.14
9
Spelling CorrectionSpelling Correction with Context Dependent Table 2.c
Sequence-level Accuracy46.55
7
Spelling CorrectionSpelling Correction with Context Independent Table 2.c
Sequence Accuracy67
7
Showing 7 of 7 rows

Other info

Code

Follow for update