How is BERT surprised? Layerwise detection of linguistic anomalies
About
Transformer language models have shown remarkable ability to detect when a word is anomalous in context, but likelihood scores offer no information about the cause of the anomaly. In this work, we use Gaussian models for density estimation at intermediate layers of three language models (BERT, RoBERTa, and XLNet), and evaluate our method on BLiMP, a grammaticality judgement benchmark. In lower layers, surprisal is highly correlated with low token frequency, but this correlation diminishes in upper layers. Next, we gather datasets of morphosyntactic, semantic, and commonsense anomalies from psycholinguistic studies; we find that the best-performing model, RoBERTa, exhibits surprisal at earlier layers for morphosyntactic anomalies than for semantic ones, while commonsense anomalies do not exhibit surprisal at any intermediate layer. These results suggest that language models employ separate mechanisms to detect different types of linguistic anomalies.
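The layerwise approach described above can be sketched as follows: fit a multivariate Gaussian to a layer's contextual token embeddings from in-domain text, then score new tokens by their negative log-density, which reduces (up to a constant) to squared Mahalanobis distance. This is a minimal, hypothetical sketch using random vectors in place of actual BERT/RoBERTa hidden states; the function names and the regularization constant are illustrative assumptions, not the paper's exact implementation.

```python
import numpy as np

def fit_gaussian(X):
    # Fit a multivariate Gaussian (mean and covariance) to in-domain
    # token embeddings X of shape (num_tokens, hidden_dim).
    mu = X.mean(axis=0)
    # Small diagonal regularizer (an assumption) keeps the covariance invertible.
    cov = np.cov(X, rowvar=False) + 1e-6 * np.eye(X.shape[1])
    return mu, np.linalg.inv(cov)

def surprisal(x, mu, cov_inv):
    # Negative Gaussian log-density up to an additive constant:
    # the squared Mahalanobis distance of x from the fitted distribution.
    d = x - mu
    return float(d @ cov_inv @ d)

rng = np.random.default_rng(0)
# Stand-in for one layer's token embeddings; in practice these would be
# hidden states extracted from a transformer at a chosen layer.
X = rng.normal(size=(500, 8))
mu, cov_inv = fit_gaussian(X)

typical_score = surprisal(X[0], mu, cov_inv)          # in-distribution token
anomalous_score = surprisal(np.full(8, 5.0), mu, cov_inv)  # far-from-distribution token
```

Repeating this per layer yields a surprisal profile over depth, which is how one could compare where different anomaly types first register.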
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Commonsense Anomaly Detection | Warren Commonsense | Accuracy | 75 | 6 |
| Morphosyntax Anomaly Detection | BLiMP Subject-Verb | Accuracy | 97.1 | 6 |
| Morphosyntax Anomaly Detection | Osterhout and Nicol Morphosyntax | Accuracy | 100 | 6 |
| Semantic Anomaly Detection | BLiMP Animacy | Accuracy | 76.7 | 6 |
| Morphosyntax Anomaly Detection | BLiMP Det-Noun | Accuracy | 98.3 | 6 |
| Semantic Anomaly Detection | Pylkkänen and McElree Semantic | Accuracy | 93.2 | 6 |
| Semantic Anomaly Detection | Warren Semantic | Accuracy | 94.4 | 6 |
| Semantic Anomaly Detection | Osterhout and Nicol Semantic | Accuracy | 84.1 | 6 |
| Semantic Anomaly Detection | Osterhout and Mobley Semantic | Accuracy | 90.6 | 6 |
| Commonsense Anomaly Detection | Federmeier and Kutas Commonsense | Accuracy | 62.5 | 6 |