Probing Classifiers: Promises, Shortcomings, and Advances
About
Probing classifiers have emerged as one of the prominent methodologies for interpreting and analyzing deep neural network models of natural language processing. The basic idea is simple -- a classifier is trained to predict some linguistic property from a model's representations -- and has been used to examine a wide variety of models and properties. However, recent studies have demonstrated various methodological limitations of this approach. This article critically reviews the probing classifiers framework, highlighting their promises, shortcomings, and advances.
Yonatan Belinkov• 2021
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Question Answering | ARC-E | Accuracy84.32 | 416 | |
| Story completion | StoryCloze | Accuracy69.94 | 73 | |
| Multiple-choice Question Answering | ARC Easy (test) | Accuracy83.99 | 68 | |
| Question Answering | CommonsenseQA (test) | Accuracy81.98 | 60 | |
| Story Cloze Test | Story Cloze (test) | Accuracy77.61 | 56 | |
| Toxicity Detection | Toxigen | Score79.62 | 53 | |
| Multiple-choice Question Answering | ARC Challenge (test) | Accuracy75.55 | 44 | |
| Multiple-choice Question Answering | OpenBookQA (test) | Accuracy83.15 | 39 | |
| Boolean Question Answering | BoolQ (test) | -- | 38 | |
| Commonsense Question Answering | CoSQA (test) | Accuracy81.64 | 18 |
Showing 10 of 18 rows