Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Probing Classifiers: Promises, Shortcomings, and Advances

About

Probing classifiers have emerged as one of the prominent methodologies for interpreting and analyzing deep neural network models of natural language processing. The basic idea is simple -- a classifier is trained to predict some linguistic property from a model's representations -- and has been used to examine a wide variety of models and properties. However, recent studies have demonstrated various methodological limitations of this approach. This article critically reviews the probing classifiers framework, highlighting their promises, shortcomings, and advances.

Yonatan Belinkov• 2021

Related benchmarks

TaskDatasetResultRank
Question AnsweringARC-E
Accuracy84.32
523
Toxicity DetectionToxigen
Score79.62
95
Story completionStoryCloze
Accuracy69.94
80
Multiple-choice Question AnsweringARC Easy (test)
Accuracy83.99
68
Multiple-choice Question AnsweringOpenBookQA (test)
Accuracy83.15
61
Question AnsweringCommonsenseQA (test)
Accuracy81.98
60
Multiple-choice Question AnsweringARC Challenge (test)
Accuracy75.55
57
Story Cloze TestStory Cloze (test)
Accuracy77.61
56
Attribute SteeringSentiment S
Confidence Score98.5
52
Boolean Question AnsweringBoolQ (test)--
41
Showing 10 of 22 rows

Other info

Follow for update