Understanding intermediate layers using linear classifier probes

About

Neural network models have a reputation for being black boxes. We propose to monitor the features at every layer of a model and measure how suitable they are for classification. We use linear classifiers, which we refer to as "probes", trained entirely independently of the model itself. This helps us better understand the roles and dynamics of the intermediate layers. We demonstrate how this can be used to develop a better intuition about models and to diagnose potential problems. We apply this technique to the popular models Inception v3 and Resnet-50. Among other things, we observe experimentally that the linear separability of features increases monotonically along the depth of the model.

Guillaume Alain, Yoshua Bengio • 2016
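
The probing idea described in the abstract can be illustrated with a minimal sketch. It assumes a toy frozen ReLU network with random weights standing in for a real model (not Inception v3 or Resnet-50), and every function name here (train_linear_probe, probe_error, layer_features, probe_every_layer) is hypothetical, not taken from the authors' code. A separate softmax linear classifier is fitted on the features of each layer while the network weights stay untouched, and the held-out error of each probe is reported by depth.

```python
import numpy as np


def train_linear_probe(features, labels, num_classes, lr=0.1, epochs=300):
    """Fit a softmax linear classifier on fixed features; return (W, b)."""
    n, d = features.shape
    W = np.zeros((d, num_classes))
    b = np.zeros(num_classes)
    onehot = np.eye(num_classes)[labels]
    for _ in range(epochs):
        logits = features @ W + b
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        probs = np.exp(logits)
        probs /= probs.sum(axis=1, keepdims=True)
        grad = (probs - onehot) / n  # gradient of mean cross-entropy w.r.t. logits
        W -= lr * (features.T @ grad)
        b -= lr * grad.sum(axis=0)
    return W, b


def probe_error(features, labels, W, b):
    """Fraction of examples the linear probe misclassifies."""
    preds = (features @ W + b).argmax(axis=1)
    return float((preds != labels).mean())


def layer_features(x, weights):
    """Return [input, h1, h2, ...] for a frozen ReLU network (assumed toy model)."""
    feats = [x]
    h = x
    for W in weights:
        h = np.maximum(h @ W, 0.0)
        feats.append(h)
    return feats


def probe_every_layer(x, y, weights, num_classes):
    """Train one independent probe per layer; return held-out error per depth."""
    half = len(x) // 2
    errors = []
    for feats in layer_features(x, weights):
        # Standardize with statistics from the probe-training half only.
        mu = feats[:half].mean(axis=0)
        sd = feats[:half].std(axis=0) + 1e-8
        z = (feats - mu) / sd
        W, b = train_linear_probe(z[:half], y[:half], num_classes)
        errors.append(probe_error(z[half:], y[half:], W, b))
    return errors


# Toy data: an XOR-like labelling that is not linearly separable in input space.
rng = np.random.default_rng(0)
x = rng.normal(size=(400, 2))
y = (x[:, 0] * x[:, 1] > 0).astype(int)

# Random, untrained weights: an assumption made purely for illustration.
weights = [
    rng.normal(size=(2, 32)) / np.sqrt(2),
    rng.normal(size=(32, 32)) / np.sqrt(32),
]

errors = probe_every_layer(x, y, weights, num_classes=2)
for depth, err in enumerate(errors):
    print(f"layer {depth}: probe error = {err:.3f}")
```

Because the probes never send gradients back into the network, they measure the features without changing them. With a random toy network the per-layer errors need not decrease with depth; the monotonic trend reported in the abstract is an experimental observation about the models the authors studied.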

Related benchmarks

Task	Dataset	Result
Image Classification	Stanford Cars	Accuracy 93.8	660
Image Classification	EuroSAT	Accuracy 97	569
Image Classification	DTD	Accuracy 83.5	487
Image Classification	SUN397	Accuracy 77.2	450
Image Classification	FashionMNIST (test)	Accuracy 85.8	363
Image Classification	Pets	--	308
Image Classification	GTSRB	Accuracy 86.8	291
Intent Classification	Banking77	Accuracy 90.9	260
Hallucination Detection	TriviaQA (test)	AUC-ROC 83.36	243
Natural Language Inference	SNLI	Accuracy 71.2	196

Showing 10 of 106 rows
