Similarity of Neural Network Representations Revisited

About

Recent work has sought to understand the behavior of neural networks by comparing representations between layers and between different trained models. We examine methods for comparing neural network representations based on canonical correlation analysis (CCA). We show that CCA belongs to a family of statistics for measuring multivariate similarity, but that neither CCA nor any other statistic that is invariant to invertible linear transformation can measure meaningful similarities between representations of higher dimension than the number of data points. We introduce a similarity index that measures the relationship between representational similarity matrices and does not suffer from this limitation. This similarity index is equivalent to centered kernel alignment (CKA) and is also closely connected to CCA. Unlike CCA, CKA can reliably identify correspondences between representations in networks trained from different initializations.

Simon Kornblith, Mohammad Norouzi, Honglak Lee, Geoffrey Hinton• 2019

Related benchmarks

Task	Dataset	Result
Language Modeling	WikiText-2 (test)	PPL13.28	2333
Science Question Answering	ScienceQA	--	791
Chart Question Answering	ChartQA	--	371
Prediction-grounded correlation with output difference (JSD)	SST-2	Spearman Correlation0.66	145
Correlation to Accuracy Difference	Cora	Correlation Coefficient0.16	117
Prediction-grounded correlation with accuracy difference	ImageNet-100	Spearman Correlation0.36	111
Correlation to Accuracy Difference	Ogbn-arxiv	Correlation Coefficient0.19	93
Correlation to Model Behavior Differences	MNLI	Accuracy Correlation0.22	93
Correlation to Accuracy Difference	Flickr	Correlation Coefficient0.52	92
Correlation to Accuracy Difference (Test 1)	ImageNet-100 1.0 (test)	JSD Correlation to Accuracy Diff0.41	80

Showing 10 of 42 rows

Other info

Follow for update

@wizwand_team Discord