Intra-Model Collaborative Learning of Neural Networks

About

Recently, collaborative learning proposed by Song and Chai has achieved remarkable improvements in image classification tasks by simultaneously training multiple classifier heads. However, huge memory footprints required by such multi-head structures may hinder the training of large-capacity baseline models. The natural question is how to achieve collaborative learning within a single network without duplicating any modules. In this paper, we propose four ways of collaborative learning among different parts of a single network with negligible engineering efforts. To improve the robustness of the network, we leverage the consistency of the output layer and intermediate layers for training under the collaborative learning framework. Besides, the similarity of intermediate representation and convolution kernel is also introduced to reduce the reduce redundant in a neural network. Compared to the method of Song and Chai, our framework further considers the collaboration inside a single model and takes smaller overhead. Extensive experiments on Cifar-10, Cifar-100, ImageNet32 and STL-10 corroborate the effectiveness of these four ways separately while combining them leads to further improvements. In particular, test errors on the STL-10 dataset are decreased by $9.28\%$ and $5.45\%$ for ResNet-18 and VGG-16 respectively. Moreover, our method is proven to be robust to label noise with experiments on Cifar-10 dataset. For example, our method has $3.53\%$ higher performance under $50\%$ noise ratio setting.

Shijie Fang, Tong Lin• 2021

Related benchmarks

Task	Dataset	Result
Deepfake Detection	DFDC (test)	AUC70.2	130
Fake Face Detection	Celeb-DF v2 (test)	AUC96.58	50
Deepfake Detection	FaceShifter (FSH) (test)	AUC98.66	23
Deepfake Detection	WildDeepfake (test)	AUC0.677	19
Deepfake Detection	NeuralTextures (NT) (test)	AUC (%)89.3	10
Deepfake Detection	Deepfakes (DF) (test)	AUC (%)98.98	10
Deepfake Detection	Face2Face (F2F) (test)	AUC (%)97.33	10
Deepfake Detection	FaceSwap (FS) (test)	AUC98.43	10
Deepfake Detection	NeuralTextures (NT), Deepfakes (DF), Face2Face (F2F), FaceSwap (FS), FaceShifter (FSH), CelebDFV2 (CDFv2), and Face Forensics in the Wild (FFIW10K) raw (test)	NT75.49	10
Deepfake Detection	FaceForensics++ raw, c23, c40 (test)	ACC80.7	8

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord