Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Intra-Model Collaborative Learning of Neural Networks

About

Recently, collaborative learning proposed by Song and Chai has achieved remarkable improvements in image classification tasks by simultaneously training multiple classifier heads. However, huge memory footprints required by such multi-head structures may hinder the training of large-capacity baseline models. The natural question is how to achieve collaborative learning within a single network without duplicating any modules. In this paper, we propose four ways of collaborative learning among different parts of a single network with negligible engineering efforts. To improve the robustness of the network, we leverage the consistency of the output layer and intermediate layers for training under the collaborative learning framework. Besides, the similarity of intermediate representation and convolution kernel is also introduced to reduce the reduce redundant in a neural network. Compared to the method of Song and Chai, our framework further considers the collaboration inside a single model and takes smaller overhead. Extensive experiments on Cifar-10, Cifar-100, ImageNet32 and STL-10 corroborate the effectiveness of these four ways separately while combining them leads to further improvements. In particular, test errors on the STL-10 dataset are decreased by $9.28\%$ and $5.45\%$ for ResNet-18 and VGG-16 respectively. Moreover, our method is proven to be robust to label noise with experiments on Cifar-10 dataset. For example, our method has $3.53\%$ higher performance under $50\%$ noise ratio setting.

Shijie Fang, Tong Lin• 2021

Related benchmarks

TaskDatasetResultRank
Deepfake DetectionDFDC (test)
AUC70.2
87
Fake Face DetectionCeleb-DF v2 (test)
AUC96.58
50
Deepfake DetectionFaceShifter (FSH) (test)
AUC98.66
23
Deepfake DetectionWildDeepfake (test)
AUC0.677
19
Deepfake DetectionNeuralTextures (NT) (test)
AUC (%)89.3
10
Deepfake DetectionDeepfakes (DF) (test)
AUC (%)98.98
10
Deepfake DetectionFace2Face (F2F) (test)
AUC (%)97.33
10
Deepfake DetectionFaceSwap (FS) (test)
AUC98.43
10
Deepfake DetectionNeuralTextures (NT), Deepfakes (DF), Face2Face (F2F), FaceSwap (FS), FaceShifter (FSH), CelebDFV2 (CDFv2), and Face Forensics in the Wild (FFIW10K) raw (test)
NT75.49
10
Deepfake DetectionFaceForensics++ raw, c23, c40 (test)
ACC80.7
8
Showing 10 of 11 rows

Other info

Follow for update