Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Self-Distilled Vision Transformer for Domain Generalization

About

In the recent past, several domain generalization (DG) methods have been proposed, showing encouraging performance, however, almost all of them build on convolutional neural networks (CNNs). There is little to no progress on studying the DG performance of vision transformers (ViTs), which are challenging the supremacy of CNNs on standard benchmarks, often built on i.i.d assumption. This renders the real-world deployment of ViTs doubtful. In this paper, we attempt to explore ViTs towards addressing the DG problem. Similar to CNNs, ViTs also struggle in out-of-distribution scenarios and the main culprit is overfitting to source domains. Inspired by the modular architecture of ViTs, we propose a simple DG approach for ViTs, coined as self-distillation for ViTs. It reduces the overfitting of source domains by easing the learning of input-output mapping problem through curating non-zero entropy supervisory signals for intermediate transformer blocks. Further, it does not introduce any new parameters and can be seamlessly plugged into the modular composition of different ViTs. We empirically demonstrate notable performance gains with different DG baselines and various ViT backbones in five challenging datasets. Moreover, we report favorable performance against recent state-of-the-art DG methods. Our code along with pre-trained models are publicly available at: https://github.com/maryam089/SDViT.

Maryam Sultana, Muzammal Naseer, Muhammad Haris Khan, Salman Khan, Fahad Shahbaz Khan• 2022

Related benchmarks

TaskDatasetResultRank
Domain GeneralizationPACS
Accuracy (Art)87.6
221
Domain GeneralizationOffice-Home
Average Accuracy71.5
63
Domain GeneralizationVLCS
Accuracy (L)64.2
27
Diabetic Retinopathy (DR) gradingMessidor
ACC62
22
Diabetic Retinopathy (DR) gradingAPTOS
Accuracy51.8
22
Domain GeneralizationTerraIncognita
L3831.7
21
Domain GeneralizationDomainNet
Clipart Accuracy63.4
16
Diabetic Retinopathy GradingMessidor-2 (test)
Accuracy62.9
15
Diabetic Retinopathy GradingAPTOS (test)
Accuracy69.3
10
Diabetic Retinopathy GradingEyepacs (test)
Accuracy72
10
Showing 10 of 12 rows

Other info

Follow for update