Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning

About

Domain generalization (DG) aims to adapt a model using one or multiple source domains to ensure robust performance in unseen target domains. Recently, Parameter-Efficient Fine-Tuning (PEFT) of foundation models has shown promising results in the context of DG problem. Nevertheless, existing PEFT methods still struggle to strike a balance between preserving generalizable components of the pre-trained model and learning task-specific features. To gain insights into the distribution of generalizable components, we begin by analyzing the pre-trained weights through the lens of singular value decomposition. Building on these insights, we introduce Singular Value Decomposed Minor Components Adaptation (SoMA), an approach that selectively tunes minor singular components while keeping the residual parts frozen. SoMA effectively retains the generalization ability of the pre-trained model while efficiently acquiring task-specific skills. Moreover, we freeze domain-generalizable blocks and employ an annealing weight decay strategy, thereby achieving an optimal balance in the delicate trade-off between generalizability and discriminability. SoMA attains state-of-the-art results on multiple benchmarks that span both domain generalized semantic segmentation to domain generalized object detection. In addition, our methods introduce no additional inference overhead or regularization loss, maintain compatibility with any backbone or head, and are designed to be versatile, allowing easy integration into a wide range of tasks.

Seokju Yun, Seunghye Chae, Dongheon Lee, Youngmin Ro• 2024

Related benchmarks

TaskDatasetResultRank
Semantic segmentationGTA5 to {Cityscapes, Mapillary, BDD} (test)
mIoU (Cityscapes)73.63
94
Semantic segmentationCityScapes, BDD, and Mapillary (val)
Mean mIoU70.79
85
Semantic segmentationMapillary
mIoU77.87
75
Semantic segmentationACDC (test)--
47
Semantic segmentationACDC (val)
mIoU71.7
29
Semantic segmentationGTAV + Synthia (G+S) to BDD100K (B), Cityscapes (C), Mapillary (M) Sim-2-Real (test)
mIoU (BDD100K)63.59
23
Semantic segmentationMapillary Vistas
mIoU68.3
22
Semantic segmentationCityscapes, BDD100K, and Mapillary Aggregate (test)
mIoU72.98
21
Semantic segmentationBDD
mIoU68.08
19
Object DetectionS-DGOD (test)
AP (DS)69.4
13
Showing 10 of 13 rows

Other info

Code

Follow for update