Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts

About

This paper presents a Domain-Inspired Sharpness-Aware Minimization (DISAM) algorithm for optimization under domain shifts. It is motivated by the inconsistent convergence degree of SAM across different domains, which induces optimization bias towards certain domains and thus impairs the overall convergence. To address this issue, we consider the domain-level convergence consistency in the sharpness estimation to prevent the overwhelming (deficient) perturbations for less (well) optimized domains. Specifically, DISAM introduces the constraint of minimizing variance in the domain loss, which allows the elastic gradient calibration in perturbation generation: when one domain is optimized above the averaging level \textit{w.r.t.} loss, the gradient perturbation towards that domain will be weakened automatically, and vice versa. Under this mechanism, we theoretically show that DISAM can achieve faster overall convergence and improved generalization in principle when inconsistent convergence emerges. Extensive experiments on various domain generalization benchmarks show the superiority of DISAM over a range of state-of-the-art methods. Furthermore, we show the superior efficiency of DISAM in parameter-efficient fine-tuning combined with the pretraining models. The source code is released at https://github.com/MediaBrain-SJTU/DISAM.

Ruipeng Zhang, Ziqing Fan, Jiangchao Yao, Ya Zhang, Yanfeng Wang• 2024

Related benchmarks

TaskDatasetResultRank
Domain GeneralizationDomainBed (out-of-domain)
VLCS Accuracy82.7
55
Voice SteganalysisVoice Steganalysis QIM, PMS, LSB, AHCM
QIM Accuracy85.11
16
Noisy Attribute GeneralizationVLCS ID/OOD
ID Accuracy89.6
15
Noisy Attribute GeneralizationCHAMMI-CP ID/OOD
ID Score72.4
15
Showing 4 of 4 rows

Other info

Follow for update