IT-RUDA: Information Theory Assisted Robust Unsupervised Domain Adaptation
About
Distribution shift between train (source) and test (target) datasets is a common problem encountered in machine learning applications. One approach to resolve this issue is to use the Unsupervised Domain Adaptation (UDA) technique that carries out knowledge transfer from a label-rich source domain to an unlabeled target domain. Outliers that exist in either source or target datasets can introduce additional challenges when using UDA in practice. In this paper, $\alpha$-divergence is used as a measure to minimize the discrepancy between the source and target distributions while inheriting robustness, adjustable with a single parameter $\alpha$, as the prominent feature of this measure. Here, it is shown that the other well-known divergence-based UDA techniques can be derived as special cases of the proposed method. Furthermore, a theoretical upper bound is derived for the loss in the target domain in terms of the source loss and the initial $\alpha$-divergence between the two domains. The robustness of the proposed method is validated through testing on several benchmarked datasets in open-set and partial UDA setups where extra classes existing in target and source datasets are considered as outliers.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Melanoma Classification | D7c | AUROC57.39 | 16 | |
| Melanoma Classification | MIDc | AUROC51.68 | 16 | |
| Melanoma Classification | FITZ | AUROC53.65 | 16 | |
| Melanoma Classification | Mc | AUROC63.34 | 16 | |
| Melanoma Classification | Md Dermoscopic source (train) | Recall49.28 | 16 | |
| Melanoma Classification | HAM Dermoscopic (test) | Recall47.9 | 16 | |
| Melanoma Screening | Clinical unseen (test) | Mc9.77 | 16 | |
| Melanoma Classification | MIDd Dermoscopic (test) | Recall12.66 | 16 | |
| Melanoma Classification | D7d Dermoscopic (test) | Recall12.62 | 16 |