Cycle Self-Training for Domain Adaptation
About
Mainstream approaches for unsupervised domain adaptation (UDA) learn domain-invariant representations to narrow the domain shift. Recently, self-training has been gaining momentum in UDA, which exploits unlabeled target data by training with target pseudo-labels. However, as corroborated in this work, under distributional shift in UDA, the pseudo-labels can be unreliable in terms of their large discrepancy from target ground truth. Thereby, we propose Cycle Self-Training (CST), a principled self-training algorithm that explicitly enforces pseudo-labels to generalize across domains. CST cycles between a forward step and a reverse step until convergence. In the forward step, CST generates target pseudo-labels with a source-trained classifier. In the reverse step, CST trains a target classifier using target pseudo-labels, and then updates the shared representations to make the target classifier perform well on the source data. We introduce the Tsallis entropy as a confidence-friendly regularization to improve the quality of target pseudo-labels. We analyze CST theoretically under realistic assumptions, and provide hard cases where CST recovers target ground truth, while both invariant feature learning and vanilla self-training fail. Empirical results indicate that CST significantly improves over the state-of-the-arts on visual recognition and sentiment analysis benchmarks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Unsupervised Domain Adaptation | Office-Home (test) | Average Accuracy73 | 332 | |
| Unsupervised Domain Adaptation | Office-Home | Average Accuracy72.9 | 238 | |
| Domain Adaptation | VisDA 2017 (test) | Mean Class Accuracy86.5 | 98 | |
| Object Classification | VisDA synthetic-to-real 2017 | Mean Accuracy80.6 | 91 | |
| Unsupervised Domain Adaptation | SVHN → MNIST (test) | Accuracy98.2 | 41 | |
| Unsupervised Domain Adaptation | MNIST -> USPS (test) | Accuracy0.985 | 28 | |
| Domain Adaptation | DomainNet target | R->C Accuracy83.9 | 22 | |
| Image Classification | Blended-Office-Home-LMT ResNet-50 (test) | Accuracy (Clipart)58.3 | 18 | |
| Image Classification | OOD-CV 0% occlusion 1.0 | Top-1 Accuracy (Combined)84 | 15 | |
| Heart failure prediction | eICU spatial domain shift 2014-2015 (test) | AUROC87.34 | 15 |