Butterfly: One-step Approach towards Wildly Unsupervised Domain Adaptation
About
In unsupervised domain adaptation (UDA), classifiers for the target domain (TD) are trained with clean labeled data from the source domain (SD) and unlabeled data from TD. However, in the wild, it is difficult to acquire a large amount of perfectly clean labeled data in SD given limited budget. Hence, we consider a new, more realistic and more challenging problem setting, where classifiers have to be trained with noisy labeled data from SD and unlabeled data from TD -- we name it wildly UDA (WUDA). We show that WUDA ruins all UDA methods if taking no care of label noise in SD, and to this end, we propose a Butterfly framework, a powerful and efficient solution to WUDA. Butterfly maintains four deep networks simultaneously, where two take care of all adaptations (i.e., noisy-to-clean, labeled-to-unlabeled, and SD-to-TD-distributional) and then the other two can focus on classification in TD. As a consequence, Butterfly possesses all the conceptually necessary components for solving WUDA. Experiments demonstrate that, under WUDA, Butterfly significantly outperforms existing baseline methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Digit Classification | SYND → MNIST P45 (target) | Target Domain Accuracy90.21 | 14 | |
| Digit Classification | SYND → MNIST S20 (target) | Accuracy (Target Domain)95.88 | 14 | |
| Digit Classification | SYND → MNIST S45 (target) | Target Domain Accuracy94.97 | 14 | |
| Digit Classification | SYND → MNIST P20 (target) | Accuracy (Target Domain)95.29 | 14 | |
| Sentiment Analysis | Human-sentiment WUDA target-domain 45% noise rate | Transfer B to D Accuracy56.59 | 8 | |
| Sentiment Analysis | human-sentiment WUDA 20% noise rate Target Domain 1.0 | Transfer B→D71.84 | 8 | |
| Domain Adaptation | Bing -> Caltech256 (target) | Accuracy81.71 | 7 | |
| Domain Adaptation | Bing -> SUN (target) | Accuracy30.54 | 7 | |
| Domain Adaptation | Bing -> Imagenet | Accuracy75 | 7 |