Butterfly: One-step Approach towards Wildly Unsupervised Domain Adaptation

About

In unsupervised domain adaptation (UDA), classifiers for the target domain (TD) are trained with clean labeled data from the source domain (SD) and unlabeled data from TD. In the wild, however, it is difficult to acquire a large amount of perfectly clean labeled data in SD on a limited budget. Hence, we consider a new, more realistic, and more challenging problem setting, in which classifiers have to be trained with noisy labeled data from SD and unlabeled data from TD; we name it wildly UDA (WUDA). We show that WUDA ruins all UDA methods that take no care of label noise in SD, and to this end we propose the Butterfly framework, a powerful and efficient solution to WUDA. Butterfly maintains four deep networks simultaneously: two take care of all adaptations (i.e., noisy-to-clean, labeled-to-unlabeled, and SD-to-TD-distributional), so that the other two can focus on classification in TD. As a consequence, Butterfly possesses all the conceptually necessary components for solving WUDA. Experiments demonstrate that, under WUDA, Butterfly significantly outperforms existing baseline methods.

Feng Liu, Jie Lu, Bo Han, Gang Niu, Guangquan Zhang, Masashi Sugiyama • 2019
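
The WUDA setting described above starts from source-domain labels that are partly wrong. The following is a minimal sketch of how such noisy labels might be simulated, not the authors' code. It assumes that the "S" and "P" in benchmark names such as S20 and P45 stand for symmetric-flip and pair-flip noise at 20% and 45% rates, a convention common in the label-noise literature. The function name flip_labels, its parameters, and the helper observed_rate are hypothetical.

```python
import random


def flip_labels(labels, num_classes, noise_rate, kind="symmetric", seed=0):
    """Return a noisy copy of `labels` (ints in [0, num_classes)).

    Assumed noise models (not taken from the Butterfly paper's code):
      - "symmetric": a corrupted label moves to one of the other
        classes, chosen uniformly at random.
      - "pair": a corrupted label moves to the next class, (y + 1) % K.
    """
    if kind not in ("symmetric", "pair"):
        raise ValueError("kind must be 'symmetric' or 'pair'")
    if not 0.0 <= noise_rate < 1.0:
        raise ValueError("noise_rate must be in [0, 1)")
    if num_classes < 2:
        raise ValueError("need at least two classes to flip labels")

    rng = random.Random(seed)  # local generator keeps runs reproducible
    noisy = []
    for y in labels:
        if rng.random() < noise_rate:
            if kind == "pair":
                y = (y + 1) % num_classes
            else:
                # An offset in [1, num_classes - 1] never maps y to itself.
                y = (y + rng.randrange(1, num_classes)) % num_classes
        noisy.append(y)
    return noisy


def observed_rate(clean, noisy):
    """Fraction of positions where the noisy label differs from the clean one."""
    return sum(c != n for c, n in zip(clean, noisy)) / len(clean)


# Toy stand-in for source-domain digit labels (10 classes).
clean = [i % 10 for i in range(10000)]
noisy_s20 = flip_labels(clean, 10, 0.20, kind="symmetric")
noisy_p45 = flip_labels(clean, 10, 0.45, kind="pair")

print("S20 observed noise rate:", observed_rate(clean, noisy_s20))
print("P45 observed noise rate:", observed_rate(clean, noisy_p45))
```

Under these assumptions, a UDA method would be trained on the noisy lists in place of the clean source labels, while target-domain data stays unlabeled. Some definitions of symmetric noise also allow a corrupted label to land back on its true class; this sketch excludes that case so that the corruption probability equals the expected fraction of wrong labels.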

Related benchmarks

Task	Dataset	Metric	Result
Digit Classification	SYND → MNIST P45 (target)	Target Domain Accuracy	90.21	14
Digit Classification	SYND → MNIST S20 (target)	Accuracy (Target Domain)	95.88	14
Digit Classification	SYND → MNIST S45 (target)	Target Domain Accuracy	94.97	14
Digit Classification	SYND → MNIST P20 (target)	Accuracy (Target Domain)	95.29	14
Sentiment Analysis	Human-sentiment WUDA target-domain 45% noise rate	Transfer B to D Accuracy	56.59	8
Sentiment Analysis	human-sentiment WUDA 20% noise rate Target Domain 1.0	Transfer B→D	71.84	8
Domain Adaptation	Bing -> Caltech256 (target)	Accuracy	81.71	7
Domain Adaptation	Bing -> SUN (target)	Accuracy	30.54	7
Domain Adaptation	Bing -> Imagenet	Accuracy	75	7
