Positive-Unlabeled Classification under Class-Prior Shift: A Prior-invariant Approach Based on Density Ratio Estimation

About

Learning from positive and unlabeled (PU) data is an important problem in various applications. Most of the recent approaches for PU classification assume that the class-prior (the ratio of positive samples) in the training unlabeled dataset is identical to that of the test data, which does not hold in many practical cases. In addition, we usually do not know the class-priors of the training and test data, thus we have no clue on how to train a classifier without them. To address these problems, we propose a novel PU classification method based on density ratio estimation. A notable advantage of our proposed method is that it does not require the class-priors in the training phase; class-prior shift is incorporated only in the test phase. We theoretically justify our proposed method and experimentally demonstrate its effectiveness.

Shota Nakajima, Masashi Sugiyama• 2021

Related benchmarks

Task	Dataset	Result
Prior Estimation	MNIST	Estimation Error1.9	72
Prior Estimation	CIFAR	Estimation Error0.018	72
Prior Estimation	Fashion	Estimation Error2	72
Class Prior Estimation	segment	Estimation Error0.007	36
Class Prior Estimation	banknote	Estimation Error1.6	36
Class Prior Estimation	vehicle	Estimation Error3.2	36
Class Prior Estimation	Yeast	Estimation Error6.5	36
Class Prior Estimation	Waveform	Estimation Error3.9	36
Class Prior Estimation	Spambase	Estimation Error0.023	36
Class Prior Estimation	Diabetes	Estimation Error0.138	36

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord