Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mixture Proportion Estimation via Kernel Embedding of Distributions

About

Mixture proportion estimation (MPE) is the problem of estimating the weight of a component distribution in a mixture, given samples from the mixture and component. This problem constitutes a key part in many "weakly supervised learning" problems like learning with positive and unlabelled samples, learning with label noise, anomaly detection and crowdsourcing. While there have been several methods proposed to solve this problem, to the best of our knowledge no efficient algorithm with a proven convergence rate towards the true proportion exists for this problem. We fill this gap by constructing a provably correct algorithm for MPE, and derive convergence rates under certain assumptions on the distribution. Our method is based on embedding distributions onto an RKHS, and implementing it only requires solving a simple convex quadratic programming problem a few times. We run our algorithm on several standard classification datasets, and demonstrate that it performs comparably to or better than other algorithms on most datasets.

Harish G. Ramaswamy, Clayton Scott, Ambuj Tewari• 2016

Related benchmarks

TaskDatasetResultRank
Mixture Proportion EstimationBinarized CIFAR
Absolute Estimation Error0.251
17
Mixture Proportion EstimationCIFAR Dog vs Cat
Abs. Estimation Error0.286
12
Mixture Proportion EstimationBinarized MNIST
Absolute Estimation Error (%)5.6
7
Mixture Proportion EstimationMNIST 17
Abs Estimation Error4.3
7
Mixture Proportion EstimationMNIST Overlap
Absolute Estimation Error0.074
6
Class Prior EstimationF-MNIST unlabeled 1 (train)
Absolute Estimation Error0.146
4
Class Prior EstimationF-MNIST-2 unlabeled (train)
Absolute Estimation Error0.106
4
Class Prior EstimationCIFAR10 unlabeled 1 (train)
Absolute Estimation Error0.115
4
Class Prior EstimationCIFAR10-2 unlabeled (train)
Absolute Estimation Error0.164
4
Class Prior EstimationSTL10 unlabeled 1 (train)
Absolute Estimation Error0.096
4
Showing 10 of 13 rows

Other info

Follow for update