
Estimating the Accuracies of Multiple Classifiers Without Labeled Data

About

In various situations one is given only the predictions of multiple classifiers over a large unlabeled test set. This scenario raises the following questions: Without any labeled data and without any a-priori knowledge about the reliability of these different classifiers, is it possible to consistently and computationally efficiently estimate their accuracies? Furthermore, also in a completely unsupervised manner, can one construct a more accurate unsupervised ensemble classifier? In this paper, focusing on the binary case, we present simple, computationally efficient algorithms to answer these questions. Furthermore, under standard classifier independence assumptions, we prove our methods are consistent and study their asymptotic error. Our approach is spectral, based on the fact that the off-diagonal entries of the classifiers' covariance matrix and 3-d tensor are rank-one. We illustrate the competitive performance of our algorithms via extensive experiments on both artificial and real datasets.

Ariel Jaffe, Boaz Nadler, Yuval Kluger • 2014
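
The rank-one structure mentioned in the abstract can be illustrated with a small simulation. The sketch below is not the authors' algorithm; it is a minimal illustration under stated assumptions: labels and predictions take values in {-1, +1}, classifiers make conditionally independent errors given the true label, and most classifiers are better than random (used only to fix the sign). Under these assumptions the off-diagonal covariance entries factor as v_i * v_j with v_i = sqrt(1 - b^2) * (psi_i + eta_i - 1), where psi_i and eta_i are the sensitivity and specificity of classifier i and b is the class imbalance. The function names (`simulate_predictions`, `rank_one_vector`, `weighted_vote`) and the iterative diagonal-completion heuristic are illustrative choices, and the heuristic may fail to converge when a single classifier dominates the others.

```python
import numpy as np


def simulate_predictions(n, psi, eta, p_pos, rng):
    """Draw labels y in {-1, +1} and conditionally independent predictions.

    psi[i] is the sensitivity and eta[i] the specificity of classifier i.
    Returns y with shape (n,) and preds with shape (m, n).
    """
    y = np.where(rng.random(n) < p_pos, 1, -1)
    u = rng.random((len(psi), n))
    # Probability that classifier i outputs +1 given the true label.
    p_plus = np.where(y == 1, psi[:, None], 1.0 - eta[:, None])
    preds = np.where(u < p_plus, 1, -1)
    return y, preds


def rank_one_vector(preds, n_iter=200):
    """Estimate v such that cov(f_i, f_j) ~ v_i * v_j for i != j.

    Heuristic (an assumption of this sketch): the diagonal of the covariance
    matrix does not follow the rank-one model, so it is repeatedly replaced
    by the current guess v_i**2 before taking the leading eigenpair.
    """
    r = np.cov(preds)  # rows of preds are treated as variables
    v = np.zeros(r.shape[0])
    for _ in range(n_iter):
        np.fill_diagonal(r, v ** 2)
        eigvals, eigvecs = np.linalg.eigh(r)  # eigenvalues in ascending order
        v = np.sqrt(max(eigvals[-1], 0.0)) * eigvecs[:, -1]
    # Sign convention: assume most classifiers are better than random.
    if v.sum() < 0:
        v = -v
    return v


def weighted_vote(preds, v):
    """Unsupervised ensemble: sign of the v-weighted sum of predictions."""
    return np.sign(v @ preds)


# Hypothetical example: five classifiers of decreasing quality.
rng = np.random.default_rng(0)
psi = np.array([0.9, 0.8, 0.8, 0.7, 0.6])
eta = np.array([0.8, 0.8, 0.7, 0.7, 0.6])
y, preds = simulate_predictions(50_000, psi, eta, p_pos=0.5, rng=rng)

v_hat = rank_one_vector(preds)
print("estimated v:       ", np.round(v_hat, 3))
print("psi + eta - 1:     ", psi + eta - 1.0)
print("ensemble accuracy: ", (weighted_vote(preds, v_hat) == y).mean())
```

The true labels `y` are used here only to check the result; `rank_one_vector` and `weighted_vote` see the predictions alone. Since the class imbalance b is generally unknown, v recovers the balanced accuracies only up to the common factor sqrt(1 - b^2), which is still enough to rank the classifiers.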

Related benchmarks

Task | Dataset | Result | Rank
Multi-task Language Understanding | MMLU | MMLU Accuracy 93.7 | 442
Multi-task Language Understanding | MMLU-Pro | Accuracy 91.4 | 57
Multiple-choice Question Answering | GPQA | Accuracy (%) 60.5 | 44
Multiple-choice Question Answering | GPQA Diamond | Accuracy 57.1 | 18
Mathematical Reasoning | IMO Shortlist | Accuracy 59.1 | 8
Question Answering | Humanity's Last Exam (HLE) curated 649-question subset (test) | Accuracy 52 | 7
