Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Who Said What: Modeling Individual Labelers Improves Classification

About

Data are often labeled by many different experts with each expert only labeling a small fraction of the data and each data point being labeled by several experts. This reduces the workload on individual experts and also gives a better estimate of the unobserved ground truth. When experts disagree, the standard approaches are to treat the majority opinion as the correct label or to model the correct label as a distribution. These approaches, however, do not make any use of potentially valuable information about which expert produced which label. To make use of this extra information, we propose modeling the experts individually and then learning averaging weights for combining them, possibly in sample-specific ways. This allows us to give more weight to more reliable experts and take advantage of the unique strengths of individual experts at classifying certain types of data. Here we show that our approach leads to improvements in computer-aided diagnosis of diabetic retinopathy. We also show that our method performs better than competing algorithms by Welinder and Perona (2010), and by Mnih and Hinton (2012). Our work offers an innovative approach for dealing with the myriad real-world settings that use expert opinions to define labels for training.

Melody Y. Guan, Varun Gulshan, Andrew M. Dai, Geoffrey E. Hinton• 2017

Related benchmarks

TaskDatasetResultRank
Optic Disc and Optic Cup SegmentationRIGA
Disc Segmentation Score97.5
32
Image ClassificationLabelMe (test)
Accuracy82.12
19
Biomedical Image SegmentationQUBIQ Kidney 2021 (test)
Soft Dice73.44
10
Biomedical Image SegmentationQUBIQ Prostate 2 2021 (test)
Soft Dice0.7561
10
Biomedical Image SegmentationQUBIQ Prostate 1 2021 (test)
Soft Dice87.03
10
Biomedical Image SegmentationQUBIQ Brain 2021 (test)
Soft Dice83.54
9
Biomedical Image SegmentationQUBIQ Tumor 2021 (test)
Soft Dice0.8674
9
Music Genre ClassificationMUSIC (test)
Accuracy76.58
9
Liver Steatosis DiagnosisHP-T biopsy-proven (test)
J-Statistic0.889
8
Medical Image SegmentationQUBIQ Brain Tumor T1
BDice86.45
5
Showing 10 of 17 rows

Other info

Follow for update