Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Stochastic Optimization of Areas Under Precision-Recall Curves with Provable Convergence

About

Areas under ROC (AUROC) and precision-recall curves (AUPRC) are common metrics for evaluating classification performance for imbalanced problems. Compared with AUROC, AUPRC is a more appropriate metric for highly imbalanced datasets. While stochastic optimization of AUROC has been studied extensively, principled stochastic optimization of AUPRC has been rarely explored. In this work, we propose a principled technical method to optimize AUPRC for deep learning. Our approach is based on maximizing the averaged precision (AP), which is an unbiased point estimator of AUPRC. We cast the objective into a sum of {\it dependent compositional functions} with inner functions dependent on random variables of the outer level. We propose efficient adaptive and non-adaptive stochastic algorithms named SOAP with {\it provable convergence guarantee under mild conditions} by leveraging recent advances in stochastic compositional optimization. Extensive experimental results on image and graph datasets demonstrate that our proposed method outperforms prior methods on imbalanced problems in terms of AUPRC. To the best of our knowledge, our work represents the first attempt to optimize AUPRC with provable convergence. The SOAP has been implemented in the libAUC library at~\url{https://libauc.org/}.

Qi Qi, Youzhi Luo, Zhao Xu, Shuiwang Ji, Tianbao Yang• 2021

Related benchmarks

TaskDatasetResultRank
Graph ClassificationHIV MoleculeNet (test)
AUPRC35.47
27
Graph ClassificationMUV MoleculeNet (test)
AUPRC33.52
27
Molecule ClassificationTox21 Task 0 (test)
AUPRC0.5276
27
Molecule ClassificationTox21 Task 2 (test)
AUPRC63.33
27
Molecule ClassificationToxCast Task 8 (test)
AUPRC0.2561
27
Graph ClassificationMIT AICURES (test)
AUPRC0.6639
27
Molecule ClassificationToxCast Task 12 (test)
AUPRC0.0601
27
Image ClassificationCIFAR-10 imbalanced (test)
AUPRC76.29
18
Image ClassificationCIFAR-100 imbalanced version (test)
AUPRC0.6251
18
Medical Image ClassificationKaggle Melanoma (test)
AUPRC0.3152
18
Showing 10 of 11 rows

Other info

Code

Follow for update