Multi-block Min-max Bilevel Optimization with Applications in Multi-task Deep AUC Maximization
About
In this paper, we study multi-block min-max bilevel optimization problems, where the upper level is non-convex strongly-concave minimax objective and the lower level is a strongly convex objective, and there are multiple blocks of dual variables and lower level problems. Due to the intertwined multi-block min-max bilevel structure, the computational cost at each iteration could be prohibitively high, especially with a large number of blocks. To tackle this challenge, we present a single-loop randomized stochastic algorithm, which requires updates for only a constant number of blocks at each iteration. Under some mild assumptions on the problem, we establish its sample complexity of $O(1/\epsilon^4)$ for finding an $\epsilon$-stationary point. This matches the optimal complexity for solving stochastic nonconvex optimization under a general unbiased stochastic oracle model. Moreover, we provide two applications of the proposed method in multi-task deep AUC (area under ROC curve) maximization and multi-task deep partial AUC maximization. Experimental results validate our theory and demonstrate the effectiveness of our method on problems with hundreds of tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Partial AUC Maximization | CIFAR100 (test) | pAUC (FPR <= 0.1)0.9262 | 5 | |
| Partial AUC Maximization | CelebA (test) | pAUC (FPR<=0.1)0.836 | 5 | |
| Partial AUC Maximization | CheXpert (test) | pAUC (FPR <= 0.1)68.27 | 5 | |
| Partial AUC Maximization | ogbg-molpcba (test) | pAUC (FPR <= 0.1)0.7374 | 5 | |
| Multi-task Deep AUC Maximization | CIFAR100 (test) | mAUC0.9272 | 2 | |
| Multi-task Deep AUC Maximization | CheXpert (test) | mAUC0.8198 | 2 | |
| Multi-task Deep AUC Maximization | CelebA (test) | mAUC91.92 | 2 | |
| Multi-task Deep AUC Maximization | ogbg-molpcba (test) | mAUC0.8406 | 2 |