Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Adaptive Online Mirror Descent for Tchebycheff Scalarization in Multi-Objective Learning

About

Multi-objective learning (MOL) aims to learn under multiple potentially conflicting objectives and strike a proper balance. While recent preference-guided MOL methods often rely on additional optimization objectives or constraints, we consider the classic Tchebycheff scalarization (TCH) that naturally allows for locating solutions with user-specified trade-offs. Due to its minimax formulation, directly optimizing TCH often leads to training oscillation and stagnation. In light of this limitation, we propose an adaptive online mirror descent algorithm for TCH, called (Ada)OMD-TCH. One of our main ingredients is an adaptive online-to-batch conversion that significantly improves solution optimality over traditional conversion in practice while maintaining the same theoretical convergence guarantees. We show that (Ada)OMD-TCH achieves a convergence rate of $\mathcal O(\sqrt{\log m/T})$, where $m$ is the number of objectives and $T$ is the number of rounds, providing a tighter dependency on $m$ in the offline setting compared to existing work. Empirically, we demonstrate on both synthetic problems and federated learning tasks that (Ada)OMD-TCH effectively smooths the training process and yields preference-guided, specific, diverse, and fair solutions.

Meitong Liu, Xiaoyuan Zhang, Chulin Xie, Kate Donahue, Han Zhao• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationMNIST Partial Class C=5
Accuracy94.239
15
Image ClassificationMNIST Partial Class C=5 (test)
Average Accuracy94.239
15
Image ClassificationMNIST Rotation
Average Accuracy92.593
15
ClassificationMNIST Partial Class C=2 (test)
Accuracy92.387
15
Image ClassificationCIFAR10 Partial Class C=5
Accuracy55.632
15
Image ClassificationCIFAR10 Partial Class C=2 (test)
Accuracy36.765
15
Image ClassificationCIFAR10 Rotation
Accuracy66.218
15
Image ClassificationCIFAR10 Partial Class C=2
Accuracy36.765
15
Multi-Objective OptimizationVLMOP2
Hypervolume29.2
14
Multi-Objective OptimizationF2
Hypervolume1.008
14
Showing 10 of 16 rows

Other info

Follow for update