Robust Multi-Task Learning with Excess Risks

About

Multi-task learning (MTL) considers learning a joint model for multiple tasks by optimizing a convex combination of all task losses. To solve the optimization problem, existing methods use an adaptive weight updating scheme, where task weights are dynamically adjusted based on their respective losses to prioritize difficult tasks. However, these algorithms face a great challenge whenever label noise is present, in which case excessive weights tend to be assigned to noisy tasks that have relatively large Bayes optimal errors, thereby overshadowing other tasks and causing performance to drop across the board. To overcome this limitation, we propose Multi-Task Learning with Excess Risks (ExcessMTL), an excess risk-based task balancing method that updates the task weights by their distances to convergence instead. Intuitively, ExcessMTL assigns higher weights to worse-trained tasks that are further from convergence. To estimate the excess risks, we develop an efficient and accurate method with Taylor approximation. Theoretically, we show that our proposed algorithm achieves convergence guarantees and Pareto stationarity. Empirically, we evaluate our algorithm on various MTL benchmarks and demonstrate its superior performance over existing methods in the presence of label noise. Our code is available at https://github.com/yifei-he/ExcessMTL.

Yifei He, Shiji Zhou, Guojun Zhang, Hyokun Yun, Yi Xu, Belinda Zeng, Trishul Chilimbi, Han Zhao• 2024

Related benchmarks

Task	Dataset	Result
Depth Estimation	NYU V2	--	167
Semantic segmentation	Cityscapes	Mean IoU69.69	68
Depth Estimation	Cityscapes	Abs. Err.0.0128	65
Surface Normal Estimation	NYU V2	Mean Angular Error26.1446	65
Image Classification	CIFAR10 Rotation	Accuracy66.211	33
Semantic segmentation	NYU V2	mIoU48.46	30
Image Classification	CIFAR10 Partial Class C=2 (test)	Accuracy35.24	15
Image Classification	CIFAR10 Partial Class C=2	Accuracy35.24	15
Classification	MNIST Partial Class C=2 (test)	Accuracy91.354	15
Image Classification	CIFAR10 Partial Class C=5	Accuracy48.142	15

Showing 10 of 25 rows

Other info

Follow for update

@wizwand_team Discord