Multi-Task Learning as a Bargaining Game
About
In multi-task learning (MTL), a joint model is trained to simultaneously make predictions for several tasks. Joint training reduces computation costs and improves data efficiency; however, since the gradients of these different tasks may conflict, training a joint model for MTL often yields lower performance than its corresponding single-task counterparts. A common method for alleviating this issue is to combine per-task gradients into a joint update direction using a particular heuristic. In this paper, we propose viewing the gradient combination step as a bargaining game, where tasks negotiate to reach an agreement on a joint direction of parameter update. Under certain assumptions, the bargaining problem has a unique solution, known as the Nash Bargaining Solution, which we propose to use as a principled approach to multi-task learning. We describe a new MTL optimization procedure, Nash-MTL, and derive theoretical guarantees for its convergence. Empirically, we show that Nash-MTL achieves state-of-the-art results on multiple MTL benchmarks in various domains.
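The idea can be illustrated with a toy NumPy sketch. Under the bargaining view, the joint update is a positive combination of per-task gradients, Δθ = Σᵢ αᵢ gᵢ, where the weights α satisfy the Nash bargaining optimality condition G Gᵀ α = 1/α (elementwise, with the rows of G being the task gradients). The snippet below solves this condition with a simple damped fixed-point iteration; this is a hand-rolled approximation for illustration, not the sequential optimization procedure the paper actually uses, and the function name `nash_mtl_direction` is our own.

```python
import numpy as np

def nash_mtl_direction(grads, n_iter=200, eps=1e-8):
    """Toy sketch of the Nash bargaining update direction.

    grads: (K, d) array, one per-task gradient per row.
    Solves G G^T alpha = 1/alpha (elementwise) for alpha > 0 via a
    damped fixed-point iteration (illustrative only), then returns
    the weights and the joint update direction sum_i alpha_i g_i.
    """
    G = np.asarray(grads, dtype=float)
    GGt = G @ G.T                       # (K, K) Gram matrix of task gradients
    alpha = np.ones(G.shape[0]) / G.shape[0]
    for _ in range(n_iter):
        denom = np.clip(GGt @ alpha, eps, None)
        new_alpha = 1.0 / denom         # candidate from G G^T alpha = 1/alpha
        alpha = 0.5 * alpha + 0.5 * new_alpha  # damping for stability
    return alpha, alpha @ G
```

For two orthogonal gradients the condition reduces to αᵢ = 1/‖gᵢ‖, so a task's weight scales inversely with its gradient norm; this is the scale invariance that distinguishes the bargaining solution from naive gradient averaging.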
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Semantic segmentation | Cityscapes (test) | mIoU 75.41 | 1145 |
| Depth Estimation | NYU v2 (test) | -- | 423 |
| Semantic segmentation | NYU v2 (test) | mIoU 51.73 | 248 |
| Surface Normal Estimation | NYU v2 (test) | Mean Angle Distance (MAD) 23.21 | 206 |
| Depth Estimation | NYU Depth V2 | RMSE 0.78 | 177 |
| Semantic segmentation | NYU Depth V2 (test) | mIoU 40.13 | 172 |
| Semantic segmentation | NYUD v2 | mIoU 31.32 | 96 |
| Multi-Label Classification | ChestX-Ray14 (test) | -- | 88 |
| Multi-task Learning | Cityscapes (test) | MR 1.75 | 43 |
| Depth Estimation | Cityscapes (test) | Abs Err 0.0129 | 40 |