Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Dual-Balancing for Multi-Task Learning

About

Multi-task learning aims to learn multiple related tasks simultaneously and has achieved great success in various fields. However, the disparity in loss and gradient scales among tasks often leads to performance compromises, and the balancing of tasks remains a significant challenge. In this paper, we propose Dual-Balancing Multi-Task Learning (DB-MTL) to achieve task balancing from both the loss and gradient perspectives. Specifically, DB-MTL achieves loss-scale balancing by performing logarithm transformation on each task loss, and rescales gradient magnitudes by normalizing all task gradients to comparable magnitudes using the maximum gradient norm. Extensive experiments on a number of benchmark datasets demonstrate that DB-MTL consistently performs better than the current state-of-the-art.

Baijiong Lin, Weisen Jiang, Feiyang Ye, Yu Zhang, Pengguang Chen, Ying-Cong Chen, Shu Liu, Ivor W. Tsang, James T. Kwok• 2023

Related benchmarks

TaskDatasetResultRank
Visual ReasoningV*
Accuracy35.6
52
Visual ReasoningBLINK
Jigsaw Accuracy34.7
49
Image ReasoningWeMath
Accuracy11
34
Showing 3 of 3 rows

Other info

Follow for update