Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Efficiently Identifying Task Groupings for Multi-Task Learning

About

Multi-task learning can leverage information learned by one task to benefit the training of other tasks. Despite this capacity, naively training all tasks together in one model often degrades performance, and exhaustively searching through combinations of task groupings can be prohibitively expensive. As a result, efficiently identifying the tasks that would benefit from training together remains a challenging design question without a clear solution. In this paper, we suggest an approach to select which tasks should train together in multi-task learning models. Our method determines task groupings in a single run by training all tasks together and quantifying the effect to which one task's gradient would affect another task's loss. On the large-scale Taskonomy computer vision dataset, we find this method can decrease test loss by 10.0% compared to simply training all tasks together while operating 11.6 times faster than a state-of-the-art task grouping method.

Christopher Fifty, Ehsan Amid, Zhe Zhao, Tianhe Yu, Rohan Anil, Chelsea Finn• 2021

Related benchmarks

TaskDatasetResultRank
PDE solvingKlein-Gordon equation
Relative L2 Error9.19
31
Forward PDE solvingHelmholtz
Relative Error0.012
26
Multi-task LearningRidership
Total Loss17.5
20
Multi-task LearningETTm1
Total Loss3.9
20
Multi-task LearningChemical
Total Loss4.69
20
Multi-task LearningCelebA
Total Loss49.67
20
Forward PDE problem solvingBurgers
Relative L2 Error0.0046
19
Forward PDE solvingConv-Diff 10K-epoch
Relative L2 Error2.98
16
Forward PDE solvingAllen–Cahn 10K-epoch
Relative L2 Error1.71
16
Forward PDE solvingHelmholtz 10K-epoch
Relative L2 Error5.3
16
Showing 10 of 19 rows

Other info

Follow for update