Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mitigate Negative Transfer with Similarity Heuristic Lifelong Prompt Tuning

About

Lifelong prompt tuning has significantly advanced parameter-efficient lifelong learning with its efficiency and minimal storage demands on various tasks. Our empirical studies, however, highlights certain transferability constraints in the current methodologies: a universal algorithm that guarantees consistent positive transfer across all tasks is currently unattainable, especially when dealing dissimilar tasks that may engender negative transfer. Identifying the misalignment between algorithm selection and task specificity as the primary cause of negative transfer, we present the Similarity Heuristic Lifelong Prompt Tuning (SHLPT) framework. This innovative strategy partitions tasks into two distinct subsets by harnessing a learnable similarity metric, thereby facilitating fruitful transfer from tasks regardless of their similarity or dissimilarity. Additionally, SHLPT incorporates a parameter pool to combat catastrophic forgetting effectively. Our experiments shows that SHLPT outperforms state-of-the-art techniques in lifelong learning benchmarks and demonstrates robustness against negative transfer in diverse task sequences.

Chenyuan Wu, Gangwei Jiang, Defu Lian• 2024

Related benchmarks

TaskDatasetResultRank
Continual LearningLarge Number of Tasks
Average Performance77.49
50
Continual LearningStandard CL Benchmark
BWT (Avg Order 1-3)80.28
38
Continual LearningLong Sequence Order 2
Average Performance (AP)80.65
20
Continual LearningSuperNI Order 2
AP46.97
20
Continual LearningLong Sequence (Order 1)
AP79.4
20
Continual LearningSuperNI (Order 1)
AP44.97
20
Continual LearningStandard CL Benchmark
BWT0.00e+0
11
Continual LearningSuperNI
FWT (O1)0.32
9
Continual LearningLong Sequence
FWT (Order 1)0.62
9
Continual LearningNegative Transfer Benchmark Seq2
Avg Accuracy82.47
8
Showing 10 of 17 rows

Other info

Code

Follow for update