Mitigate Negative Transfer with Similarity Heuristic Lifelong Prompt Tuning
About
Lifelong prompt tuning has significantly advanced parameter-efficient lifelong learning, offering efficiency and minimal storage demands across diverse tasks. Our empirical studies, however, highlight certain transferability constraints in current methodologies: a universal algorithm that guarantees consistent positive transfer across all tasks is currently unattainable, especially when dealing with dissimilar tasks that may engender negative transfer. Identifying the misalignment between algorithm selection and task specificity as the primary cause of negative transfer, we present the Similarity Heuristic Lifelong Prompt Tuning (SHLPT) framework. This strategy partitions tasks into two distinct subsets by harnessing a learnable similarity metric, thereby facilitating fruitful transfer from tasks regardless of their similarity or dissimilarity. Additionally, SHLPT incorporates a parameter pool to combat catastrophic forgetting effectively. Our experiments show that SHLPT outperforms state-of-the-art techniques on lifelong learning benchmarks and demonstrates robustness against negative transfer across diverse task sequences.
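The core mechanism, partitioning previously learned task prompts into "similar" and "dissimilar" subsets via a learnable similarity metric, can be sketched roughly as below. The class name `SimilarityPartitioner`, the bilinear scoring form, and the fixed threshold are illustrative assumptions, not the exact design from the paper.

```python
import torch
import torch.nn as nn

class SimilarityPartitioner(nn.Module):
    """Sketch of a learnable similarity metric that splits a pool of
    previously learned task prompts into 'similar' and 'dissimilar'
    subsets relative to the current task's prompt. The bilinear form
    and threshold are hypothetical choices, not the paper's design."""

    def __init__(self, prompt_dim: int, threshold: float = 0.5):
        super().__init__()
        # Learnable bilinear map that scores a pair of prompts.
        self.metric = nn.Bilinear(prompt_dim, prompt_dim, 1)
        self.threshold = threshold

    def forward(self, current_prompt: torch.Tensor, prompt_pool: torch.Tensor):
        # current_prompt: (prompt_dim,); prompt_pool: (num_tasks, prompt_dim)
        expanded = current_prompt.unsqueeze(0).expand_as(prompt_pool)
        scores = torch.sigmoid(self.metric(expanded, prompt_pool)).squeeze(-1)
        similar_mask = scores >= self.threshold
        return scores, similar_mask


# Usage: partition frozen prompts stored for earlier tasks.
pool = torch.randn(5, 128)        # five stored task prompts
current = torch.randn(128)        # prompt being tuned for the new task
partitioner = SimilarityPartitioner(prompt_dim=128)
scores, mask = partitioner(current, pool)
similar_prompts = pool[mask]      # candidates for direct positive transfer
dissimilar_prompts = pool[~mask]  # routed to a separate transfer strategy
```

Keeping the pool of per-task prompts frozen and only selecting from it is also what lets such a scheme avoid catastrophic forgetting: earlier tasks' parameters are never overwritten.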
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Continual Learning | Large Number of Tasks | Average Performance | 77.49 | 50 |
| Continual Learning | Standard CL Benchmark | BWT (Avg Order 1-3) | 80.28 | 38 |
| Continual Learning | Negative Transfer Benchmark Seq2 | Avg Accuracy | 82.47 | 8 |
| Continual Learning | Negative Transfer Benchmark Seq3 | Average Accuracy | 70.16 | 8 |
| Continual Learning | Negative Transfer Benchmark Average | Average Accuracy | 78.67 | 8 |
| Continual Learning | Negative Transfer Benchmark Seq1 | Average Accuracy | 83.37 | 8 |
| Continual Learning | Standard CL Benchmark | BWT | 0.00e+0 | 7 |
| Continual Learning | Large Number of Tasks | BWT | 0.00e+0 | 7 |
| Continual Learning | Negative Transfer Benchmark | BWT | 0.00e+0 | 7 |