Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Implicit In-context Learning

About

In-context Learning (ICL) empowers large language models (LLMs) to swiftly adapt to unseen tasks at inference-time by prefixing a few demonstration examples before queries. Despite its versatility, ICL incurs substantial computational and memory overheads compared to zero-shot learning and is sensitive to the selection and order of demonstration examples. In this work, we introduce Implicit In-context Learning (I2CL), an innovative paradigm that reduces the inference cost of ICL to that of zero-shot learning with minimal information loss. I2CL operates by first generating a condensed vector representation, namely a context vector, extracted from the demonstration examples. It then conducts an inference-time intervention through injecting a linear combination of the context vector and query activations back into the model's residual streams. Empirical evaluation on nine real-world tasks across three model architectures demonstrates that I2CL achieves few-shot level performance at zero-shot inference cost, and it exhibits robustness against variations in demonstration examples. Furthermore, I2CL facilitates a novel representation of task-ids, enhancing task similarity detection and fostering effective transfer learning. We also perform a comprehensive analysis and ablation study on I2CL, offering deeper insights into its internal mechanisms. Code is available at https://github.com/LzVv123456/I2CL.

Zhuowei Li, Zihao Xu, Ligong Han, Yunhe Gao, Song Wen, Di Liu, Hao Wang, Dimitris N. Metaxas• 2024

Related benchmarks

TaskDatasetResultRank
Subjectivity ClassificationSubj
Accuracy62.28
343
Text ClassificationTREC
Accuracy78.36
281
Multitask Language UnderstandingMMLU-Pro
Accuracy27.14
248
Text ClassificationMR
Accuracy92.4
174
Topic ClassificationDBpedia
Accuracy79
131
Text ClassificationSST-5
Accuracy34.92
119
Text ClassificationAGNews
Accuracy78.88
110
Natural Language InferenceaNLI
Accuracy28.01
107
Text ClassificationDBpedia
Accuracy79
104
ClassificationSST2
Accuracy90.8
102
Showing 10 of 20 rows

Other info

Follow for update