Implicit In-context Learning

About

In-context Learning (ICL) empowers large language models (LLMs) to swiftly adapt to unseen tasks at inference time by prefixing a few demonstration examples before queries. Despite its versatility, ICL incurs substantial computational and memory overhead compared to zero-shot learning and is sensitive to the selection and order of demonstration examples. In this work, we introduce Implicit In-context Learning (I2CL), an innovative paradigm that reduces the inference cost of ICL to that of zero-shot learning with minimal information loss. I2CL operates by first generating a condensed vector representation, namely a context vector, extracted from the demonstration examples. It then conducts an inference-time intervention by injecting a linear combination of the context vector and query activations back into the model's residual streams. Empirical evaluation on nine real-world tasks across three model architectures demonstrates that I2CL achieves few-shot level performance at zero-shot inference cost, and that it is robust to variations in demonstration examples. Furthermore, I2CL facilitates a novel representation of task-ids, enhancing task similarity detection and fostering effective transfer learning. We also perform a comprehensive analysis and ablation study on I2CL, offering deeper insights into its internal mechanisms. Code is available at https://github.com/LzVv123456/I2CL.
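
The two steps described in the abstract (condense demonstrations into a context vector, then inject a linear combination of that vector and the query activations) can be illustrated with a toy sketch. This is not the authors' implementation: averaging the demonstration activations, the function names `context_vector` and `inject`, and the scalar coefficients `lam` and `beta` are all assumptions made for illustration, and a real setup would operate on per-layer residual-stream activations of an LLM rather than on hand-written lists.

```python
from typing import List

Vector = List[float]


def context_vector(demo_activations: List[Vector]) -> Vector:
    """Condense per-demonstration activations into one vector by averaging.

    Averaging is an assumption of this sketch; the abstract only says the
    vector is "extracted from the demonstration examples".
    """
    if not demo_activations:
        raise ValueError("need at least one demonstration activation")
    dim = len(demo_activations[0])
    n = len(demo_activations)
    return [sum(act[i] for act in demo_activations) / n for i in range(dim)]


def inject(query_activation: Vector, ctx: Vector, lam: float, beta: float) -> Vector:
    """Return lam * query_activation + beta * ctx, a linear combination.

    lam and beta are hypothetical scalar coefficients chosen for illustration.
    """
    if len(query_activation) != len(ctx):
        raise ValueError("dimension mismatch")
    return [lam * q + beta * c for q, c in zip(query_activation, ctx)]


# Toy example: three demonstrations, 2-dimensional activations at one layer.
demos = [[1.0, 0.0], [3.0, 2.0], [2.0, 4.0]]
ctx = context_vector(demos)

# At inference time only the query is processed; the demonstrations enter
# solely through ctx, which is why the cost matches zero-shot inference.
steered = inject([1.0, 1.0], ctx, lam=1.0, beta=0.5)
```

Because the context vector is computed once and reused, no demonstration tokens need to be prefixed to the query at inference time.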

Zhuowei Li, Zihao Xu, Ligong Han, Yunhe Gao, Song Wen, Di Liu, Hao Wang, Dimitris N. Metaxas • 2024

Related benchmarks

Task	Dataset	Result	
Subjectivity Classification	Subj	Accuracy: 62.28	343
Text Classification	TREC	Accuracy: 78.36	311
Multitask Language Understanding	MMLU-Pro	Accuracy: 27.14	303
Sentiment Analysis	SST-5 (test)	Accuracy: 34.6	189
Sentiment Analysis	SST-2	Accuracy: 88.8	175
Text Classification	MR	Accuracy: 92.4	174
Text Classification	AGNews	Accuracy: 78.88	161
Sentiment Classification	MR (test)	Accuracy: 71.6	154
Question Classification	TREC (test)	Accuracy: 78.6	150
Topic Classification	DBpedia	Accuracy: 79	131

Showing 10 of 45 rows
