Feature-Adaptive and Data-Scalable In-Context Learning

About

In-context learning (ICL), which promotes inference with several demonstrations, has become a widespread paradigm to stimulate LLM capabilities for downstream tasks. Due to context length constraints, it cannot be further improved in spite of more training data, and general features directly from LLMs in ICL are not adaptive to the specific downstream task. In this paper, we propose a feature-adaptive and data-scalable in-context learning framework (FADS-ICL), which can leverage task-adaptive features to promote inference on the downstream task, with the supervision of beyond-context samples. Specifically, it first extracts general features of beyond-context samples via the LLM with ICL input form one by one, and introduces a task-specific modulator to perform feature refinement and prediction after fitting a specific downstream task. We conduct extensive experiments on FADS-ICL under varying data settings (4$\sim$128 shots) and LLM scale (0.8$\sim$70B) settings. Experimental results show that FADS-ICL consistently outperforms previous state-of-the-art methods by a significant margin under all settings, verifying the effectiveness and superiority of FADS-ICL. For example, under the 1.5B and 32 shots setting, FADS-ICL can achieve \textbf{+14.3} average accuracy from feature adaptation over vanilla ICL on 10 datasets, with \textbf{+6.2} average accuracy over the previous state-of-the-art method, and the performance can further improve with increasing training data. Code and data are publicly available at \url{https://github.com/jiahaozhenbang/FADS-ICL}.

Jiahao Li, Quan Wang, Licheng Zhang, Guoqing Jin, Zhendong Mao• 2024

Related benchmarks

Task	Dataset	Result
Natural Language Inference	RTE	Accuracy83.6	590
Subjectivity Classification	Subj	Accuracy96.4	343
Question Classification	TREC	Accuracy95.2	262
Topic Classification	AG-News	Accuracy90.5	225
Sentiment Analysis	SST-2	Accuracy95.7	165
Sentiment Analysis	MR	Accuracy0.938	160
Opinion Polarity Detection	MPQA	Accuracy91.1	158
Sentiment Analysis	CR	Accuracy96.4	141
Topic Classification	DBpedia	Accuracy99.1	131
Natural Language Inference	CB	Accuracy98.2	129

Showing 10 of 11 rows

Other info

Code

Follow for update

@wizwand_team Discord