Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Self-ICL: Zero-Shot In-Context Learning with Self-Generated Demonstrations

About

Large language models (LLMs) have exhibited striking in-context learning (ICL) ability to adapt to target tasks with a few input-output demonstrations. For better ICL, different methods are proposed to select representative demonstrations from existing training corpora. However, such settings are not aligned with real-world practices, as end-users usually query LMs without access to demonstration pools. In this work, we introduce Self-ICL -- a simple framework which bootstraps LMs' intrinsic capabilities to perform zero-shot ICL. Given a test input, Self-ICL first prompts the model to generate pseudo-inputs. Next, the model predicts pseudo-labels for the pseudo-inputs via zero-shot prompting. Finally, we perform ICL for the test input with the pseudo-input-label pairs as demonstrations. Evaluation on 23 BIG-Bench Hard tasks shows Self-ICL outperforms zero-shot baselines on both average accuracy and head-to-head comparison. Moreover, with zero-shot chain-of-thought, Self-ICL achieves results comparable to using real demonstrations. Additionally, we conduct a range of analyses to validate Self-ICL's effectiveness and provide insights for its behaviors under different settings.

Wei-Lin Chen, Cheng-Kuang Wu, Yun-Nung Chen, Hsin-Hsi Chen• 2023

Related benchmarks

TaskDatasetResultRank
Commonsense ReasoningWinoGrande
Accuracy82.6
1085
Multi-task Language UnderstandingMMLU
Accuracy78.8
876
Commonsense ReasoningSocialIQA
Accuracy74
116
Logical reasoningLogiQA-2
Accuracy65.1
34
Image ClassificationWikiArt
Top-1 Accuracy40.4
34
Logical reasoningHELP
Accuracy65.5
14
Causal Reasoninge-CARE
Accuracy82
14
Visual Question AnsweringVQA-AD
Accuracy41
13
Image CaptioningFlickr30K
BLEU-424.7
13
Visual Question AnsweringPMC-VQA
Accuracy48
13
Showing 10 of 12 rows

Other info

Follow for update