Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Contrastive Chain-of-Thought Prompting

About

Despite the success of chain of thought in enhancing language model reasoning, the underlying process remains less well understood. Although logically sound reasoning appears inherently crucial for chain of thought, prior studies surprisingly reveal minimal impact when using invalid demonstrations instead. Furthermore, the conventional chain of thought does not inform language models on what mistakes to avoid, which potentially leads to more errors. Hence, inspired by how humans can learn from both positive and negative examples, we propose contrastive chain of thought to enhance language model reasoning. Compared to the conventional chain of thought, our approach provides both valid and invalid reasoning demonstrations, to guide the model to reason step-by-step while reducing reasoning mistakes. To improve generalization, we introduce an automatic method to construct contrastive demonstrations. Our experiments on reasoning benchmarks demonstrate that contrastive chain of thought can serve as a general enhancement of chain-of-thought prompting.

Yew Ken Chia, Guizhen Chen, Luu Anh Tuan, Soujanya Poria, Lidong Bing• 2023

Related benchmarks

TaskDatasetResultRank
Commonsense ReasoningHellaSwag
Accuracy38.7
1891
Commonsense ReasoningPIQA
Accuracy65.9
751
Mathematical ReasoningGSM8K
Accuracy79
499
Commonsense ReasoningWinoGrande
Accuracy55.2
372
Commonsense ReasoningCSQA
Accuracy67
366
Commonsense ReasoningStrategyQA
Accuracy66.2
174
Mathematical ReasoningAQUA-RAT
Accuracy57.5
120
Commonsense ReasoningSIQA
Accuracy65
106
Mathematical ReasoningGSM-Hard
Accuracy39.4
46
Common Sense ReasoningNoRa Irrelevant Rationales
Accuracy50.2
40
Showing 10 of 22 rows

Other info

Follow for update